Can We Predict the Burden of Wasting in Crisis-Affected Countries? Findings from Somalia and South Sudan

doi:10.21203/rs.3.rs-1250806/v1

Download PDF

Research Article

Can We Predict the Burden of Wasting in Crisis-Affected Countries? Findings from Somalia and South Sudan

https://doi.org/10.21203/rs.3.rs-1250806/v1

This work is licensed under a CC BY 4.0 License

You are reading this latest preprint version

Background

Sample surveys are the mainstay of surveillance for wasting in settings affected by crises, but are burdensome and have limited geographical coverage due to insecurity and other access issues. As a possible complement to surveys, we explored a statistical approach to predict the prevalent burden of wasting malnutrition for small population strata in two crisis-affected countries, Somalia (2014-2018) and South Sudan (2015-2018).

Methods

For each country, we sourced datasets generated by humanitarian actors or other entities on insecurity, displacement, food insecurity, access to services, epidemic occurrence and other factors on the causal pathway to malnutrition. We merged these with datasets of sample household anthropometric surveys done at administrative level 3 (district, county) as part of nutritional surveillance, and, for each of several outcomes including binary and continuous indices based on either weight-for-height or middle-upper-arm circumference, fitted and evaluated the predictive performance of generalised linear models and, as an alternative, machine learning random forests.

Results

We developed models based on 85 ground surveys in Somalia and 175 in South Sudan. Livelihood type, measles incidence, vegetation index and water price were important predictors in Somalia, and livelihood, rainfall and terms of trade (purchasing power) in South Sudan. However, both generalised linear models and random forests had low performance for both binary and continuous anthropometric outcomes.

Conclusions

Predictive models had disappointing performance and are not usable for action. The range of data used and their quality probably limited our analysis. The predictive approach remains theoretically attractive, and deserves further evaluation with larger datasets across multiple settings.

Malnutrition

wasting

undernutrition

South Sudan

Somalia

food insecurity

crisis

humanitarian

prediction

statistical model

In settings affected by crises due to armed conflict, community violence, displacement and/or food insecurity, wasting malnutrition is a prominent public health threat that, at the individual level, presents a short-term mortality risk, exacerbates endemic and epidemic infectious diseases and worsens long-term developmental outcomes. Wasting prevalence among children is also a key summative indicator of crisis severity, as it reflects the wider situation of food security, livelihoods and the public health and social environment [1]. For the purpose of this paper, and in accordance with current Unicef guidance, we refer to wasting (also commonly known as acute malnutrition) as the occurrence of two partially overlapping presentations: marasmus, characterised by a recent and severe weight loss, and the rarer but more lethal oedematous form (kwashiorkor). Anthropometric indices including weight-for-height or -length, middle-upper arm circumference (MUAC) and presence of bilateral pitting oedema may be combined into continuous indicators (e.g. weight-for-height/length Z-score, relative to the mean of a well-nourished reference population: WHZ) or dichotomised based on thresholds to classify children as severely or moderately wasted, and, at the population level, compute prevalence estimates [2]. Such information helps to assess progress towards national and global targets, identify an appropriate package of food security and nutritional services, estimate resources needed (e.g. treatment caseload), monitor the performance of services and detect changes in crisis severity as part of early warning systems such as the integrated food security phase classification (IPC) [3–5].

Cross-sectional anthropometric surveys among children 6 to 59 months old (mo) are an important component of nutritional surveillance in crisis settings, along with facility-based and programmatic data [6]. Over the past decade, considerable progress has been made to standardise methods and analysis of these surveys. In particular, the Standardised Monitoring and Assessment of Relief and Transitions (SMART) project [7] provides generic study protocols and aides for survey design, training and quality control, as well as the bespoke Emergency Nutrition Software for sample selection, data entry and analysis. SMART surveys, usually implemented at a small geographic scale (e.g. districts or individual camps), are the most common population-based method to measure malnutrition burden in humanitarian response. However, SMART surveys are somewhat burdensome in terms of human and financial resources, require several weeks to plan, implement and report on, and may have limited geographic reach due to insecurity or other access constraints, thereby resulting in potentially biased, untimely, and/or insufficiently granular information. Otherwise put, surveys alone may not adequately support early detection of deteriorating situations and efficient resource allocation [8]. More recently, COVID-19 related restrictions temporarily curtailed SMART survey implementation, just as the pandemic was expected to contribute to a projected doubling in the global population facing food insecurity crisis conditions, and, consequently, a substantial increase in wasting burden [9].

To complement small-scale nutrition surveys and other surveillance data, and in order to reduce the burden of repeated surveys while also generating timely information on a more regular basis at operationally useful geographical resolution, we explored the performance of predictive statistical models of wasting burden in Somalia and South Sudan, two crisis-affected countries prominently affected by service access constraints, food insecurity and malnutrition.

Study design

We used a combination of existing datasets collected for programmatic purposes by humanitarian and government actors (see below) to develop and evaluate country-specific models to predict various anthropometric indicators at the resolution of one month and a single administrative level 2 unit (district in Somalia, county in South Sudan), hereafter referred to as a ‘stratum’.

Drawing from an a priori causal framework of factors leading to wasting (Additional file 1, Figure S5), we identified potential predictor variables collected at the desired resolution, and merged these with individual child-level data from SMART surveys designed to be representative of single strata. We fitted various candidate models to a training data subset, and evaluated their predictive accuracy on a validation data subset, as well as on cross-validation.

Study population and timeframe

For Somalia (including Somaliland and Puntland), we sourced predictor and anthropometric survey data from January 2014 to December 2018 inclusive. During this period, Somalia’s population rose from about 12.8M to 14.5M [10]. Surveys were done in 22 (29%) of Somalia’s 75 districts. For South Sudan, the analysis spanned January 2015 to April 2018, and featured surveys from 63 (80%) of the country’s 79 counties, as per 2013 administrative borders. South Sudan’s population declined from 10.2M to 9.7M during the period, reflecting refugee movements to neighbouring countries [11].

Data sources

Anthropometric surveys. We accessed reports and raw datasets of 177 SMART surveys from South Sudan (two were excluded due to very unusual values, leaving 175 analysis-eligible), and 167 from Somalia (82 were excluded: 76, mainly done before 2016, were representative of livelihood zones rather than districts, and thus could not be coupled with predictor data; five appeared to have followed a non-representative sampling design; one had no available dataset, leaving 85 analysis-eligible). For each survey, we inspected the report to identify any possible bias sources and, in particular, any reported restriction of the effective sampling frame due to insecurity or inaccessibility (e.g. if a report stated that two out of 12 boma, South Sudan’s administrative level 3 unit, could not be included in the sample, we approximated the sampling coverage as 2/12 ≈ 83%). We also rescaled the ENA software-reported quality score for the survey (a composite of several indicators including proportion of outlier values, digit preference and properties of the distribution of observed values, ranging from 0% = best to 50% = worst [12]) to a 0-100% range, where best = 100%. We reanalysed all surveys by converting the raw anthropometric readings (weight, height or length, age, middle-upper arm circumference or MUAC) into z-score indices as per the World Health Organization 2006 standardised anthropometric distributions using the anthro package in R, flagging and excluding all observations with missing values, <> 5 z-scores from the mean and/or outside the allowed age range (6-59mo). Lastly, we classified all children into severe wasting or wasting according to two alternative definitions: (i) bilateral oedema and/or weight-for-height (WHZ) <3Z (severe wasting) or <2Z (wasting); (ii) bilateral oedema and/or MUAC < 115mm (severe wasting) or < 125mm (wasting) [13]. We fitted generalised linear models (binomial for severe wasting and wasting, gaussian otherwise) with standard errors adjusted for cluster design to verify concordance with point estimates and 95% confidence intervals (CI) contained in the survey reports.

Predictors. We developed a causal framework of wasting (Additional file 1, Figure S5) based on existing evidence and plausibility reasoning. We used this framework to identify factors potentially predicting the outcomes of interest. We searched for candidate predictor data representing these factors online and through contacts with humanitarian actors in both Somalia and South Sudan, the main desirable characteristics of datasets being stratification by stratum and month, and that data be generated routinely for programmatic purposes, i.e. realistically available without further primary data collection. Most datasets had already been sourced as part of similar projects to retrospectively estimate mortality in both countries [10, 11]. Candidate predictors for both Somalia and South Sudan are detailed in Table 1 and Table 2, respectively. Each predictor dataset was subjected to data cleaning to remove obvious errors. We excluded predictors that were missing for ≥30% of strata or ≥30% of months. Remaining completeness problems were resolved through interpolation (humanitarian presence), manual imputation (missing market data points were attributed a weighted average of the geographically nearest market’s value and the mean of all other non-missing markets, with 0.7 and 0.3 weights respectively) and automatic imputation using the mice R package [14] (water price, severe wasting and wasting treatment quality). To reduce stochastic noise in the time series, we computed three-month window rolling means for all time-varying predictors, and applied moderate local spline smoothing to terms of trade or market price variables. Where appropriate, we computed per-population rates using stratum-month population figures previously estimated as part of mortality estimation projects for each country. Briefly, these combine available base estimates (census projections in South Sudan; quality-weighted averages of four alternative sources in Somalia), natural growth assumptions and data on refugee as well as internal displacement to and from each stratum, by month.

Table 1

Candidate predictor datasets, Somalia.
Predictor	Variable(s)	Domain	Time span of availability	Source(s)	Notes and assumptions
Administrative level	Administrative entity within Somalia	(various)	n/a (static variable)	n/a	Somaliland, Puntland, south-central Somalia
Rainfall	Total rainfall (mm)	Climate	2013 to 2018	Climate Engine (https://clim-engine-development.appspot.com/fewsNet) [15]
Rainfall	Mean of Standard Precipitation Index	Climate	2016 to 2018		Compares current rainfall with historical averages.
Vegetation density	Normalised Difference Vegetation Index	Climate	2013 to 2018	Food Security and Nutrition Analysis Unit - Somalia (FSNAU)
Incidence of armed conflict events	events per 100,000 population deaths per 100,000 population	Exposure to armed conflict / insecurity	2010 to 2018	Armed Conflict Location & Event Data Project (ACLED, https://www.acleddata.com/) [16]	Meta-data on individual armed conflict events based on extensive review of multi-language media sources and other public information.
Incidence of attacks against aid workers	deaths per 100,000 population injuries per 100,000 population	Exposure to armed conflict / insecurity	2010 to 2018	Aid Worker Security Database (AWSD, https://aidworkersecurity.org/incidents)	Data on various types of attacks to aid workers, capturing information from media sources, aid organisations and security actors.
Proportion of IDPs	proportion of IDPs among total district population	Forced displacement	2016 to 2018	Estimated by authors as part of a separate mortality study [10].
Main local livelihood type	Pastoral, agropastoral, riverine and urban	Food security and livelihoods	n/a (static variable)	FSNAU	Assumed to be constant over time.
Water price	Price of 200L drum of water in Somali Shillings	Food insecurity and livelihoods	2013 to 2018	FSNAU
Terms of trade purchasing power index	Kcal equivalent of local cereals that an average local-quality goat can be exchanged for	Food insecurity and livelihoods	2013 to 2018	Calculated by the authors based on FSNAU price data from 100 sentinel markets.	See Annex.
Terms of trade purchasing power index	Kcal equivalent of local cereals that can be purchased with an average daily labourer wage	Food insecurity and livelihoods	2013 to 2018		See Annex.
Incidence of admission to nutritional therapeutic services	cases of severe wasting admitted to treatment services per 100,000 population	Nutritional status	2011 to 2018	Nutrition Cluster, Somalia	Unpublished data.
Incidence of admission to nutritional therapeutic services	cases of wasting admitted to treatment services per 100,000 population	Nutritional status	2013 to 2018	Nutrition Cluster, Somalia	Unpublished data.
Cholera incidence	cases per 100,000 population	Disease burden (epidemic)	2013 to 2018	FSNAU	Suspected and confirmed cases.
Measles incidence	cases per 100,000 population	Disease burden (epidemic)	2013 to 2018	FSNAU	Suspected and confirmed cases.
Malaria incidence	cases per 100,000 population	Disease burden (endemic)	2013 to 2018	FSNAU	Suspected and confirmed cases.
Humanitarian actor presence	Ongoing humanitarian projects per 100,000 population (all sectors)	Humanitarian (public health) service functionality	2010 to 2018	United Nations Office for Coordination of Humanitarian Affairs	Proxy of intensity of humanitarian response. Unpublished data.
Humanitarian actor presence	Ongoing projects per 100,000 population (health, nutrition and water, hygiene and sanitation)	Humanitarian (public health) service functionality	2010 to 2018
Food security humanitarian services	Proportion of the population that are a beneficiary of any food security service	Humanitarian (public health) service coverage	Jan 2013 to Apr 2018	Food Security Cluster, Somalia	Unpublished data.
	Proportion of the population that are a beneficiary of cash-based food security services	Humanitarian (public health) service coverage
	Proportion of the population that are a beneficiary of food distributions	Humanitarian (public health) service coverage
Quality of severe wasting treatment	Proportion of severe wasting admissions that exit the treatment programme cured	Humanitarian (public health) service quality	2011 to 2018	Nutrition Cluster, Somalia	Unpublished data.

Table 2

Candidate predictor datasets, South Sudan.
Variable	Value(s)	Domain	Time span of availability	Source(s)	Notes and assumptions
Administrative level	Broad region within South Sudan	(various)	n/a (static variable)	n/a	northeast, northwest, southern
Rainfall	Difference between current rainfall and 10y historical average (mm)	Climate	2014 to 2018	United Nations World Food Programme Food Security Analysis data site ( http://dataviz.vam.wfp.org/seasonal_explorer/rainfall_vegetation/visualizations)
Incidence of armed conflict events	events per 100,000 population deaths per 100,000 population	Exposure to armed conflict / insecurity	2010 to 2018	Armed Conflict Location & Event Data Project (ACLE, https://www.acleddata.com/) [16]	Meta-data on individual armed conflict events based on extensive review of multi-language media sources and other public information.
Incidence of attacks against aid workers	deaths per 100,000 population injuries per 100,000 population	Exposure to armed conflict / insecurity	2010 to 2018	Aid Worker Security Database (AWSD, https://aidworkersecurity.org/incidents)	Data on various types of attacks to aid workers, capturing information from media sources, aid organisations and security actors.
Proportion of IDPs	proportion	Forced displacement	2012 to 2018	Estimated by authors as part of a separate mortality study [11].
Main local livelihood type	agriculturalist, agropastoral, pastoralist, displaced (Protection of Civilians camps only)	Food security and livelihoods	n/a (static variable)	Famine Early Warning Systems Network (FEWS NET) [17]	Assumed to be constant over time.
Terms of trade purchasing power index	Kg of white wheat flour that an average medium goat can be exchanged for	Food insecurity and livelihoods	2011 to 2018	CLiMIS portal (http://climis-southsudan.org/)
Food distributions	metric tonnes per 100,000 population	Food insecurity and livelihoods	2013 to 2018	United Nations World Food Programme	Unpublished data.
Incidence of admission to nutritional therapeutic services	cases of severe wasting admitted to treatment services per 100,000 population	Nutritional status	2015 to 2018	Nutrition Cluster, South Sudan	Unpublished data.
Incidence of admission to nutritional therapeutic services	cases of wasting admitted to treatment services per 100,000 population	Nutritional status	2015 to 2018	Nutrition Cluster, South Sudan	Unpublished data.
Cholera incidence	cases per 100,000 population	Disease burden (epidemic)	2012 to 2018	World Health Organization	Suspected and confirmed cases. No cases reported before 2014. Unpublished data.
Measles incidence	cases per 100,000 population	Disease burden (epidemic)	2012 to 2018	World Health Organization	Suspected and confirmed cases. Unpublished data.
Humanitarian actor presence	actors per 100,000 population (all sectors; health, nutrition and water, hygiene & sanitation; health only)	Humanitarian (public health) service functionality	2014 to 2018	United Nations Office for Coordination of Humanitarian Affairs	Proxy of intensity of humanitarian response. Unpublished data.
Acute flaccid paralysis incidence	cases per 100,000 population	Humanitarian (public health) service functionality	2012 to 2018	World Health Organization	Proxy of functionality of public health surveillance.
Uptake of measles routine vaccination	doses given per 100,000 population	Humanitarian (public health) service coverage	2012 to 2018	World Health Organization	Assume no value = no routine vaccination taking place.
Quality of severe wasting treatment	Proportion of severe wasting admissions that exit the treatment programme cured	Humanitarian (public health) service quality	2015 to 2018	Nutrition Cluster, Somalia	Unpublished data.
Quality of wasting treatment	Proportion of wasting admissions that exit the treatment programme cured	Humanitarian (public health) service quality	2015 to 2018	Nutrition Cluster, Somalia	Unpublished data.

While for both countries data on food security and nutritional therapeutic services were available (Table 1, Table 2) and moderately predictive (data not shown), we ultimately decided to exclude them as candidate predictors for two reasons: (i) we considered that improved prediction could plausibly result in better targeting of these humanitarian services, which in turn would result in improved nutrition, a reverse-causal effect whose future size the model might fail to predict; and (ii) we assumed that end-users would benefit from a model that could be used to predict malnutrition burden even where none of these services were available, e.g. due to access constraints.

Predictive models

We explored two prediction approaches, as follows.

Generalised linear modelling. We first split the data by period into a training set (consisting of approximately the chronologically first 70% of the data) and a ‘holdout’ (i.e. validation) set (the most recent 30%). For each anthropometric indicator, we fitted generalised linear models (GLM) to individual child observations in the training dataset, with robust standard errors to account for the cluster sampling design of most surveys, a quasi-binomial distribution for binary outcomes (severe wasting, wasting) and a gaussian distribution for continuous outcomes (WHZ, MUAC), which we did not transform as they were normally distributed. We specified model weights as the product of survey quality score and survey sample coverage.

After visual inspection, we categorised continuous predictors, and selected categorical versus continuous versions of these based on linearity of the association and the smallest-possible Chi-square (for binary outcomes) or F-test (continuous outcomes) p-value testing whether the univariate model provided better fit than a null model. We also used this p-value to select among candidate lags for each predictor; however, we modelled climate variables (rainfall, Normalised Difference Vegetation Index or NDVI) as either the means of the two trimesters, or the mean over the semester prior to each survey observation. We then fitted models consisting of all possible combinations of predictors, and shortlisted the best 10% based on predictive accuracy (lowest mean square error, MSE) of model predictions at stratum-month level, relative to observations in the holdout dataset. We manually selected the best fixed effects model among these based on relative accuracy on holdout data, accuracy on external data simulated through leave-one-out cross-validation (LOOCV) [18], the plausibility of observed associations, and model parsimony (while the latter characteristic is relatively unimportant for prediction, in practice we wished to avoid users of the model having to collect a large amount of predictor data). Lastly, we explored plausible two-way interactions.

We also fitted mixed models (with stratum as a random effect, given that in both countries surveys were repeated in many districts / counties). The latter, however, offered inconsistent accuracy advantages over fixed effects models on either cross-validation or holdout datasets. Furthermore, we assumed that end users would be most interested in predicting malnutrition prevalence in hard-to-survey districts / counties, i.e. where no a priori random effects would be estimable. For these reasons, we discarded mixed models altogether.

Machine learning. After splitting data as above, we used the ranger package [19] to grow random forest (RF) regression models on the training dataset, aggregated at stratum-month level: this approach makes minimal assumptions about data structure; briefly, it partitions the data according to various randomly generated ‘trees’, where each node is defined by a particular value of one of the predictor variables, with branches being the resulting split in the data; the ‘depth’ of each tree is defined by the number of variables that are used to create nodes; randomness is introduced by the choice of variables to build any given tree, values at which splits occur, and the order of variables in the tree structure. The distribution of the outcome arising from the partitions in each tree is compared to the observed data to determine accuracy. RF averages predictions across a large ensemble of trees. We grew RFs with 1000 trees, using all candidate predictors as above, and computed prediction CIs using a jack-knife estimator [20].

Performance evaluation

For both the GLM and RF approach, we present various metrics of predictive accuracy, for estimation: (i) effective coverage, defined here as the proportion of stratum-months for which the predicted point estimate fell within the 95% or 80%CIs of the observed data; (ii) relative bias, defined as \(\frac{1}{n}\sum _{i=1}^{i=n}\frac{{\widehat{y}}_{i}-{y}_{i}}{{y}_{i}}\), where \(n\) is the number of stratum-months, \({\widehat{y}}_{i}\) the prediction and \({y}_{i}\)the observation for stratum-month \(i\); and (iii) relative precision, namely the mean ratio of predicted stratum-month one-sided 95%CIs to point estimate; and for classification: (iv) sensitivity and (v) specificity of predictions against severe wasting or wasting prevalence thresholds commonly used in humanitarian response, and adopting observed point estimates as the gold standard. For brevity we present only best models for ‘now-casting’ (i.e. prediction of malnutrition based on data collected up to the present). We also explored models for forecasting malnutrition 3 months into the future (i.e. prediction based on data collected up to 3 months previously), but found that these had low performance (data not shown). All analysis was done using R software [21] through the RStudio [22] platform.

Anthropometric survey patterns

Details of eligible surveys from Somalia are reported in Table 3 and Figure 1. Most surveys were done in 2016 and 2018 and the majority relied on multi-stage cluster sampling, with a fairly constant sample size range over time. The highest severe wasting and wasting prevalence, but also the lowest quality scores, were noted in 2017, during a drought-triggered food insecurity crisis. In South Sudan all surveys relied on cluster sampling, and there was minimal change in average severe wasting and wasting prevalence over time; quality scores and the proportion of flagged observations suggested higher survey quality in South Sudan than in Somalia (Table 4, Figure 2).

Table 3

Characteristics of analysis-eligible anthropometric surveys from Somalia. Medians are reported unless noted. Numbers in parentheses indicate the interquartile range.
Characteristic	Overall	2013	2014	2015	2016	2017	2018
Eligible surveys (N)	85	3	4	2	25	6	45
Percentage using a cluster sampling design	85.9	100.0	75.0	100.0	80.0	100.0	86.7
Sample size	640 (265 to 1075)	534 (510 to 630)	668 (641 to 833)	683 (501 to 865)	636 (265 to 886)	915 (509 to 1018)	630 (420 to 1075)
Wasting prevalence (weight-for-height + oedema), %	17.4 (6.0 to 57.8)	14.4 (9.6 to 20.0)	12.8 (9.2 to 27.6)	13.6 (9.4 to 17.8)	18.4 (7.7 to 37.3)	27.1 (21.2 to 57.8)	16.8 (6.0 to 26.6)
Severe wasting prevalence (weight-for-height + oedema), %	3.3 (0.6 to 10.2)	3.1 (2.9 to 4.3)	2.0 (0.6 to 4.9)	3.1 (2.2 to 4.0)	4.0 (0.6 to 6.8)	8.0 (4.6 to 10.2)	3.1 (1.3 to 6.8)
Wasting prevalence (MUAC + oedema), %	8.2 (0.8 to 36.4)	9.1 (3.8 to 13.6)	3.3 (1.4 to 7.3)	6.1 (2.0 to 10.2)	8.0 (0.8 to 25.7)	22.0 (10.0 to 29.1)	8.3 (1.3 to 36.4)
Severe wasting prevalence (MUAC + oedema), %	1.1 (0.1 to 7.3)	2.2 (0.3 to 2.7)	0.6 (0.3 to 1.1)	1.6 (0.6 to 2.6)	1.3 (0.2 to 4.6)	3.0 (0.6 to 7.3)	1.1 (0.1 to 3.7)
Percentage of flagged observations	0.7 (0.0 to 4.8)	0.2 (0.2 to 1.0)	0.0 (0.0 to 2.4)	0.8 (0.2 to 1.4)	0.7 (0.0 to 3.0)	1.4 (1.1 to 2.6)	0.7 (0.0 to 4.8)

Table 4

Characteristics of analysis-eligible anthropometric surveys from South Sudan. Medians are reported unless noted. Numbers in parentheses indicate the interquartile range.
Characteristic	Overall	2015	2016	2017	2018
Eligible surveys (N)	175	55	57	52	11
Percentage using a cluster sampling design	100.0	100.0	100.0	100.0	100.0
Sample size	530 (207 to 949)	532 (251 to 790)	523 (325 to 881)	526 (207 to 949)	545 (466 to 768)
Wasting prevalence (weight-for-height + oedema), %	21.7 (5.6 to 55.1)	21.7 (6.2 to 50.8)	22.2 (5.6 to 53.0)	20.9 (8.1 to 55.1)	16.5 (6.3 to 34.5)
Severe wasting prevalence (weight-for-height + oedema), %	4.0 (0.4 to 13.6)	4.3 (0.4 to 11.8)	4.0 (1.0 to 12.4)	4.0 (0.6 to 13.6)	3.7 (0.9 to 7.6)
Wasting prevalence (MUAC + oedema), %	9.5 (0.8 to 35.8)	7.4 (0.8 to 29.0)	10.2 (2.4 to 24.3)	10.0 (3.7 to 35.8)	8.1 (2.9 to 30.5)
Severe wasting prevalence (MUAC + oedema), %	1.2 (0.0 to 7.9)	1.2 (0.0 to 5.1)	1.3 (0.2 to 7.9)	1.2 (0.2 to 7.8)	0.9 (0.0 to 3.0)
Percentage of flagged observations	0.4 (0.0 to 4.3)	0.5 (0.0 to 2.4)	0.6 (0.0 to 4.3)	0.4 (0.0 to 3.9)	0.3 (0.0 to 1.4)

[FIGURE 1 HERE]

Figure 1. Trends in key survey indicators, Somalia. Each dot represents the point estimate of a single survey. Box plots indicate the median and inter-quartile range, and whiskers the 95% percentile interval.

[FIGURE 2 HERE]

Figure 2. Trends in key survey indicators, South Sudan. Each dot represents the point estimate of a single survey. Box plots indicate the median and inter-quartile range, and whiskers the 95% percentile interval.

Performance of Somalia models

GLM model coefficients and performance metrics for Somalia are shown in Table 5: odds ratios, OR < 1 and linear coefficients > 0 indicate a protective effect, and vice versa. Four predictors consistently featured in the most predictive models: livelihood, measles occurrence over the previous trimester (a non-zero incidence was strongly associated with worse anthropometric status), NDVI over the previous semester and average market price of water in the second-to-last trimester (also negatively associated with anthropometric status). Theoretical predictor variables including intensity of armed conflict, terms of trade and rainfall intensity compared to the regional mean were inconsistently associated with the outcomes, and marginally useful for prediction. Generally, predictive performance was low: models yielded mostly upward-biased predictions that fell within the observed survey CIs for only 23–80% of stratum-months, depending on the outcome; while denominators were very small, only the model for severe wasting (WFH + oedema) reached a moderate combination of sensitivity and specificity to classify prevalence as per the 5% threshold. Graphs of predictions versus observations support this pattern; Figure 3 shows results for severe wasting (WFH + oedema), while remaining graphs are in the Additional file 1.

Table 5

Performance of predictive generalised linear models in Somalia for real-time estimation, by wasting outcome.
Statistic			Categorical outcomes								Continuous outcomes
Statistic			Severe wasting (WFH + oedema)		Wasting (WFH + oedema)		Severe wasting (MUAC + oedema)		Wasting (MUAC + oedema)		WFH	MUAC
Predictors: coefficient† on training data
Main local livelihood type
	agriculturalists		[ref.]								[ref.]
	displaced		0.63**								0.157***
	pastoralists		0.75								0.092
	urban		0.60								0.181*
Measles incidence rate, previous 3mths
	0		[ref.]		[ref.]		[ref.]		[ref.]		[ref.]	[ref.]
	> 0		1.40***		1.36***		1.47**		2.11***		-0.197***	-0.446***
Mean NDVI, previous 6mths
	< 0.20		[ref.]		[ref.]		[ref.]		[ref.]		[ref.]	[ref.]
	≥ 0.20		0.91		0.97		0.83		0.83**		-0.025	0.086**
Mean price of 200L water			3-5mths prior		3-5mths prior		1-3mths prior		1-3mths prior		3-5mths prior	1-3mths prior
	< 20,000 SOS		[ref.]		[ref.]		[ref.]		[ref.]		[ref.]	[ref.]
	≥ 20,000 SOS		1.34**		1.27***		1.41*		1.08		-0.131**	-0.120**
Estimation performance
Mean square error		training data	0.00025		0.00250		0.00016		0.00191		0.04426	0.07608
		LOOCV	0.00034		0.00293		0.00019		0.00222		0.05769	0.08977
		holdout data	0.00014		0.00230		0.00007		0.00381		0.06665	0.16096
Relative bias		LOOCV	+30.0%		-5.5%		+85.3%		+37.0%		+10.6%	-10.0%
Relative bias		holdout data	+32.8%		+35.4%		+131.0%		+98.2%		+28.7%	-80.0%
Relative precision of 95%CI		LOOCV	±16.0%		±7.6%		±19.3%		±9.4%		±3.9%	±20.0%
Relative precision of 95%CI		holdout data	±16.0%		±6.2%		±18.4%		±8.7%		±4.1%	±20.0%
Coverage of 95%CI		LOOCV	65.2%		59.6%		66.0%		40.4%		52.2%	36.2%
Coverage of 95%CI		holdout data	80.0%		46.7%		66.7%		26.7%		33.3%	30.0%
Coverage of 80%CI		LOOCV	50.0%		40.4%		44.7%		29.8%		41.3%	29.8%
Coverage of 80%CI		holdout data	60.0%		33.3%		43.3%		23.3%		23.3%	16.7%
Classification performance by severe wasting/wasting prevalence threshold (n = denominator of percentage)
Sensitivity, lower threshold		LOOCV	≥2%	100.0% (46)	≥15%	84.8% (33)	≥2%	42.1% (19)	≥15%	0.0% (9)	n/a
Sensitivity, lower threshold		holdout data	≥2%	100.0% (36)	≥15%	88.9% (18)	≥2%	25.0% (4)	≥15%	0.0% (4)
Sensitivity, upper threshold		LOOCV	≥5%	40.0% (10)	≥20%	0.0% (18)	≥5%	0.0% (1)	≥20%	0.0% (5)
Sensitivity, upper threshold		holdout data	≥5%	66.7% (3)	≥20%	57.2% (7)	≥5%	n/a (0)	≥20%	0.0% (3)
Specificity, lower threshold		LOOCV	<2%	0.0% (6)	<15%	35.7% (14)	<2%	82.1% (28)	<15%	100.0% (38)
Specificity, lower threshold		holdout data	<2%	0.0% (4)	<15%	25.0% (12)	<2%	76.9% (26)	<15%	100.0% (26)
Specificity, upper threshold		LOOCV	<5%	91.7% (36)	<20%	100.0% (29)	<5%	100.0% (46)	<20%	100.0% (42)
Specificity, upper threshold		holdout data	<5%	100.0% (27)	<20%	73.9% (23)	<5%	100.0% (30)	<20%	100.0% (27)
† Odds ratio for categorical outcomes; linear coefficient for continuous outcomes. * 0.01 ≤ p-value < 0.05 0.001 ≤ p-value < 0.01 * p-value < 0.001

[FIGURE 3 HERE]

Figure 3. GLM-predicted versus observed severe wasting (WFH + oedema) prevalence, Somalia, by district-month, on training data, LOOCV and holdout data. Shaded channels indicate different absolute deviance of predictions. Vertical dotted lines denote commonly used severe wasting prevalence thresholds.

RF models had similar performance to the GLM approach. For wasting (WFH + oedema: binary outcome), relative bias, relative precision and 95%CI coverage were -5.7% and +15.6%, ±21.8% and ±16.8%, and 48.9% and 63.3% on LOOCV and holdout data, respectively, with a sensitivity and specificity of 66.7% and 50.0% for the 15% prevalence threshold. The most important variables for prediction were measles incidence, NDVI, terms of trade and water price (Additional file 1). For WFH (continuous outcome), relative bias, relative precision and 95%CI coverage were +7.6% and +29.9%, ±18.2% and ±12.4%, and 57.4% and 33.3% on LOOCV and holdout data, respectively (Additional file 1).

Performance of South Sudan models

Table 6 shows GLM predictions for South Sudan. Here, the most significant associations were with livelihood type, total rainfall and terms of trade. Predictive performance was also low (Figure 4), with bias as high as +46.8% for severe wasting (WFH + oedema) prevalence, coverage no better than 75% across all outcomes and no instance of high sensitivity and specificity for classification.

Table 6

Performance of predictive generalised linear models in South Sudan, by wasting outcome.
Statistic			Categorical outcomes								Continuous outcomes
Statistic			Severe wasting (WFH + oedema)		Wasting (WFH + oedema)		Severe wasting (MUAC + oedema)		Wasting (MUAC + oedema)		WFH	MUAC
Predictors: coefficient† on training data
Incidence of acute flaccid paralysis, previous 3mths
	0 per 100,000		[ref.]				[ref.]		[ref.]			[ref.]
	0.01 to 0.49 per 100,000		0.90				0.80*		1.00			0.08**
	≥ 0.50 per 100,000		1.12				1.13		1.13*			-0.01
Main local livelihood type
	agriculturalists		[ref.]		[ref.]						[ref.]	[ref.]
	agro-pastoralists		1.16		1.46***						-0.30***	-0.21***
	displaced		1.29*		1.44***						-0.36***	-0.13**
	pastoralists		0.95		1.02						-0.15***	-0.11*
Total rainfall, previous 6mths
	< 50mm		[ref.]		[ref.]		[ref.]		[ref.]		[ref.]	[ref.]
	50 to 99mm		0.99		0.99		1.14		1.00		-0.02	-0.04
	100 to 149mm		0.77***		0.80***		1.06		0.85**		0.11***	0.13***
	≥ 150mm		0.82		0.84***		1.16		0.96		0.00	-0.16
Terms of trade (flour-goat exchange), previous 3-5mths
	< 20.0Kg		[ref.]		[ref.]		[ref.]		[ref.]		[ref.]	[ref.]
	20.0 to 29.9Kg		0.85*		0.80***		0.59***		0.73***		0.15***	0.14***
	30.0 to 39.9Kg		0.73***		0.84***		0.81*		0.92		0.10***	0.06**
	≥ 40.0 Kg		0.81**		1.00		1.16		1.17		0.02	0.04
Incidence of measles, previous 2-4mths
	0				[ref.]		[ref.]				[ref.]	[ref.]
	> 0				1.06		1.33*				-0.04	-0.05
Doses of measles vaccine administered, previous 3-5mths
	0 per 100,000						[ref.]
	0.1 to 99.9 per 100,000						1.05
	100.0 to 199.9 per 100,000						1.03
	200.0 to 299.9 per 100,000						1.40*
	≥ 300.0 per 100,000						1.04
Estimation performance
Mean square error		training data	0.00044		0.00297		0.00016		0.00180		0.05358	0.10190
		LOOCV	0.00058		0.00368		0.00025		0.00223		0.06670	0.14614
		holdout data	0.00037		0.00342		0.00009		0.001198		0.06680	0.10010
Relative bias		LOOCV	+38.7%		-5.9%		+80.9%		+26.7%		+11.0%	+0.1%
Relative bias		holdout data	+46.8%		+17.8%		+88.4%		+9.1%		+16.0%	+0.1%
Relative precision of 95%CI		LOOCV	±15.8%		±8.4%		±27.2%		±9.8%		±3.0%	±0.3%
Relative precision of 95%CI		holdout data	±14.2%		±7.4%		±25.8%		±9.1%		±3.3%	±0.3%
Coverage of 95%CI		LOOCV	60.0%		39.2%		65.8%		57.0%		26.5%	32.4%
Coverage of 95%CI		holdout data	71.4%		58.9%		75.0%		51.8%		21.4%	41.1%
Coverage of 80%CI		LOOCV	35.7%		27.5%		46.5%		36.8%		26.5%	22.5%
Coverage of 80%CI		holdout data	50.0%		33.9%		58.9%		41.1%		21.4%	30.4%
Classification performance by severe wasting/wasting prevalence threshold (n = denominator of percentage)
Sensitivity, lower threshold		LOOCV	≥2%	99.9% (98)	≥15%	92.5% (80)	≥2%	31.2% (32)	≥15%	0.0% (18)	n/a
Sensitivity, lower threshold		holdout data	≥2%	100.0% (48)	≥15%	97.6% (42)	≥2%	33.3% (12)	≥15%	0.0% (7)
Sensitivity, upper threshold		LOOCV	≥5%	29.8% (47)	≥20%	51.5% (66)	≥5%	0.0% (4)	≥20%	0.0% (8)
Sensitivity, upper threshold		holdout data	≥5%	41.2% (17)	≥20%	67.9% (28)	≥5%	n/a (0)	≥20%	0.0% (3)
Specificity, lower threshold		LOOCV	<2%	0.0% (17)	<15%	40.9% (22)	<2%	74.4% (82)	<15%	100.0% (96)
Specificity, lower threshold		holdout data	<2%	0.0% (8)	<15%	21.4% (14)	<2%	86.4% (44)	<15%	100.0% (49)
Specificity, upper threshold		LOOCV	<5%	73.5% (68)	<20%	72.2% (36)	<5%	99.1% (110)	<20%	100.0% (106)
Specificity, upper threshold		holdout data	<5%	87.2% (39)	<20%	39.3% (28)	<5%	100.0% (56)	<20%	100.0% (53)
† Odds ratio for categorical outcomes; linear coefficient for continuous outcomes. * 0.01 ≤ p-value < 0.05 0.001 ≤ p-value < 0.01 * p-value < 0.001

[FIGURE 4 HERE]

Figure 4. GLM-predicted versus observed severe wasting (WFH + oedema) prevalence, South Sudan, by district-month, on training data, LOOCV and holdout data. Shaded channels indicate different absolute deviance of predictions. Vertical dotted lines denote commonly used severe wasting prevalence thresholds.

RF models had far better fit to the training data than GLMs, but performed similarly on cross-validation and holdout data. The most important variables were livelihood, terms of trade, uptake of measles vaccination and intensity of insecurity (Additional file 1).

In this study we combined a range of previously collected, anthropometric household survey data with a range of potential population-level predictor datasets quantifying theoretical factors causally associated with wasting burden in crisis settings, to explore whether key quantities such as severe wasting or wasting prevalence could be estimated through prediction, as a complement to ground surveys. Resulting predictive models based on either GLM or machine learning approaches had disappointing performance in both Somalia and South Sudan across several anthropometric outcomes. Generally, predictive accuracy was better for outcomes based on WFH than on MUAC, but even for the former our models would not, in our opinion, provide actionable information.

Models to predict wasting risk at the individual or household level exist [23, 24]. While we did not search the literature systematically due to insufficient resources, we are aware of only two other population-level predictive studies. Osgood-Zimmerman et al. [25] produced gridded maps of various anthropometric indicators for all of Sub-Saharan Africa based on periodic countrywide surveys (e.g. Demographic and Health Surveys) and > 20 geospatial remotely sensed or previously estimated predictors; Mude et al. [26] predicted with reasonable accuracy MUAC across time and space in northern Kenya based on village-level data collected for food security surveillance by the Arid Lands Resource Management Project, with predictors including the characteristics of observed MUAC data themselves, cattle herd dynamics, extent of food aid, climate and season. At least one further research project is ongoing (https://www.actionagainsthunger.org/meriam). Bosco et al. [27] have used geospatial and remotely sensed covariates to map stunting prevalence, while Lentz et al. [28] have also demonstrated the potential of a GLM-based approach for predicting food insecurity in Malawi. We have previously used the same datasets as in this study to develop reasonably predictive models of population-level death rate (a farther-downstream and thus potentially even more multifactorial outcome), albeit only for retrospective estimation [10, 11].

Given the above, we expected better predictive performance. It is plausible that additional data on factors causally associated with wasting, including infant and young child feeding practices, use of food security coping strategies, dietary diversity, access to water, sanitation and hygiene services and health service utilisation would have improved prediction: these data are sometimes generated in crisis settings through cross-sectional surveys, but to our knowledge are not typically available at the granular level required for our predictive problem. It is also likely that problems with available data quality constrained model accuracy. Non-differential error or misclassification arising from measurement problems (e.g. imprecise child anthropometric measurements) and data entry errors would generally reduce model goodness-of-fit and bias estimated associations towards the null: observed-versus-predicted graphs generally suggest ‘regression dilution’ [29], a phenomenon whereby predictions align around an underestimated linear slope, consistent with high noise in predictor variables. Differential error may also have affected model accuracy in various ways. For example, the predictive value of certain variables would have been dampened if anthropometric surveys had systematically underestimated wasting in the very locations where those predictors exhibited their most extreme values, as might be plausible for surveys done in very remote, insecure locations and thus constrained by time, local staff competency or the need to exclude unreachable communities from the effective sampling frame. We attempted to mitigate such bias by down-weighting lower-quality surveys with evidence of sampling frame selection bias, but models without this weight were not substantively different (data not shown). Pragmatically, these data quality limitations illustrate the challenges of prediction based on data not collected for research.

Our study aim was not to explore associations: as such, we focussed on accuracy and, for example, ignored significant effect modifications that did not improve prediction. Observed GLM associations and variable importance metrics for RF are nonetheless informative. Measles incidence and rainfall or NDVI had plausible associations with most outcomes in both countries, while water price had a very strong association in Somalia. Terms of trade, however, were only important in South Sudan. We did not see strong associations with forced displacement or armed conflict intensity, as documented elsewhere [30], and, critically, rainfall abnormalities (as opposed to total precipitation) were not an important predictor in any model. A recent review of 90 studies concludes that wasting is understudied relative to stunting; the review also finds that, while adequate rainfall during the growing season has been associated with less wasting, relationships with drought and armed conflict are inconclusive [31]. Indeed, the interplay of unusual climate events and armed conflict has proved challenging for food security prediction [32]. More generally, our and others’ findings underscore the context-specific complexity of causal pathways leading to wasting. They may also reflect the relative noisiness of different datasets, i.e. their accuracy.

Aside from data limitations, our analysis does not thoroughly explore available predictive methods. Among GLM-based approaches, it is possible that different transformations of outcomes or predictors, as well as methods to identify the most informative variables, such as lasso regression, could have yielded improved performance. Among machine learning methods, boosted regression trees could have reduced bias. We note however that these methods would need to yield very considerable improvements over those we used in order to produce useful predictions.

This analysis suggests that predictive modelling for wasting burden in crisis settings may not be an immediately viable alternative to ground surveys, at least in the countries studied. Given the potential benefit of such an approach [5], we nonetheless recommend further study, possibly in other settings, using larger datasets and more advanced machine learning methods (boosted regression trees, support vectors, neural networks) and/or Bayesian frameworks. To facilitate such research, as well as other publicly beneficial analyses, humanitarian actors should systematically make key datasets, including but not limited to anthropometric surveys, publicly available in curated, accessible form [33]. These include, but are not limited to, service data from different sectors (e.g. outpatient consultations; vaccination coverage; anthropometric screening data among outpatient children and pregnant women; admissions and exit outcomes for management of wasting; water availability and quality; coverage of excreta disposal; food security service beneficiaries and Kcal equivalents); market data (e.g. staple prices); morbidity and mortality surveillance data; cross-sectional surveys measuring food security, dietary diversity and infant and young child feeding practices; protection assessments; surveys of perceptions of affected populations; humanitarian presence and activity who-does-what-where matrices; and alternative data on insecurity (e.g. incidents monitored by the UN country team) or humanitarian access (e.g. road safety). A simple principle could be to publish all data barring any whose public availability could place humanitarian actors or affected people at unacceptable risk; aggregation and anonymisation may mitigate such risks. Lastly, any studies to date to predict population-level nutrition burden should be synthesised to identify actionable evidence and guide further analysis.

ACLED Armed Conflict Location & Event Data Project

AWSD Aid Worker Security Database

CI confidence interval

ENA Emergency Nutritional Assessment software

FEWS NET Famine Early Warning Systems Network

FSNAU Food Security and Nutrition Analysis Unit – Somalia

GLM generalised linear model

IPC Integrated Food Security Phase Classification

LOOCV leave-one-out cross-validation

mo months old

MUAC middle-upper arm circumference

MSE mean square error

NDVI Normalised Difference Vegetation Index

RF random forest

SMART Standardised Monitoring and Assessment of Relief and Transitions

UN United Nations

WHZ weight-for-height Z score

Ethics approval and consent to participate

All data were previously collected for routine humanitarian response and/or public health service provision purposes, and were either in the public domain or shared in fully anonymised format. Caregivers of anthropometric survey participants provided verbal consent as per SMART protocol standard procedures. The study was approved by the Ethics Committee of the London School of Hygiene & Tropical Medicine (ref. 15334/RR/14437) and the Somali Ministry of Health and Human Services’ Research and Ethics Committee (ref. MOH&HS/DGO/0701/May/2019). No reply was received to a related application to the ethics committee of South Sudan’s Ministry of Health. All analyses were performed in accordance with relevant guidelines and regulations, including the Declaration of Helsinki.

Consent for publication

Not applicable.

Availability of data and materials

The data that support the findings of this study are available from various United Nations and non-governmental agencies, but restrictions apply to the availability of these data, which were used under license for the current study, and so are not all publicly available. Data are however available from the authors upon reasonable request and with permission of the above agencies. We have uploaded curated R scripts and all Somalia data on https://github.com/francescochecchi/acute_malnutrition_predictive_models.

Competing interests

The authors declare that they have no competing interests.

Funding

The analysis was funded by the United Nations Children’s Fund. SF, AW and FC were also partly funded by UK Research and Innovation as part of the Global Challenges Research Fund, grant number ES/P010873/1. Collection of predictor data was funded by the United States Institute of Peace (South Sudan) and the UK Foreign, Commonwealth and Development Office (FCDO; formerly Department for International Development) through the Research for Evidence Division (RED) for the benefit of developing countries (Somalia). The views expressed and information contained in his paper are solely those of the authors and are not necessarily those of or endorsed by UNICEF, FCDO, and USIP, none of which can accept no responsibility for such views or information or for any reliance placed on them.

Authors' contributions

FC designed the methods, managed data, did statistical analysis and wrote this paper. SF collected and managed data and wrote this paper. MN helped design the study and coordinated data collection. All other authors collected and managed data. All authors reviewed, edited and approved the final manuscript.

Acknowledgments

We are grateful to Anna Carnegie for project management support, and to Adrienne Testa for collection of South Sudan datasets used for a separate study. In both countries, we express gratitude to governmental agencies and partners who collected and/or contributed data used for this analysis.

Disclaimer

Geographical names and boundaries presented in this report are used solely for the purpose of producing scientific estimates, and do not necessarily represent the views or official positions of the authors, the London School of Hygiene and Tropical Medicine, any of the agencies that have supplied data for this analysis, or the donor. The authors are solely responsible for the analyses presented here, and acknowledgment of data sources does not imply that the agencies or individuals providing data endorse the results of the analysis.

Young H, Borrel A, Holland D, Salama P. Public nutrition in complex emergencies. Lancet. 2004;364:1899–909.
Young H, Jaspars S. The meaning and measurement of acute malnutrition in emergencies. Humanitarian Practice Network. 2006;44:1–60.
Tuffrey V. A perspective on the development and sustainability of nutrition surveillance in low-income countries. BMC Nutrition. 2016;2.
Tuffrey V, Hall A. Methods of nutrition surveillance in low-income countries. Emerging themes in epidemiology. 2016;13:4.
Maxwell D, Hailey P. Towards Anticipatory Information Systems and Action. Tufts - Feinstein International Center.
Checchi F, Warsame A, Treacy-Wong V, Polonsky J, van Ommeren M, Prudhon C. Public health information in crisis-affected populations: a review of methods and their use for advocacy and action. Lancet. 2017;390:2297–313.
Standardised Monitoring and Assessment of Relief and Transitions (SMART). Measuring Mortality, Nutritional Status, and Food Security in Crisis Situations: SMART Methodology. https://smartmethodology.org/. Accessed 14 Feb 2021.
Maxwell D, Hailey P, Spainhour Baker L, Kim JJ. Constraints and Complexities of Information and Analysis in Humanitarian Emergencies: Evidence from Yemen. Feinstein International Center, Tufts University and Centre for Humanitarian Change; 2019.
Global Report on Food Crises 2021. Global Network against Food Crises, Food Security Information Network; 2021.
Warsame A, Frison, Severine, Gimma A, Checchi F. Retrospective estimation of mortality in Somalia, 2014-2018: a statistical analysis - Somalia. ReliefWeb. 2020. https://reliefweb.int/report/somalia/retrospective-estimation-mortality-somalia-2014-2018-statistical-analysis. Accessed 11 Jan 2021.
Checchi F, Testa, Adrienne, Warsame, Abdihamid, Quach, Le, Burns, Rachel. Estimates of crisis-attributable mortality in South Sudan, December 2013- April 2018: A statistical analysis - South Sudan. ReliefWeb. 2018. https://reliefweb.int/report/south-sudan/estimates-crisis-attributable-mortality-south-sudan-december-2013-april-2018. Accessed 11 Jan 2021.
Erhardt J. Emergency Nutrition Assessment (ENA) Software for SMART. 2020.
Frison S, Kerac M, Checchi F, Prudhon C. Anthropometric indices and measures to assess change in the nutritional status of a population: a systematic literature review. BMC nutrition. 2016;2:76.
Buuren S van, Groothuis-Oudshoorn K. mice: Multivariate Imputation by Chained Equations in R. Journal of Statistical Software. 2011;45:1–67.
Funk C, Peterson P, Landsfeld M, Pedreros D, Verdin J, Shukla S, et al. The climate hazards infrared precipitation with stations—a new environmental record for monitoring extremes. Scientific Data. 2015;2:150066.
Raleigh C, Linke A, Hegre H, Karlsen J. Introducing ACLED: An Armed Conflict Location and Event Dataset: Special Data Feature. Journal of Peace Research. 2010;47:651–60.
South Sudan Livelihood Zones and Descriptions. Washington, DC: Famine Early Warning Systems Network.
Molinaro AM, Simon R, Pfeiffer RM. Prediction error estimation: a comparison of resampling methods. Bioinformatics. 2005;21:3301–7.
Wright MN, Ziegler A. <background-color:#CCCCFF;bvertical-align:super;>ranger</background-color:#CCCCFF;bvertical-align:super;>: A Fast Implementation of Random Forests for High Dimensional Data in C++ and R. J Stat Soft. 2017;77.
Wager S, Hastie T, Efron B. Confidence Intervals for Random Forests: The Jackknife and the Infinitesimal Jackknife. J Mach Learn Res. 2014;15:1625–51.
R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2020.
RStudio Team. RStudio: Integrated Development Environment for R. Boston, MA: RStudio, PBC.; 2020.
Mukuku O, Mutombo AM, Kamona LK, Lubala TK, Mawaw PM, Aloni MN, et al. Predictive Model for the Risk of Severe Acute Malnutrition in Children. Journal of Nutrition and Metabolism. 2019;2019:1–7.
Islam MM, Alam M, Tariquzaman M, Kabir MA, Pervin R, Begum M, et al. Predictors of the number of under-five malnourished children in Bangladesh: application of the generalized poisson regression model. BMC Public Health. 2013;13:11.
Osgood-Zimmerman A, Millear AI, Stubbs RW, Shields C, Pickering BV, Earl L, et al. Mapping child growth failure in Africa between 2000 and 2015. Nature. 2018;555:41–7.
Mude AG, Barrett CB, McPeak JG, Kaitho R, Kristjanson P. Empirical forecasting of slow-onset disasters for improved emergency response: An application to Kenya’s arid north. Food Policy. 2009;34:329–39.
Bosco C, Alegana V, Bird T, Pezzulo C, Bengtsson L, Sorichetta A, et al. Exploring the high-resolution mapping of gender-disaggregated development indicators. J R Soc Interface. 2017;14:20160825.
Lentz EC, Michelson H, Baylis K, Zhou Y. A data-driven approach improves food insecurity crisis prediction. World Development. 2019;122:399–409.
Hutcheon JA, Chiolero A, Hanley JA. Random measurement error and regression dilution bias. BMJ. 2010;340 jun23 2:c2289–c2289.
Iacoella F, Tirivayi N. Child nutrition during conflict and displacement: evidence from areas affected by the Boko Haram insurgency in Nigeria. Public Health. 2020;183:132–7.
Brown ME, Backer D, Billing T, White P, Grace K, Doocy S, et al. Empirical studies of factors associated with child malnutrition: highlighting the evidence about climate and conflict shocks. Food Sec. 2020;12:1241–52.
Krishnamurthy PK, Choularton RJ, Kareiva P. Dealing with uncertainty in famine predictions: How complex events affect food security early warning skill in the Greater Horn of Africa. Global Food Security. 2020;26:100374.
Maxwell D, Gottlieb G, Coates J, Radday A, Kim J, Venkat A, et al. Humanitarian Information Systems: Anticipating, Analyzing, and Acting in Crisis. Tufts - Feinstein International Center. https://fic.tufts.edu/research-item/the-constraints-and-complexities-of-information-and-analysis/. Accessed 14 Feb 2021.

No competing interests reported.

malnutpredpaperadditional1.docx

Download PDF

Editorial decision: Major revision
14 Feb, 2022
Reviews received at journal
10 Feb, 2022
Reviews received at journal
10 Feb, 2022
Reviewers agreed at journal
31 Jan, 2022
Reviewers agreed at journal
31 Jan, 2022
Reviewers invited by journal
23 Jan, 2022
Editor assigned by journal
20 Jan, 2022
Editor invited by journal
19 Jan, 2022
Submission checks completed at journal
19 Jan, 2022
First submitted to journal
11 Jan, 2022

You are reading this latest preprint version

Can We Predict the Burden of Wasting in Crisis-Affected Countries? Findings from Somalia and South Sudan

Status:

Version 1

Abstract

Figures

Background

Methods

Results

[FIGURE 1 HERE]

[FIGURE 2 HERE]

[FIGURE 3 HERE]

[FIGURE 4 HERE]

Discussion

Conclusions

Abbreviations

Declarations

Ethics approval and consent to participate

Consent for publication

Availability of data and materials

Competing interests

Funding

Authors' contributions

Acknowledgments

Disclaimer

References

Additional Declarations

Supplementary Files

Status:

Version 1