Comparing Forecasting Models for Predicting Infant Mortality: VECM vs. VAR and BVAR Specifications
Epidemiology Public HealthReceived 03 Sep 2025 Accepted 06 Dec 2025 Published online 08 Dec 2025
ISSN: 2995-8067 | Quick Google Scholar
END
Submit Article
Previous Full Text
Accessible H2O on Mars – A Critical Review of Current Knowledge
Received 03 Sep 2025 Accepted 06 Dec 2025 Published online 08 Dec 2025
This study investigates infant mortality in Uganda, a persistent public health challenge in many developing countries where economic disparities limit access to healthcare. It compares the forecasting performance of three econometric models; Vector Error Correction Model (VECM), Vector Autoregressive (VAR), and Bayesian VAR (BVAR) using annual data on infant mortality rates (IMR), neonatal mortality rates (NMR), GDP, and GDP per capita (GDPP) from 1954 to 2016. Model accuracy was evaluated using Mean Squared Error (MSE), Root Mean Squared Error (RMSE), and Theil’s U-statistic.
The results show strong long-term relationships among IMR, NMR, GDP, and GDPP. VECM provides the most reliable long-term forecasts, with an adjusted R-squared of 97.7%. Impulse response analysis indicates that GDP increases IMR in the short run, while GDPP exerts a stronger long-term reducing effect. For NMR, GDP has a negative impact, whereas GDPP shows a gradual positive response over time. Granger causality tests reveal bidirectional causality between GDPP and IMR, and a unidirectional influence of IMR on GDP.
Uganda’s IMR is projected to fall to about 17 deaths per 1,000 live births by 2035, though NMR declines will slow. Policymakers should use VECM for long-term planning due to its superior accuracy and the strong cointegration among IMR, NMR, GDP, and GDPP, while VAR/BVAR can guide short-term monitoring. Because GDPP most strongly reduces mortality, welfare-enhancing strategies such as social protection and employment are crucial. GDP gains should fund maternal and neonatal health, and systems must be strengthened to withstand economic shocks.
Infant mortality remains a critical global health challenge, particularly in developing regions such as Sub-Saharan Africa, where millions of children continue to die before their fifth birthday despite decades of progress. Infant mortality (IMR) and neonatal mortality (NMR) account for a large proportion of these deaths, which are often due to preventable or treatable causes such as neonatal encephalopathy, infections, complications from preterm birth, and malnutrition. In 2019 alone, over 5 million children under five died, with neonatal deaths declining more slowly than post-neonatal deaths [1-3]. In countries like Uganda, IMR and NMR remain high despite interventions aimed at improving healthcare access, such as mosquito nets, oral rehydration therapy, and maternal health programs. Key determinants of infant health include maternal education, per capita income, health expenditure, and environmental conditions [4]. The impact of macroeconomic factors on infant mortality is also evident, as studies show GDP fluctuations and economic growth influence maternal and child health outcomes, particularly in low-income countries where health system performance is critical [5].
In this context, examining how these factors interrelate with mortality rates becomes essential for designing effective policies aimed at achieving the SDG target of reducing under-five mortality to 25 per 1,000 live births by 2030 [6]. The causal effect of maternal and child mortality on GDP is generally stronger in HICs and UMICs due to the differences between poor and rich countries with respect to the human capital level or infrastructure. Human capital is the stock of competencies, knowledge, and social and personality attributes, including creativity, embodied in the ability to perform labor so as to produce economic value. The higher human capital level of richer countries compared to poorer countries implies that an equal reduction in maternal and child mortality will cause GDP to increase more in richer countries than in poorer countries [7].
Erdil, et al. [8] applied the Granger-causality approach to a panel data model with fixed coefficients in order to determine the relation between GDP and health expenditures per capita. They found significant bidirectional causality even for such a short time period, signaling that the evidence may be stronger for longer time periods. However, this causality is not homogenous, which is evident from the tests of the HC hypotheses. For one-way causality, the pattern of causality is different in low- and middle-income countries as compared to high-income countries. One-way causality generally runs from GDP to H in LIC and MIC, whereas the reverse holds for HIC.
The mortality‑forecasting literature suggests that univariate approaches such as ARIMA often overlook important interactions with other explanatory factors (e.g. socioeconomic or economic indicators), whereas multivariate models can capture these dependencies. For instance, Modeling mortality with a Bayesian vector autoregression [9] applies a multivariate Bayesian VAR (BVAR) to age‑specific mortality parameters and shows gains in forecast accuracy compared to simpler univariate methods. Evidence from a more recent study, Markov‑Switching Bayesian Vector Autoregression Model in Mortality Forecasting [10], demonstrates that adopting a regime‑switching multivariate framework (MSBVAR) further improves predictions and better accounts for structural changes in mortality patterns a scenario analogous to yours where infant/child mortality may respond dynamically to economic shifts (GDP, GDP per capita, etc.). Meanwhile, broader evidence from macroeconomic forecasting confirms that no single modelling approach uniformly dominates across settings: as noted in the review Forecasting in Economics and Finance, forecasting performance varies by variable, time‑horizon, and structural context, justifying a comparative modelling strategy rather than reliance on a single method.
Econometric models have increasingly been applied to forecast health outcomes such as Infant Mortality Rates (IMR) and Neonatal Mortality Rates (NMR). These models are useful in capturing complex interactions between socioeconomic and macroeconomic factors affecting health. Among the most commonly used techniques are Vector Error Correction Models (VECM), Vector Autoregressive (VAR) models, and Bayesian VAR (BVAR) models, each with unique strengths in modeling both short- and long-term effects and handling uncertainty in dynamic systems.
Recent studies demonstrate the application of various forecasting models in different contexts. In India, Mishra, Sahanaa, and Manikandan [11] used ARIMA models to forecast IMR from 1971 to 2016, predicting a decline to 15 deaths per 1,000 live births by 2025. Similarly, Khan, et al. [12] applied log-log regression and ARIMA models for Asian countries, showing strong correlations between IMR and GDP per capita. In Nigeria, Usman, et al. [13] found a decline in newborn mortality from 51.7% in 1990 to 33.9% in 2017 using ARIMA models, while Ogedi, et al. concluded that ARIMA outperformed Simple Exponential Smoothing (SES) and Brown’s Linear Trend (BLT) in forecasting NMR. These studies underscore the utility of time series models in projecting mortality trends, while also highlighting the potential for more advanced models to improve accuracy.
VECM is particularly effective for examining long-term relationships among non-stationary, cointegrated time series such as GDP, GDP per capita, and mortality rates. It allows researchers to analyze both short- and long-run dynamics, making it ideal for health economic applications. For example, Lin and Lee [14] applied VECM to healthcare spending and life expectancy, revealing strong long-term relationships between economic factors and health outcomes. Similarly, Sulaiman and Salleh [15] demonstrated that socioeconomic improvements, such as increased income, contributed to sustained reductions in infant mortality. Studies in Greece and other settings confirm that VECM effectively captures long-term dependencies and enhances forecasting accuracy compared to univariate models like ARIMA [16-18].
VAR models analyze interdependencies among multiple variables without assuming cointegration or a specific causal direction. This flexibility makes them suitable for examining how macroeconomic variables, including GDP and GDP per capita, influence IMR and NMR over time. Liao, et al. [19] used VAR to show significant links between health expenditures, economic growth, and life expectancy in China. In sub-Saharan Africa, Imo, et al. [20] found that short-run variations in IMR were influenced by healthcare investments, whereas long-term improvements required broader socioeconomic development. VAR models have also highlighted the differential impact of GDP per capita and health spending on under-five mortality, demonstrating their value in forecasting short-term dynamics and feedback effects [21-23].
BVAR extends traditional VAR by incorporating Bayesian methods to improve estimation under uncertainty and limited data. By including prior knowledge, BVAR models provide more robust forecasts, particularly for long-term health trends [24]. Applications in low-income countries show that BVAR delivers accurate predictions despite data limitations, allowing policymakers to incorporate factors such as maternal education, sanitation, and healthcare spending [25,26]. Studies in Australia further confirm that BVAR outperforms traditional VAR models by accounting for parameter uncertainty, particularly when projecting long-term mortality trends [9,27].
While VECM, VAR, and BVAR are widely applied in health forecasting, few studies comprehensively compare their performance in predicting IMR and NMR. Fernández, et al. [28] found that BVAR models generally provide more accurate forecasts under uncertainty, whereas Dube, et al. concluded that VECM captures long-term relationships more effectively, with VAR better suited for short-term forecasting. Comparing these models is crucial for identifying the most reliable approach to inform policy and resource allocation, particularly as countries aim to meet SDG targets for child health. Accurate forecasting supports evidence-based decision-making for healthcare interventions, long-term planning, and socioeconomic strategies to reduce infant and neonatal mortality.
The study utilized country-specific annual infant and neonatal mortality rates data compiled and provided by http://www.childmortality.org. UN IGME web site data used in the United Nations Children’s Fund (UNICEF) Report on levels and trends in Child Mortality. Data on Ugandan GDP and GDP per capita was obtained from the World Bank web site. The researchers used GDP at 2000 US prices (the World Bank’s World Development Indicators 2016; http://devdata.worldbank.org/wdi2016.htm).
The projection of infant and neonatal mortality based on the vector error correction model from 2030 was done using time series data from 1954 to 2016, collected from UNICEF, the Bank of Uganda. Variables considered here include: infant mortality rate, neonatal mortality rate, country’s GDP per capita, and GDP.
Three stages of analysis were adopted to facilitate the achievement of the goal. In the first stage, the study examined the trends and patterns of infant and neonatal mortality rates, government health expenditure, GDP, GDP per capita, sanitation coverage, and maternal literacy levels. The study was further focused by conducting an optimal lag length determination and a co-integration test. The second stage involved time series analysis (VAR, BVAR, and VECM) in order to determine the relationships and effects of the aforementioned variables on health outcomes. Lastly, the study devoted stage three to performing an out-of-sample projection of infant and neonatal mortality rates in 2030.
Time series analysis was used to identify the magnitude and direction of the relationship between health outcomes and GDP, GDP per capita, government health expenditure, sanitation coverage, and maternal literacy.
Trend and patterns of infant and neonatal mortality rates, Ugandan GDP, GDP per capita were explored through graphs.
All the series (IMR, NMR, GDP, and GDPP) were tested for stationarity before they were fixed into the model. According to Granger and Newbold [29], if there is a unit root, then that particular series is considered non-stationary.
A stochastic time series Yt is said to be stationary, if and only if, it satisfies the following assumptions:
1.
2.
3.
Conditions (1) and (2) imply that has a constant mean and variance over time, while condition (3) means that the covariance between series depends only on how far apart they are and not on the time of occurrence [30]; Box and Jenkins [31]. If one or more of the above conditions are not satisfied, the series is non-stationary, and proceeding with regression analysis would result in spurious results, which can produce high t-statistics but have no coherent economic meaning or an insignificant result.
The Augmented Dicky-Fuller (ADF) standard test for unit root in the series was used in testing for the presence of unit root in both transformed and non-transformed series. The time series data in this study can take any of the following stationarity models:
(Radom walk) (1)
(Random walk with a drift) (1.2)
(Mixed process) (1.3)
The error terms εt are assumed to be independent and identically distributed. Dickey and Fuller [32] proposed the ADF test in order to handle the autoregressive process in the variables [33]. Where the ADF indicated the occurrence of a unit root, then the series is non-stationary. In case of non-stationary, then the researcher proceeded to taking the logarithms, differencing until he arrives at a stationary series.
Many scholars used differencing to detrend the data and control autocorrelation by subtracting each datum in a series from its predecessor until stationarity was attained [34]. Once stationarity is attained after differencing d times, the series is said to be integrated in order d [35]. In such cases, the researcher then proceeded with optimum lag length determination and co-integration analysis.
The first step in building a VAR (p) model involved model identification. This helps in identifying the appropriate model's order. The most common methods used for lag order determination include Akaike Information Criterion (AIC) [31], Schwarz-Bayesian (BIC), and Hannan-Quinn (HQC). The main idea of AIC is to select the model that minimizes the negative likelihood penalized by the number of parameters.
The AIC and SBC equations are given below:
(1.4)
(1.5)
Where;
|Σ| = The determinants of the variance/covariance matrix pf the residuals;
N = Total number of parameters estimated in all equation; and
T = The number of usable observations.
Both AIC and SBC differ in their exact definition of a good model. In this case, the study chooses the model that has the lowest AIC and SBC values.
The time series Yt, where denote an (n × 1) vector variables, follows a VAR(p) model if it satisfies
(1.6)
Matrix expression of equation 1.6 is as follows,
Where Π is a k-dimensional vector, Φ1, Φ2,…….,Φp are k*k parameter matrices and et is a sequence of independently and identically distributed error vector.
Assumptions of the errors: -
The error term et is a multivariate normal k*1 vector of error satisfying the following assumptions;
1. E (et) = 0; every error term has a mean zero;
2. E(et e't) = s the contemporaneous covariance matrix of error terms is ω (an*n positive semi-definite matrix);
3. For any non-zero k. There is no correlation across time; in particular, no serial correlation in individual error terms and Φj are k*k matrices.
Lag selection for IMR and NMR was conducted using AIC, BIC, and HQC, guided by inspection of ACF/PACF plots. The two-mortality series exhibited different autocorrelation structures, which justified the use of different maximum lag lengths. Stationarity was assessed using ADF tests, and non-stationary I (1) variables were retained for cointegration testing using the Johansen trace and max-eigenvalue procedures. Where cointegration existed, variables were modelled using VECM; otherwise, differenced stationary series were used in VAR and BVAR. Non-stationary variables were not excluded because cointegration analysis requires I (1) variables to estimate long-run relationships. Both the Chao and Phillips joint procedure and the Johansen-Juselius sequential approach were applied to determine the cointegration rank, ensuring robustness of the long-run specification.
Four basic variants of the model were estimated:
(i) the VEC model using both the Chao and Phillips, 1999 joint procedure, VEC(J), and the more standard sequential procedure based on using the BIC to select the lag length and Johansen and Juselius’s, 1990 maximum root test to select the number of common trends using a sequence of 5% tests, VEC(S).
(ii) A VAR using BIC to select the lag length.
(iii) A DVAR using BIC to select the lag length.
(iv) Two BVARs, each using AIC to select the lag length, but with either the standard Minnesota priors (with hyper parameters of 0.2 for the diagonal and 0.2 for the off-diagonal terms), BVAR (M), or with looser priors on the off-diagonal (off-diagonal hyper parameter = 0.8), BVAR (L).
According to Sims, 1980 and Litterman [34], Vector Autoregressive (VAR) models have been proven to forecast better than any simultaneous equation models. Vector Autoregressive (VAR) models provide information about a variable’s forecasting ability for another variable. It is an econometric model used to capture the evolution and the interdependence between multiple time series. In order to build a VAR model [31], certain steps can be followed. This includes model identification, estimation of constants, a diagnostic check, and finally forecasting. Conditional heteroscedasticity and outliers in the residual series are also checked. The existence of co-integration was used to check for the presence of any common trends, and finally, an Error Correction Model (ECM) was developed due to presence of co integration to improve the long-term forecast.
After obtaining the order of the model, p, of the vector series, the researcher derived the estimators of the constants as in the steps below.
Consider the consecutive VAR models:
…=…
(1.7)
The most common methods of estimating parameters are the maximum likelihood estimator (MLE) and the ordinary least squares estimator. Here, the study applied the ordinary least squares (OLS) method to estimate the parameters of these models and applied equation by equation.
For equation (1.7), let be the OLS estimate of Φ1 and be the OLS estimate of Π. Where (i) is used to denote the estimate of a VAR (i) model. The estimates of the coefficients of the VAR model. When estimated using the unrestricted VAR (p) model are considered to be fixed quantities. These estimates of coefficients do not accurately reflect the underlying relationship because some of the estimated coefficients of the VAR model are non-zero purely by chance when estimated by OLS so restrictions may be imposed to reduce the number of parameters being estimated, Mercy, et al. 2015.
Suppose the orders and constants have been chosen for a VAR model underlying the data, then residuals are checked to see whether they are normally, identically, and independently distributed. After the model had been fitted, heteroscedasticity was tested through multivariate arch tests; autocorrelation through Durbin Watson; normality through Jarque-Bera tests; the unit root test through the Dickey Fuller and Augment Dickey Fuller tests; and the statistical significance of the parameters was tested through t statistics. For details, look at Mercy, et al. 2015.
Standard practice in VAR analysis is to report results from Granger causality tests, impulse responses, and forecast error variance decompositions. These statistics are computed automatically (or nearly so) by many econometrics’ packages. Because of the complicated dynamics in the VAR, these statistics are more informative than the estimated VAR regression coefficients or statistics, which typically go unreported. Granger-causality statistics examine whether lagged values of one variable help to predict another variable.
Bi-variate Granger causality was used to determine which national variables help predict health outcomes. If a variable Granger causes a health outcome, it is included in the VAR and BVAR forecasting models. These tests were conducted for each health outcome at lag length 1,2,3,4 and 5 years using both a deterministic time trend and first differences.
Bayesian Vector Autoregressive (BVAR) was used in this study to assess whether prior information on spatial and economic base-sectoral linkages improves forecast accuracy for health outcomes in Uganda. The study then forecasted the mortality levels using VAR and BVAR
models to compare their forecasting accuracy.
Alkema and New [35] concluded that, point estimates on child mortality based on limited information may substantially under- or overestimate the truth. Uncertainty assessments can and should be used to complement point estimates to avoid unwarranted conclusions about levels or trends in child mortality and to reduce confusion about differences in estimates within and between countries. In this study, both bootstrapping and cross-validation were used to assess the predictive power of the models, and the best model was used to project the infant and neonatal mortality by 2030 or 2035.
Time series analysis uses the factor of time to replace other kinds of influencing factors. Li Y, et al., used the autoregressive integrated moving average (ARIMA) model, one of the classic methods of time series analysis based on past values of a series and previous errors, to forecast under-five mortality in India. Despite all the advantages of this approach, it does not consider the influence of other macro variables in the economy like the Vector Auto Regressive model. The study then considered the infant and neonatal mortality rates from 1954 to 2014 as a training sample to fit the VAR model, and 2015–2017 was considered for the within and without samples for internal and external validation, respectively.
To compare the performance of the VECM, VAR and BVAR models, the researcher computed the mean square error, root mean square error, and Theil’s U statistics for the quality of the time series forecast methods.
For a VAR(p) model, the 1-step ahead forecast at the time origin h is given by;
(1.8)
The associated forecast error is ah = eh +1
The covariance matrix of the forecast error is Σ. If Yt is weakly stationary, then the 1-step ahead forecast Yt(1) converges to its mean vector, µ, as the forecast horizon increases, Mercy, et al. 2014.
(i) Mean Squared Error (MSE): Any of the two models was considered best if it has the minimum forecast error arising from comparing the actual value and forecast value.
(ii) Root Mean Squared Error (RMSE): Is the square root of the average of all squared errors, according to Wang and Lim 2005. It ignores any over and under- estimation.
(iii) Theil’s U Statistic: Theil's U-statistics see Theil, 1958 is used as a measure of forecasting error that is minimized. It is a relative measurement based on comparison of the predicted change with the observed change. The value of U lies between 0 and 1. If U equals to 0, there is a perfect fit, whereas U equals to 1 implies that forcasting of data is very poor.
However, the forecast of infant mortality from 2030 forward is very important in knowing what infant and neonatal mortality would look like if nothing is done to address the risks of infant and neonatal mortality. This paper provides exploratory results on the forecasting powers of three econometric models such as BVAR, VECM, and VAR in estimating the future IMR and NMR in Uganda based on historical data.
To understand the relative importance of different shocks in the system, we applied Forecast Error Variance Decomposition (FEVD) to the VAR/VECM models. FEVD quantifies the proportion of the forecast error variance of each dependent variable (IMR, NMR) that can be attributed to innovations in itself versus innovations in other variables (GDP, GDPP) over different time horizons. This allows us to identify which variables drive fluctuations in infant and neonatal mortality in the short and long run, and how the influence of each variable evolves over time. The method complements impulse response analysis by providing a quantitative measure of each shock’s contribution to overall variability [36-38].
The maximum lag length of five (5) was chosen based on the lowest value of AIC, FPE, LR, and HQIC to model IMR (Table 1).
| Table 1: Lag length determination based on AIC (IMR). | ||||||||
| lag | LL | LR | Df | P | FPE | AIC | HQIC | SBIC |
| 0 | -1790.50 | 1.4e25 | 66.4261 | 66.4687 | 66.5366 | |||
| 1 | -1515.161 | 549.40 | 9 | 0.000 | 7.5e20 | 56.5854 | 56.7558 | 57.0274 |
| 2 | -1452.13 | 127.34 | 9 | 0.000 | 1.0e20 | 54.5605 | 54.8588 | 55.3340* |
| 3 | -1440.62 | 23.04 | 9 | 0.006 | 9.2e19 | 54.4673 | 54.8934 | 55.5722 |
| 4 | -1429.19 | 22.86 | 9 | 0.007 | 8.5e19 | 54.3773 | 54.9313 | 55.16138 |
| 5 | -1413.25 | 31.87* | 9 | 0.000 | 6.8e19* | 54.1204* | 54.8023* | 55.16884 |
| The asterisk * marks the lag that minimizes the criterion (AIC, FPE, HQIC, SBIC). LL-Log Likelihood; LR-Likelihood Ratio; Df-Degree of Freedom; P-value; FPE-Final Prediction Error; AIC-Akaike Information Criterion; HQIC-Hannan-Quinn Information Criterion; SBIC-Schwarz Bayesian Information Criterion |
||||||||
Similarly, a maximum lag length of four (4) was determined based on the same approach to model NMR in Uganda (Table 2).
| Table 2: The Lag length determination based on AIC (NMR). | ||||||||
| lag | LL | LR | Df | P | FPE | AIC | HQIC | SBIC |
| 0 | -1599.32 | 5.1e24 | 65.4008 | 65.4448 | 65.5167 | |||
| 1 | -1329.12 | 540.40 | 9 | 0.000 | 1.2e20 | 54.7396 | 54.9154 | 55.2029 |
| 2 | -1269.35 | 119.55 | 9 | 0.000 | 1.5e19 | 52.6672 | 52.9748 | 53.4780 |
| 3 | -1228.92 | 80.845 | 9 | 0.000 | 4.2e18 | 51.3846 | 51.8241 | 52.5429* |
| 4 | -1214.50 | 28.853* | 9 | 0.001 | 3.5e18* | 51.1631* | 51.7344* | 52.6689 |
| 5 | -1211.57 | 5.16459 | 9 | 0.755 | 4.6e18 | 51.4112 | 52.1143 | 53.2644 |
| The asterisk * marks the lag that minimizes the criterion (AIC, FPE, HQIC, SBIC) LL-Log Likelihood; LR-Likelihood Ratio; Df-Degree of Freedom; P-value; FPE-Final Prediction Error; AIC-Akaike Information Criterion; HQIC-Hannan-Quinn Information Criterion; SBIC-Schwarz Bayesian Information Criterion |
||||||||
Given that the study focused on two health outcomes (IMR and NMR), we selected the appropriate model for assessing the relationship between health outcomes and independent variables.
Chepngetich and John, 2015 reported that VAR models are used to describe and forecast multivariate time series for stationary time series and recommended that for non-stationary time series, a vector error correction term is added to form a vector error correction model (VECM). It was therefore necessary to test for the existence of a stationary linear combination of the non-stationary terms (co-integration). I transformed the selected series into a vector error correction model (VECM) by taking the first difference, which in turn facilitated long-term forecasting. Those series that were not stationary at the first difference were dropped from the forecasting model.
Understanding how the variables relate to changes in the outcome variables is important in planning and policy formulation. As a result, I investigated the relationship between LIMR, LNMR, LGDP, and LGDPP.
The first step taken in building a VAR (p) model was the identification of the appropriate model (lag length) using the Akaike Information Criterion (AIC). The main idea of AIC is to select the model that minimizes the negative likelihood penalized by the number of parameters. The VAR model was then fitted to investigate how changes in the independent variables (LGDP and LGDPP) could affect the health outcomes (LIMR and LNMR) as presented in Figure 1.
The response of LIMR to a one standard deviation (SD) shock (innovation) in LGDP remained steady at about zero in the first five years and increased slightly in the subsequent five years; however, beyond 10 years, LIMR rose above average quickly as compared to the previous period and it remains in the positive region. That is, shocks to LGDP will have a positive impact on LIMR in both the short and long run. Furthermore, the effect of a one SD shock in the LGDPP was not noticeable in the short run and declined in the negative region slowly in the period between five and ten years, but it decreased rapidly beyond 10 years. This implied that the shock to LDGPP will have a negative impact on LIMR in the long run; that is, an increase in a country’s GDPP leads to an increase in health infrastructure development in the long run, which in turn leads to a drop in infant mortality (Figure 1).
The response of LNMR to a one SD shock (innovation) on LGDP was about zero in the first three years, and it decreased rapidly between three and ten years; it remained steady between 10 and twelve years. Beyond twelve years, LNMR registered a quick rise but remained in the negative region. In other words, a one SD shock in LGDP hurts LNMR in both the short and long run (Figure 2).
Similarly, while the impact of a one-SD shock in LGDPP on LNMR was not noticeable in the period zero to five years, the response rapidly increased between the five- and ten-year periods, where it hit a steady value for about two years. The response of LNMR to stimulation in LGDPP started declining but remained in the positive region beyond 12 years. This meant that a shock to LGDPP would benefit LNMR in both the short and long run (Figure 2).
Based on variables such as IMR, GDP, and GDPP, it was noted that GDPP significantly influences IMR in the short run. In addition, Uganda's GDP and GDP all together cause changes in IMR in the short run. In a similar twist, it was discovered that IMR causes GDP, yet GDPP was found not to be significant in causing GDP. It's worth noting that both IMR and GDPP influence a country's GDP. It was also observed that IMR significantly causes GDPP and that there were no significant causal effects between GDP and GDPP (Table 3).
| Table 3: Test of Granger causality among IMR, NMR, GDP, and GDPP. | ||||
| Equation | Excluded | X2 | Df | p value |
| LNMR | LGDP | 35039 | 3 | <0.001* |
| LNMR | LGDPP | 1.4e5 | 4 | <0.001* |
| LNMR | ALL | 9.7 | 7 | <0.001* |
| LGDP | LNMR | 5.4705 | 4 | 0.242 |
| LGDP | LGDPP | 5.7e6 | 4 | <0.001 |
| LGDP | ALL | 6.0e6 | 8 | <0.00 |
| LGDPP | LNMR | 5.487 | 4 | 0.241 |
| LGDPP | LGDP | 1.4e6 | 3 | <0.001* |
| LGDPP | ALL | 2.0e7 | 7 | <0.001* |
| LIMR | LGDP | 1.2e7 | 5 | <0.001* |
| LIMR | LGDPP | 8.6e6 | 3 | <0.001* |
| LIMR | ALL | 1.8e8 | 8 | <0.001* |
| LGDP | LNMR | 13.733 | 5 | 0.017 |
| LGDP | LGDPP | 3.9e5 | 3 | 0.001* |
| LGDP | ALL | 8.3e5 | 8 | <0.001* |
| LGDPP | LNMR | 13.754 | 5 | 0.017* |
| LGDPP | LGDP | 9.9e5 | 5 | 0.001* |
| LGDPP | ALL | 1.0e6 | 10 | 0.001* |
| *Significant variable at 5%. | ||||
The findings revealed a significant two-way causality between IMR and GDPP and a one-way causal effect from IMR to GDP. Interestingly, it was clear that GDP only causes IMR through GDP. There was no causality between GDP and GDP in the short term. In view of the above, four variables that caused an increased infant or neonatal mortality rate were included in the VAR, VECM, and BVAR models.
There was a strong relationship between health outcomes (such as IMR and NMR) and GDP, just like GDPP, which could be due to the level of human capital dependency and the higher marginal effect of health spending on LIC and LMIC. This was in agreement with the research conducted by Amiria and Gerdtham, 2013, who found that there was a stronger relationship because the effects of GDP on health are stronger in LIC and LMIC compared to HIC and UMIC, while, in contrast, the effects of health on GDP are stronger in HIC and UMIC compared to LIC and LMIC. The tests were run with a lag length of four for neonatal mortality and five for infant mortality as predetermined in lag length selection procedures (Table 3).
The Granger causality test was employed to determine whether lagged values of GDP (LGDP) and GDP per capita (GDPP) could predict infant mortality rate (LIMR) and neonatal mortality rate (LNMR). The findings indicate a strong causal relationship, as the lagged values of both LGDP and GDPP significantly influence LIMR, with p-values less than 0.001. This suggests that changes in GDP and GDP can effectively predict future infant mortality rates in the short term. Furthermore, a direct causal effect of LGDP and GDPP on LNMR was also observed, confirming that these variables are useful for forecasting neonatal mortality rates. However, the lagged values of LNMR did not independently affect LGDP, but they did show a significant effect when combined with LGDPP (p = 0.017). Similarly, LIMR did not exhibit a direct causal effect on GDPP alone, but this relationship became significant when considering the interaction with LGDP. Overall, while LNMR has a significant influence on both LGDP and GDPP, the direct causal relationships among these variables highlight the complex interdependencies in predicting mortality rates in the context of economic factors (Table 3).
IMR, NMR, GDP, and GDPP were among the variables taken into account in this study. They were non-stationary at the level but became stationary after the first difference. Based on these results, I used the Johansen test of co-integration at 5% to check for co-integration. This assisted in determining whether the variables move together over time or not, which in turn assisted in choosing between fitting the VAR model and the VECM. Results revealed that there was an order 1 co-integration between LIMR, LGDP, and LGDPP, as well as an order 2 co-integration between LNMR, LGDP, and LGDPP. This meant that these three variables move together in the long run, hence the need to fit the Vector Error Correction Model (VECM) for both NMR and IMR.
To keep things simple, the analysis results showed that there was co-integration of order one for infant mortality data (IMR) and order two for neonatal mortality data (NMR). The study considered the first order of co-integration in the subsequent analysis. With the presence of co-integration in the data set, VECM was found to be more appropriate for making long-term projections than the VAR model, whose supremacies were observed only in the short run. However, there was a need to compare the projection accuracy between VECM and BVAR (Table 4).
| Table 4: Assessment of the infant mortality rate projection accuracy. | |||||
| Variable | Inc. obs. | RMSE | MAE | MAPE | Theil |
| BVAR | 8 | 0.067645 | 0.050364 | 1.441729 | 0.009243 |
| VECM | 8 | 0.01088 | 0.00987 | 0.26941 | 0.00148 |
| RMSE: Root Mean Square Error MAE: Mean Absolute Error MAPE: Mean Absolute Percentage Error Theil: Theil inequality coefficient |
|||||
To accomplish this, the dataset was divided into two parts (the training data was sampled from 1954 to 2010, and the validation data was sampled from 2011 to 2018). The accuracy levels for within-sample forecasts were assessed using Root Mean Squared Error (RMSE), Mean Absolute Error (MAE), Mean Absolute Percent Error (MAPE), and the Theil Inequality Coefficient. In all cases, the model with the minimum possible errors was taken to be the best (Table 4).
The results in Table 4 revealed that, in the long run, the model that can best predict infant mortality rate in Uganda is VECM since it has smaller values of the projection errors as compared to BVAR.
Table 5 indicates that the Vector Error Correction Model (VECM) demonstrated superior accuracy in projecting neonatal mortality rates, as evidenced by its lower forecast error values compared to the Bayesian Vector Autoregression (BVAR) model. Consequently, VECM was selected for forecasting infant and neonatal mortality rates from 2021 to 2035. The study further explored the relationships among Infant Mortality Rate (IMR), Neonatal Mortality Rate (NMR), Gross Domestic Product (GDP), and GDP per capita (GDPP) to assess the impact of changes in GDP and GDPP on IMR and NMR. To quantify this effect, forecast error variance decomposition was employed, measuring the extent to which variations in GDP and GDPP account for changes in IMR and NMR over both short and long-time horizons.
| Table 5: Assessment of the neonatal mortality rate projection accuracy. | |||||
| Variable | Inc. obs. | RMSE | MAE | MAPE | Theil |
| BVAR | 8 | 0.0654 | 0.04763 | 1.40564 | 0.00916 |
| VECM | 8 | 0.00864 | 0.00713 | 0.23333 | 0.0014 |
| RMSE: Root Mean Square Error MAE: Mean Absolute Error MAPE: Mean Absolute Percentage Error Theil: Theil inequality coefficient |
|||||
Based on the long-run equation, it was clear that LGDP hurts IMR, whereas GDPP has a positive effect on infant mortality, and the coefficients were significant at the 5% level. In a nutshell, LGDP and LGDPP have a symmetric effect on LIMR in the long run, on average, ceteris paribus. Overall, the VECM fitted the data very well, with an adjusted R-squared of 97.7%. Further still, the F-statistics were significant at the 1% level (Table 6).
| Table 6: VECM with LIMR as the dependent variable (Data 1954-2018). | |||||
| Variables | Coef. | Std. Err. | Z | p-value | |
| ECT | |||||
| L1. | -0.002339 | 0.000943 | -2.480684 | 0.018* | |
| LIMR | |||||
| LD. | 0.684889 | 0.164376 | 4.166600 | <0.001* | |
| L2D. | 0.603716 | 0.213071 | 2.833400 | 0.008* | |
| L3D. | -0.519278 | 0.211802 | -2.451716 | 0.019* | |
| L4D. | -0.059496 | 0.245058 | -0.242785 | 0.809 | |
| L5D. | 0.242812 | 0.190733 | 1.273047 | 0.211 | |
| LGDP | |||||
| LD. | 1.077618 | 5.481750 | 0.196583 | 0.845 | |
| L2D. | -9.303459 | 17.02088 | -0.546591 | 0.588 | |
| L3D. | 20.65873 | 23.54561 | 0.877392 | 0.386 | |
| L4D. | -20.17390 | 17.17973 | -1.174285 | 0.248 | |
| L5D. | 8.818011 | 5.635692 | 1.564672 | 0.126 | |
| LGDPP | |||||
| LD. | -1.079008 | 5.481226 | -0.196855 | 0.845 | |
| L2D. | 9.300009 | 17.02017 | 0.546411 | 0.588 | |
| L3D. | -20.66295 | 23.54524 | -0.877585 | 0.386 | |
| L4D. | 20.16990 | 17.17955 | 1.174064 | 0.248 | |
| L5D. | -8.822784 | 5.636068 | -1.565415 | 0.126 | |
| _Cons | -0.034619 | 0.016883 | -2.050564 | 0.047* | |
| R-squared | 0.967052 | Mean dependent var | -0.024099 | ||
| Adjusted R-squared | 0.952408 | S.D. dependent var | 0.024719 | ||
| S.E. of regression | 0.005393 | Akaike info criterion | -7.352825 | ||
| Sum squared Resid | 0.001047 | Schwarz criterion | -6.720845 | ||
| Log likelihood | 211.8499 | Hannan-Quinn criterion. | -7.109796 | ||
| F-statistic | 66.03922 | Durbin-Watson stat | 1.950897 | ||
| Prob(F-statistic) | <0.001* | ||||
| *pv<0.05, ** pv<0.01, ***pv<0.001. ΔlnIMRt = -0.0346 + 0.6848ΔlnIMR1 + 0.6037ΔlnIMR2 - 0.5192ΔlnIMR3 - 0.0594ΔlnIMR4 + 0.4238ΔlnIMR5 + 1.0776ΔlnGDP1 - 9.3034ΔlnGDP2 + 20.6587ΔlnGDP3 - 20.1739ΔlnGDP4 + 8.8180ΔlnGDP5- 1.0790ΔlnGDPP1 + 9.3 ΔlnGDPP2 - 20.6636 ΔlnGDPP3 + 20.1699ΔlnGDPP4 - 8.8228ΔlnGDPP5 - 0.0023lnECTt-1 |
|||||
Results of the post-estimation tests indicated that the fitted model had no serial correlation at the 5% level of significance. However, the errors were not normally distributed based on the Jarque-Bera test, and no heteroscedasticity was seen based on the Breusch-Pagan-Godfrey test at the 5% level of significance. In addition, the VECM specification imposes 2-unit moduli, which implies a high level of stability at the significance level of 5% (Table 6)
The results of the vector error correction model with IMR as the dependent variable showed no significant short-run causality from LGDP and LGDPP to IMR at a 5% level of significance. It was important to keep in mind that there was a short-run causal relationship between the IMR's lag values and its present values. IMR's values from the previous two years had a positive impact on its current value, which means that in the near future, IMR will grow by 68.5% and 60.4% on itself as a result of its first and second prior values, respectively. On the other hand, due to the impact of its values from three years ago at a 5% level of importance, IMR will show a decline of 51.9% in the short term (Table 7).
| Table 7: VECM with Johansen normalization restriction imposed. | ||||
| Beta | Coef. | Std. Err. | T | P>t |
| LIMR (-1) | 1 | . | . | . |
| LGDP (-1) | 3.2731 | 0.9661 | 3.3880 | 0.0390 |
| LGDPP (-1) | -5.16348 | 1.5234 | -3.8301 | 0.0060 |
| _Cons | -44.9754 | . | . | . |
| LNMR = - 44.9754 + 3.2731LGDP - 5.16348LGDPP | ||||
The adjustment term (-0.0041) suggested that previous year’s error (or deviation from long-run equilibrium) is corrected for within the current year at a convergence speed of 0.041% annually.
In the short run (6 years), more than 80% of the forecast error variance is explained by IMR itself, and very little influence is seen from GDP and GDPP. In the long run (10 to 15 years), the influence of the LGDP increases while that of the GDPP remains stagnant. This meant that GDP exhibited a strong influence on IMR in the long run, yet IMR had a weak endogenous influence on itself (Table 8).
| Table 8: Short and long run influence of study variable on IMR based on forecast error variance decomposition. | ||||
| Period (Years) | S.E. | LIMR | LGDP | LGDPP |
| 1 | 0.005393 | 100.0000 | 0.000000 | 0.000000 |
| 2 | 0.011180 | 96.22578 | 3.749129 | 0.025095 |
| 3 | 0.020700 | 92.30293 | 7.613713 | 0.083361 |
| 4 | 0.032036 | 87.41467 | 12.40978 | 0.175549 |
| 5 | 0.045769 | 83.56496 | 16.14786 | 0.287174 |
| 6 | 0.060560 | 80.66349 | 19.00394 | 0.332573 |
| 7 | 0.076559 | 78.11233 | 21.54645 | 0.341219 |
| 8 | 0.093319 | 75.74225 | 23.95256 | 0.305192 |
| 9 | 0.110961 | 73.48256 | 26.27398 | 0.243456 |
| 10 | 0.129382 | 71.21862 | 28.60170 | 0.179676 |
| 11 | 0.148480 | 68.98844 | 30.85553 | 0.156031 |
| 12 | 0.168143 | 66.65012 | 33.12385 | 0.226030 |
| 13 | 0.188259 | 64.17239 | 35.38761 | 0.440003 |
| 14 | 0.208703 | 61.54723 | 37.62696 | 0.825811 |
| 15 | 0.229395 | 58.82590 | 39.80220 | 1.371894 |
In the long run, LGDP has a negative impact on LNMR, while LGDPP has a positive impact on LNMR, and the coefficients are significant at the 1% level. In conclusion, I found out that LGDP and LGDPP have a symmetric impact on NMR in the long run-on average, ceteris paribus. The VECM fit the data very well overall, with an adjusted R squared of 97.7%; additionally, the F-statistics were significant at the 1% level.
The model's post-estimation results revealed no serial correlation; error terms were normally distributed, with no heteroscedasticity based on the Breusch-Pagan-Godfrey test at 5%; and the VECM specification imposed 2-unit moduli, implying a high level of stability (Table 9).
| Table 9: Neonatal mortality Rates VECM (Data 1954-2018). | ||||
| Variables | Coefficient | Std. Error | t-Statistic | Prob. |
| ECT (-1) | -0.014 | 0.0037 | -3.7898 | <0.001* |
| LNMR | ||||
| LD. | 0.98989 | 0.1825 | 5.42405 | <0.001* |
| L2D. | -2.2063 | 1.56405 | -1.4106 | 0.1672 |
| L3D. | 2.20701 | 1.56394 | 1.41119 | 0.167 |
| L4D. | 0.1791 | 0.25272 | 0.7087 | 0.4832 |
| LGDP | ||||
| LD. | 5.37233 | 4.1028 | 1.30943 | 0.1989 |
| L2D. | -5.3764 | 4.10264 | -1.3105 | 0.1986 |
| L3D. | -0.3287 | 0.25198 | -1.3045 | 0.2006 |
| L4D. | -5.3918 | 4.14973 | -1.2993 | 0.2023 |
| LGDPP | ||||
| LD. | 5.38548 | 4.14986 | 1.29775 | 0.2029 |
| L2D. | -0.2039 | 0.17877 | -1.1407 | 0.2617 |
| L3D. | 1.44299 | 1.64264 | 0.87846 | 0.3857 |
| L4D. | -1.446 | 1.64237 | -0.8805 | 0.3846 |
| Cons | 0.01764 | 0.00761 | 2.31841 | 0.0264 |
| R-squared | 0.98354 | Mean dependent var | -0.0206 | |
| Adjusted R-squared | 0.97743 | S.D. dependent var | 0.01131 | |
| S.E. of regression | 0.0017 | Akaike info criterion | -9.6819 | |
| Sum squared Resid | 0.0001 | Schwarz criterion | -9.1414 | |
| Log likelihood | 251.207 | Hannan-Quinn criterion. | -9.4768 | |
| F-statistic | 160.908 | Durbin-Watson stat | 1.92344 | |
| Prob(F-statistic) | 0 | |||
| ΔlnNMRt = 0.0176 + 0.98991ΔlnNMR1 - 2.2063ΔlnNMR2 + 2.2070ΔlnNMR3 + 0.1791ΔlnNMR4 + 5.3723ΔlnGDP1 - 0.5764ΔlnGDP2 - 0.3287ΔlnGDP3 - 5.3854ΔlnGDP4 + 5.385 ΔlnGDPP1 - 0.2039ΔGDPP2 + 1.4430ΔGDPP3 - 1.4460ΔGDPP3 - 0.014ECTt-1 |
||||
The VECM results revealed no short-run causality running from the lagged values of NMR; in particular, the current values of NMR tend to increase with neonatal mortality by 99%. Furthermore, there were no short-run causalities from lagged GDP and GDPP values to NMR because their coefficients were not significant at the 5% level, implying that there is no short-run causality from LGDP and LGDPP to LNM (Table 10).
| Table 10: VECM with Johansen normalization restriction imposed. | ||||
| Beta | Coef. | Std. Err. | z | P>z |
| LNMR (-1) | 1 | . | . | . |
| LGDP (-1) | 1.0155 | 0.1049 | 9.6795 | 0 |
| LGDPP (-1) | -1.1797 | 0.1998 | -5.9061 | 0 |
| Cons | -19.692 | . | . | . |
| LNMR = –19.6917 + 1.0155LGDP – 1.1797LGDPP | ||||
The VECM results showed an adjustment term of -0.014, suggesting that the previous year’s errors are corrected within the current year at a convergence speed of 1.4% annually, towards the long-run equilibrium. Furthermore, it indicated long-run causality running from LGDP and LGDPP to LNMR.
In the short run (1–6 years), the majority of the neonatal mortality forecast error variances (over 89%) were explained by themselves, and only a very small percentage was explained by GDP and GDPP. This influence reduced with time to the extent that, between 10 and 15 years, the influence of GDPP on NMR became stronger, and that of GDP remained weak all through the period (Table 11).
| Table 11: VECM projection error Variance Decomposition of LNMR. | ||||
| Period (Years) | S.E. | LNMR | LGDP | LGDPP |
| 1 | 0.001700 | 100.0000 | 0.000000 | 0.000000 |
| 2 | 0.004075 | 97.91209 | 1.393570 | 0.694340 |
| 3 | 0.007194 | 97.15839 | 1.209350 | 1.632262 |
| 4 | 0.010488 | 96.43193 | 0.580207 | 2.987866 |
| 5 | 0.013667 | 94.26186 | 0.658029 | 5.080107 |
| 6 | 0.016684 | 89.90393 | 1.688806 | 8.407265 |
| 7 | 0.019628 | 83.41279 | 2.999677 | 13.58754 |
| 8 | 0.022695 | 75.56837 | 3.619241 | 20.81239 |
| 9 | 0.026055 | 67.15857 | 3.348640 | 29.49279 |
| 10 | 0.029772 | 59.10897 | 2.638423 | 38.25261 |
| 11 | 0.033746 | 52.17818 | 2.089350 | 45.73247 |
| 12 | 0.037718 | 46.72443 | 1.964940 | 51.31063 |
| 13 | 0.041405 | 42.73652 | 2.264125 | 54.99935 |
| 14 | 0.044613 | 39.97626 | 2.919827 | 57.10391 |
| 15 | 0.047295 | 38.14498 | 3.918061 | 57.93695 |
After assessing the forecasting powers of the models, predictions for infant and neonatal mortality in the long run up to 2035 based on the Vector Error Correction Model (VECM) were estimated since it proved to be more reliable in making long-term forecasts (Table 12).
| Table 12: Forecast of Infant Mortality Rate (IMR) and Neonatal Mortality Rate (NMR) | ||||||
| Time period | Lower limits | IMR | Upper limits | Lower limits | NMR | Upper limits |
| 2020 | 31.9 | 32.3 | 32.6 | 19.4 | 19.4 | 19.5 |
| 2021 | 30.5 | 31.2 | 31.9 | 18.7 | 18.8 | 19.0 |
| 2022 | 29.0 | 30.1 | 31.3 | 18.0 | 18.2 | 18.4 |
| 2023 | 27.6 | 29.2 | 3s0.9 | 17.3 | 17.6 | 17.9 |
| 2024 | 26.2 | 28.3 | 30.5 | 16.7 | 17.1 | 17.4 |
| 2025 | 25.0 | 27.5 | 30.2 | 16.2 | 16.7 | 17.1 |
| 2026 | 23.8 | 26.7 | 29.9 | 15.9 | 16.4 | 16.8 |
| 2027 | 22.7 | 25.16 | 29.4 | 15.7 | 16.2 | 16.7 |
| 2028 | 21.5 | 24.9 | 28.8 | 15.5 | 16.0 | 16.6 |
| 2029 | 20.3 | 23.9 | 28.1 | 15.2 | 15.16 | 16.4 |
| 2030 | 19.1 | 22.8 | 27.1 | 14.9 | 15.5 | 16.2 |
| 2031 | 17.9 | 21.6 | 26.0 | 14.5 | 15.2 | 15.9 |
| 2032 | 16.8 | 20.4 | 24.9 | 14.1 | 14.8 | 15.5 |
| 2033 | 15.16 | 19.3 | 23.7 | 13.7 | 14.4 | 15.1 |
| 2034 | 14.8 | 18.3 | 22.6 | 13.3 | 14.0 | 14.7 |
| 2035 | 13.9 | 17.3 | 21.5 | 12.9 | 13.6 | 14.3 |
Globally, there has been a steady decline in NMR figures over the years, particularly in Uganda; however, this appears to have stalled in recent years, and this trend of slow decline is expected to continue during the lifespan of the Sustainable Development Goals if nothing is done. As presented in Figure 3, we applied the suitable prediction model (VECM) after a deeper investigation of its capability to conduct both the short- and long-term forecasts. It was observed that by 2035, Uganda will have about 17 deaths per 1,000 live births.
Results presented in Figure 4 show that the decline of NMR will continue to slow over the next 15 years. The projections presented here indicate where the health practitioner should focus. If the goal of eradicating infant mortality is to be realized, more emphasis should be placed on newborn life throughout the country.
The analysis of IMR and NMR in relation to GDP and GDP per capita (GDPP) revealed a significant negative correlation overall: as national GDP and GDPP increase, resources available for investment in the health sector expand, improving access to quality healthcare services and enhancing infant survival. Impulse response functions indicate that in the short run, the response of log-transformed IMR (LIMR) to a one standard deviation shock in LGDP is minimal, reflecting limited immediate impact. However, beyond ten years, LIMR rises and remains in the positive region, suggesting that shocks to LGDP exert a sustained long-term influence on infant mortality.
VECM long-run coefficients (p < 0.05) confirm a negative effect of LGDP on IMR, consistent with the expected economic-health link. Conversely, GDPP exhibits a positive long-term association with infant mortality, which may appear counterintuitive. This outcome can be attributed to structural and contextual factors: rising GDP does not guarantee equitable resource distribution. If economic growth primarily benefits wealthier households, marginalized populations may experience minimal improvements in health, nutrition, or sanitation. Rapid economic growth may also outpace investments in health infrastructure and maternal care, while urban overcrowding and environmental hazards persist, thereby offsetting potential mortality reductions. These findings underscore the importance of equitable growth and targeted health investments to translate economic gains into tangible health benefits.
The observed associations align with findings from Khan, et al. 2019, who forecasted IMR for Asian countries using log-log regression and ARIMA models, reporting significant correlations between GDP and IMR. Similarly, Ensor, et al. [5] highlighted those economic recessions negatively affect maternal and infant health, particularly in the early stages of national development. The relationship between GDP, GDPP, and infant mortality is also influenced by human capital; in low- and middle-income countries (LICs and LMICs), the marginal effect of health spending on mortality is higher, whereas in high- and upper-middle-income countries (HICs and UMICs), reductions in mortality have a stronger effect on GDP.
Comparative analyses of the SAARC countries further support these findings. Panel data analyses from 1990 to 2016 revealed that GDP per capita, health spending, and education spending significantly influenced the Human Development Index (HDI), though immediate causal effects were not evident. Long-run relationships were observed, indicating that sustained investment in health and education, alongside economic growth, is necessary to achieve improvements in development and health outcomes [39,40].
In examining the forecasting powers of econometric models, the study confirmed that BVAR and VAR models perform best for short-run forecasts, while VECM provides superior long-run accuracy for both IMR and NMR. These models were evaluated using out-of-sample forecasts over fifteen years with metrics such as RMSE, MAE, MAPE, and Theil’s U. Evidence from Australia shows that BVAR models enhance forecast accuracy compared to traditional VAR models by incorporating parameter uncertainty [39]. The VECM’s superiority in capturing long-term trends is consistent with findings from Arnold and Sherris [17], who applied VECM to cause-of-death mortality rates in Switzerland, and Zhou, et al. [16], who demonstrated that VECM effectively models stochastic factors in mortality data. These studies collectively validate the reliability of VECM for long-term forecasting of infant and neonatal mortality [41,42].
Macroeconomic variables such as GDP and GDP are important components that can be used to predict infant mortality in both the short and long run. The analysis results showed that the short-run forecasts could be made using univariate time series (ARIMA), since for both NMR and IMR, the majority of the forecast error variance can be explained by themselves, and this was also further alluded to in the short-run coefficient of the VECM. Long-run forecasts of both IMR and NMR can be done more accurately and successfully using VECM than VAR and BVAR models.
Although GDPP had a longer-term advantage over GDP on measures of infant mortality (IMR), GDPP had a longer-term advantage over GDP on these measures. In general, the ability to forecast IMR over the long term more accurately with GDP and NMR more effectively with GDPP, since the nation's GDP is collective in nature, it was only natural that it would favor infants in terms of infrastructure and social service supply in the nation that supports baby survival. The GDPP, on the other hand, offers the closest amount of resources at the individual level that can meet the demands of children. The study revealed that GDP growth generally reduces infant mortality, the heterogeneous distribution of income, and the lag in public service expansion help explain why GDP may exhibit counterintuitive positive effects in certain contexts. This distinction between short-run neutrality and long-run divergence underscores the importance of considering both temporal dynamics and socio-economic structures when interpreting the VECM and impulse response results.
Policymakers should adopt VECM-based forecasting for long-term planning of infant and neonatal mortality, as the model demonstrated superior accuracy over VAR and BVAR and reflects the strong cointegration among IMR, NMR, GDP, and GDPP. Short-term monitoring can rely on VAR/BVAR, which perform better in the absence of long-run equilibrium dynamics. Given that GDPP strongly predicts and reduces IMR and NMR, strategies that raise household welfare are needed. Uganda must strengthen equitable health investments, expand maternal and child health coverage in underserved areas, and address social determinants of health. Aligning economic growth with targeted public health strategies ensures that all segments of the population benefit, making reductions in infant and neonatal mortality more consistent and sustainable.
This study’s macro-level models do not account for potentially important factors such as governance, health sector spending, income inequality, maternal education, and fertility, which could help explain counterintuitive findings, including the positive long-term association between GDP and IMR/NMR. Additionally, aggregate GDP may mask population heterogeneity, limiting the responsiveness of infant mortality to overall economic growth, particularly in developing countries. Data quality for the early years of the series may be less reliable, which could affect model accuracy. Finally, caution is warranted when generalizing these results to other low- and middle-income countries, as country-specific institutional, demographic, and health system contexts may differ.
We thank the World Bank and UNICEF for providing us with data. We thank Makerere University for approving the study.
Contributors: OB: Conceptualization, Data Curation, Investigation, Methodology, Project Administration, Software, Writing original draft, and visualization. BO: Formal Analysis, Validation, reviewing, and editing: ET: Software, Validation; SM: Formal Analysis, Data Curation.
Declaration statement: All Authors declare that the information provided in the publication is authentic with not biased
Data sharing statement: Data can be accessed through the World Bank and UNICEF websites.
Sharrow D. Open access article under the CC BY 4.0 license. 2022. doi:10.1016/S2214-109X(21)00515-5.
Baraki A, Kedir H, Geda T. The impact of preventable neonatal encephalopathy and infections on neonatal mortality in developing countries. Int J Pediatr. 2020;8(5):72–79.
World Health Organization. Neonatal mortality: the silent epidemic. Geneva: World Health Organization; 2017.
Khelfaoui M, Zougari S, Douma L. The impact of maternal education and economic factors on infant mortality in Uganda. J Dev Econ. 2022;87(4):89–104.
Ensor T, Gafur S. The effect of GDP fluctuations on maternal and infant mortality rates in developing nations. J Health Econ. 2010;22(3):223–235.
Levels and trends in child mortality: report 2017. New York: United Nations Children’s Fund; 2017.
Arshia A, Ulf-G G. The causal effect of maternal and child mortality on GDP in high-income and upper-middle-income countries. Int J Health Econ Manag. 2013;13(2):121–136.
Erdil E, Yildirim A. The Granger-causality approach to examining GDP and health expenditures per capita. Appl Econ Lett. 2010;17(12):1151–1155.
Njenga F. Bayesian VAR models for mortality rate forecasts in Australia. Aust J Econ Model. 2015;12(1):67–79.
Fu W, Smit B, Brewer P, Droms S. Markov-switching Bayesian vector autoregression model in mortality forecasting. Risk. 2023;11(9). doi:10.3390/risks11090152.
Mishra S, Sahanaa R, Manikandan S. Forecasting IMR in India from 1971 to 2016 using ARIMA models. Int J Health Forecast. 2019;26(2):145–159.
Shafi KM, Samreen F, Syeeda S. Modeling and forecasting infant mortality rates of Asian countries from the perspective of GDP (PPP). Int J Sci Eng Res. 2019;10(3).
Usman A. Application of ARIMA models to analyze newborn mortality trends in Nigeria. Int J Health Forecast. 2019;28(5):171–182.
Lin W, Lee C. Exploring the relationship between healthcare spending and life expectancy using VECM. Int J Public Health Econ. 2016;49(2):112–130.
Sulaiman T, Salleh M. Forecasting infant mortality in developing countries using the vector error correction model (VECM). J Health Econ. 2019;19(4):210–218.
Zhou H. Mortality rate modeling using VECM: an approach for improving forecast accuracy. J Forecast. 2014;33(1):56–63.
Arnold F, Sherris J. Application of VECM in assessing cause-of-death mortality rates. J Popul Health. 2013;56(3):459–478.
Siahanidou T. Analysis of IMR trends in Greece from 2004 to 2016 using VECM. Eur J Popul. 2019;35(6):787–799.
Liao TF. Analysis of the relationship between health expenditures and health outcomes in China using vector autoregressive (VAR) models. Asian J Public Health. 2017;37(5):209–220.
The effects of health expenditure on infant mortality in sub-Saharan Africa: evidence from panel data analysis. J Public Aff. 2018.
Faye O. Assessing the relationship between health expenditure, GDP, and IMR in the Philippines using a vector autoregressive (VAR) model. Glob Health Action. 2014;7(1):28–36.
Lane S. The impact of public health expenditure on national health outcomes: a statistical analysis. J Public Health Policy. 2013;34(3):396–405.
Chung H, Muntaner C. The effect of income inequality on infant mortality across various countries. Am J Public Health. 2006;96(10):1780–1786.
Robert BL. Forecasting with Bayesian vector autoregressions: five years of experience. J Bus Econ Stat. 1986;4(1):25–38. doi:10.2307/1391384.
Kholid H. Forecasting health outcomes in low-income countries using Bayesian VAR models. Int J Health Forecast. 2019;45(3):168–182.
Elliott G, Rothenberg T, Stock J. Efficient tests for an autoregressive unit root. J Econom. 2016;55(1):1–24.
Guibert L, Lopez B, Piette J. Forecasting mortality rates using Bayesian VAR: improving forecast accuracy in uncertain environments. J Forecast. 2019;40(4):290–302.
Fernández C. Comparison of VAR and BVAR models in forecasting macroeconomic variables. J Forecast. 2018;37(5):529–546.
Granger CWJ, Newbold P. Spurious regressions in econometrics. J Econom. 1974;111–120. doi:10.1016/0304-4076(74)90034-7.
Danao M. Statistical methods for time series analysis: issues in regression and co-integration models. Econ J Southeast Asia. 2002;24(3):111–132.
Box G, Jenkins G. Time series analysis: forecasting and control. 2nd ed. San Francisco: Holden-Day; 1976.
Dickey D, Fuller W. Likelihood ratio statistics for autoregressive time series with a unit root. Econometrica. 1981;49(4):1057–1072.
Dickey D, Fuller W. Distribution of the estimators for autoregressive time series with a unit root. J Am Stat Assoc. 1979;74(366):427–431.
Economic growth, financial and trade globalization in the Philippines: a vector autoregressive analysis. 2014.
Alkema L, New J. Progress toward global reduction in under-5 mortality: a bootstrap analysis of uncertainty in Millennium Development Goal 4 estimates. PLoS Med. 2012;9(12):e1001355.
Lütkepohl H. New introduction to multiple time series analysis. Berlin: Springer; 2005.
Enders W. Applied econometric time series. 4th ed. Hoboken: Wiley; 2015.
Kilian L, Lütkepohl H. Structural vector autoregressive analysis. Cambridge: Cambridge University Press; 2017.
Kurniasih W. Evaluation of the accuracy of ARIMA, Holt–Winters, and the α-Sutte indicator in forecasting mortality rates. Asian J Forecast. 2018;25(2):205–218.
Bhowmik M. Relationship between GDP, health expenditure, and the Human Development Index in the SAARC region. J Dev Stud. 2019;17(1):35–48.
Carolyn N, Michael S. Modeling mortality with a Bayesian vector autoregression. Insurance Math Econ. 2020.
David S. A systematic analysis by the UN Inter-agency Group for Child Mortality Estimation. Elsevier; n.d. doi:10.1016/S2214-109X(21)00515-5.
Odur B, Etil T, Opio B, Memon S. Comparing Forecasting Models for Predicting Infant Mortality: VECM vs. VAR and BVAR Specifications. IgMin Res. December 08, 2025; 3(12): 440-453. IgMin ID: igmin323; DOI:10.61927/igmin323; Available at: igmin.link/p323
Anyone you share the following link with will be able to read this content:
School of Statistics and Planning, Makerere University, Kampala, Uganda
Address Correspondence:
Dr. Tom Etil, School of Statistics and Planning, Makerere University, Kampala, Uganda, Email: etiltom@gmail.com
How to cite this article:
Odur B, Etil T, Opio B, Memon S. Comparing Forecasting Models for Predicting Infant Mortality: VECM vs. VAR and BVAR Specifications. IgMin Res. December 08, 2025; 3(12): 440-453. IgMin ID: igmin323; DOI:10.61927/igmin323; Available at: igmin.link/p323
Copyright: 2025 Odur B, et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Figure 1: Impulse response function for infant mortality rat...
Figure 2: Impulse response function for infant mortality rat...
Figure 3: Forecast of NMR up to 2035 using VECM....
Figure 4: Forecast of NMR up to 2035 using VECM....
Sharrow D. Open access article under the CC BY 4.0 license. 2022. doi:10.1016/S2214-109X(21)00515-5.
Baraki A, Kedir H, Geda T. The impact of preventable neonatal encephalopathy and infections on neonatal mortality in developing countries. Int J Pediatr. 2020;8(5):72–79.
World Health Organization. Neonatal mortality: the silent epidemic. Geneva: World Health Organization; 2017.
Khelfaoui M, Zougari S, Douma L. The impact of maternal education and economic factors on infant mortality in Uganda. J Dev Econ. 2022;87(4):89–104.
Ensor T, Gafur S. The effect of GDP fluctuations on maternal and infant mortality rates in developing nations. J Health Econ. 2010;22(3):223–235.
Levels and trends in child mortality: report 2017. New York: United Nations Children’s Fund; 2017.
Arshia A, Ulf-G G. The causal effect of maternal and child mortality on GDP in high-income and upper-middle-income countries. Int J Health Econ Manag. 2013;13(2):121–136.
Erdil E, Yildirim A. The Granger-causality approach to examining GDP and health expenditures per capita. Appl Econ Lett. 2010;17(12):1151–1155.
Njenga F. Bayesian VAR models for mortality rate forecasts in Australia. Aust J Econ Model. 2015;12(1):67–79.
Fu W, Smit B, Brewer P, Droms S. Markov-switching Bayesian vector autoregression model in mortality forecasting. Risk. 2023;11(9). doi:10.3390/risks11090152.
Mishra S, Sahanaa R, Manikandan S. Forecasting IMR in India from 1971 to 2016 using ARIMA models. Int J Health Forecast. 2019;26(2):145–159.
Shafi KM, Samreen F, Syeeda S. Modeling and forecasting infant mortality rates of Asian countries from the perspective of GDP (PPP). Int J Sci Eng Res. 2019;10(3).
Usman A. Application of ARIMA models to analyze newborn mortality trends in Nigeria. Int J Health Forecast. 2019;28(5):171–182.
Lin W, Lee C. Exploring the relationship between healthcare spending and life expectancy using VECM. Int J Public Health Econ. 2016;49(2):112–130.
Sulaiman T, Salleh M. Forecasting infant mortality in developing countries using the vector error correction model (VECM). J Health Econ. 2019;19(4):210–218.
Zhou H. Mortality rate modeling using VECM: an approach for improving forecast accuracy. J Forecast. 2014;33(1):56–63.
Arnold F, Sherris J. Application of VECM in assessing cause-of-death mortality rates. J Popul Health. 2013;56(3):459–478.
Siahanidou T. Analysis of IMR trends in Greece from 2004 to 2016 using VECM. Eur J Popul. 2019;35(6):787–799.
Liao TF. Analysis of the relationship between health expenditures and health outcomes in China using vector autoregressive (VAR) models. Asian J Public Health. 2017;37(5):209–220.
The effects of health expenditure on infant mortality in sub-Saharan Africa: evidence from panel data analysis. J Public Aff. 2018.
Faye O. Assessing the relationship between health expenditure, GDP, and IMR in the Philippines using a vector autoregressive (VAR) model. Glob Health Action. 2014;7(1):28–36.
Lane S. The impact of public health expenditure on national health outcomes: a statistical analysis. J Public Health Policy. 2013;34(3):396–405.
Chung H, Muntaner C. The effect of income inequality on infant mortality across various countries. Am J Public Health. 2006;96(10):1780–1786.
Robert BL. Forecasting with Bayesian vector autoregressions: five years of experience. J Bus Econ Stat. 1986;4(1):25–38. doi:10.2307/1391384.
Kholid H. Forecasting health outcomes in low-income countries using Bayesian VAR models. Int J Health Forecast. 2019;45(3):168–182.
Elliott G, Rothenberg T, Stock J. Efficient tests for an autoregressive unit root. J Econom. 2016;55(1):1–24.
Guibert L, Lopez B, Piette J. Forecasting mortality rates using Bayesian VAR: improving forecast accuracy in uncertain environments. J Forecast. 2019;40(4):290–302.
Fernández C. Comparison of VAR and BVAR models in forecasting macroeconomic variables. J Forecast. 2018;37(5):529–546.
Granger CWJ, Newbold P. Spurious regressions in econometrics. J Econom. 1974;111–120. doi:10.1016/0304-4076(74)90034-7.
Danao M. Statistical methods for time series analysis: issues in regression and co-integration models. Econ J Southeast Asia. 2002;24(3):111–132.
Box G, Jenkins G. Time series analysis: forecasting and control. 2nd ed. San Francisco: Holden-Day; 1976.
Dickey D, Fuller W. Likelihood ratio statistics for autoregressive time series with a unit root. Econometrica. 1981;49(4):1057–1072.
Dickey D, Fuller W. Distribution of the estimators for autoregressive time series with a unit root. J Am Stat Assoc. 1979;74(366):427–431.
Economic growth, financial and trade globalization in the Philippines: a vector autoregressive analysis. 2014.
Alkema L, New J. Progress toward global reduction in under-5 mortality: a bootstrap analysis of uncertainty in Millennium Development Goal 4 estimates. PLoS Med. 2012;9(12):e1001355.
Lütkepohl H. New introduction to multiple time series analysis. Berlin: Springer; 2005.
Enders W. Applied econometric time series. 4th ed. Hoboken: Wiley; 2015.
Kilian L, Lütkepohl H. Structural vector autoregressive analysis. Cambridge: Cambridge University Press; 2017.
Kurniasih W. Evaluation of the accuracy of ARIMA, Holt–Winters, and the α-Sutte indicator in forecasting mortality rates. Asian J Forecast. 2018;25(2):205–218.
Bhowmik M. Relationship between GDP, health expenditure, and the Human Development Index in the SAARC region. J Dev Stud. 2019;17(1):35–48.
Carolyn N, Michael S. Modeling mortality with a Bayesian vector autoregression. Insurance Math Econ. 2020.
David S. A systematic analysis by the UN Inter-agency Group for Child Mortality Estimation. Elsevier; n.d. doi:10.1016/S2214-109X(21)00515-5.