Abstract

Africa’s first COVID-19 case was recorded in Egypt on February 14, 2020. Although it is not as expected by the World Health Organization (WHO) and other international organizations, currently a large number of Africans are getting infected by the virus. In this work, we studied the trend of the COVID-19 outbreak generally in Africa as a continent and in the five African regions separately. The study also investigated the validity of the ARIMA approach to forecast the spread of COVID-19 in Africa. The data of daily confirmed new COVID-19 cases from February 15 to October 16, 2020, were collected from the official website of Our World in Data to construct the autoregressive integrated moving average (ARIMA) model and to predict the trend of the daily confirmed cases through STATA 13 and EViews 9 software. The model used for our ARIMA estimation and prediction was (3, 1, 4) for Africa as a continent, ARIMA (3, 1, 3) for East Africa, ARIMA (2, 1, 3) for West Africa, ARIMA (2, 1, 3) for Central Africa, ARIMA (1, 1, 4) for North Africa, and ARIMA (4, 1, 5) for Southern Africa. Finally, the forecasted values were compared with the actual number of COVID-19 cases in the region. At the African level, the ARIMA model forecasted values and the actual data have similar signs with slightly different sizes, and there were some deviations at the subregional level. However, given the uncertain nature of the current COVID-19 pandemic, it is helpful to forecast the future trend of such pandemics by employing the ARIMA model.

1. Introduction

Severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) caused a highly contagious disease called coronavirus disease 2019 (COVID-19). The virus was first reported in Wuhan city, People’s Republic of China (PRC), in December 2019 [13].

Africa is the last continent to be infected by the COVID-19 pandemic. However, Africa, with the most vulnerable populations to infectious diseases, is predicted to be significantly affected by the ongoing COVID-19 outbreak [3]. As of October 30, 2021; CDC Africa reported 8,491,452 cases, 218,175 deaths, and 7,890,021 recoveries out of 76,692,988 COVID-19 tests [4].

There is no sustainable economy without a strong health sector [5]. However, the socio-economic conditions in Africa are much less developed than other countries of the world. A series of factors, i.e., the scarcity of medical supplies, low socioeconomic status, lower virus testing efficiency, and poor information and communication technology (ICT) would facilitate the spread of the pandemic [6]. Before the COVID-19 pandemic, most of the healthcare infrastructure in African countries had been deteriorating, and the situation posed unprecedented pressure on the public health systems in many African countries [7]. Currently, the COVID-19 health crisis has already transformed into an economic and labor market shock, impacting not only supply (production of goods and services) but also demand (consumption and investment). In Africa and many low- and middle-income countries, the spread of COVID-19 is translated into economic impacts that will affect the already affected most vulnerable populations [8]. Many people, especially from developing nations, are shifting their focus from the fatal effects of the pandemic to the threats it poses to their daily supply of food [9].

What will be the global socio-economic impact of the novel coronavirus (COVID-19)? Answering this question requires accurate forecasting of the spread of confirmed cases as well as analysis of the number of COVID-19-related deaths and recoveries [10]. It is essential to create a reliable and suitable predictive model that is of high importance to understand the current situation, evaluate the severity of the pandemic, and help governments and other stakeholders to control the further spread of novel coronavirus [11, 12]. Until now, several researchers have utilized the ARIMA model to forecast the spread of COVID-19 in many countries. In [13], the researchers employed the ARIMA model to forecast the spread of COVID-19 incidence in Italy, Russia, and the United States of America. In [1419], ARIMA was used to forecast the spread of the COVID-19 pandemic in Bangladesh, Malaysia, India, Korea, Ethiopia, and Pakistan, respectively. However, the outcome of the aforementioned research depends on the validity of the ARIMA model. Therefore, the general objective of this research is to check the validity of the ARIMA model to forecast the spread of the COVID-19 pandemic in Africa under current testing capacity and public health interventions.

Once we started with this brief introduction, the whole paper was organized as follows: In the second part, the study presented the research methodology, in which the specification of the theoretical model and the data issues are included. The third part presents the empirical findings and their interpretations. Finally, in the last part, the study draws possible conclusions and policy implications based on the findings of the study.

2. Materials and Methods

2.1. Description of the Study Area

Africa is also called the “Mother Continent” due to its being the oldest inhabited continent on Earth and being the homeland for humans and their ancestors for more than five million years. The current population of Africa is 1,382,727,760 as of Tuesday, October 26, 2021, and it is equivalent to 16.72% of the total world population [20]. It can be seen from Table 1 that the UN Statistics Division, Africa, is subdivided into five regions: Northern Africa, Central or Middle Africa, Southern Africa, East Africa, and Western Africa [21].

2.2. Data Collection

The official website of Our World in Data was used to collect the daily confirmed new COVID-19 cases from February 15, 2020, to October 16, 2020 [22]. Since the collected data includes the respective daily new confirmed COVID-19 cases of the past few days, it is considered as time series data. STATA 13 and EViews 9 has been used to build and analyze the time-series data.

2.3. Description and Development of Econometric Model

The Box-Jenkins approach to modeling ARIMA processes is employed in this study, and this methodology is widely regarded as the most efficient forecasting technique and is used extensively [23]. The abbreviation ARIMA stands for autoregressive integrated moving average model, and the model is divided into three components depending on the type of data. The first building block is the autoregressive (AR) models, in which the value of a variable in one period is related to its value in previous periods. is an autoregressive model with lag.

; where Yt is the dependent variable, is a constant, p is the coefficient for the lagged variable in time and εt is the random or white noise term that represents a shock that cannot be explained.

The second component is the moving average (MA) model, which accounts for the possibility of a relationship between a variable and the residuals from previous periods. MA(q) is a moving average model with q lags: ; where Yt is the dependent variable, is a constant, is the coefficient for the lagged variable in time t-i, and εt is the error term.

The combination of the above two models, which are AR (autoregressive) and MA (moving average), gives the ARMA model, and the ARMA model can be used if and only if the data is stationary. The ARMA models combine both autoregressive terms and q moving average terms, also called ARMA .

When a variable Yt is not stationary, a common solution is to use a differenced variable: , for first-order differences, and by differencing (integrating (I)) the original series before using them will remove any linear time trend. If we include the third component, which is integrated into our ARMA model, it will become ARIMA.

The emphasis of these methods is not on constructing single-equation or simultaneous equation models but on analyzing the probabilistic or stochastic properties of economic time series according to their philosophy. Let the data speak for themselves as it allows Yt to be explained by the past, or lagged, values of Y and the stochastic error terms. For this reason, ARIMA models are not derived from any economic theory [2426].

2.4. Model Selection Criteria

Out-of-sample forecasting is concerned with determining how a fitted model forecasts future regress values and given the values of the regressors. To select the best ARIMA type of model fitted for Africa and the five African regions, their goodness-of-fit has been compared using the Akaike information criteria (AIC) and Bayesian information criterion (BIC). AIC is an important and leading statistic by which we can determine the order of an autoregressive model. The AIC considers both how well the model fits the observed series and the number of parameters to be used in the fit. The BIC can help in deciding the order of autoregression., where k is the number of regressors (including the intercept), n is the number of observations, and RSS stands for the residual sum of the square. In comparing two or more models, the model with the lowest value of AIC is preferred. One advantage of the AIC is that it is useful for not only in-sample but also out-of-sample forecasting of the performance of a regression model.; where is the value of the likelihood function evaluated at the parameter estimates, is the number of observations, and is the number of estimated parameters. A lower AIC or BIC value indicates a better fit (a more parsimonious model).

3. Data Presentation and Analysis

3.1. Descriptive Statistics

The daily confirmed new COVID-19 case time series plots of African countries initially depict an increasing trend and later a declining trend, but there is a variation in the daily confirmed new COVID-19 case time series plots among African countries and between the five African regions, as shown in Figure 1.

Source: own computation, 2021.

3.2. Econometric Result and Interpretation
3.2.1. Parameter Estimation and Validation

Five models are chosen and tested to find a model with a good fit for Africa and the five African regions, and the model with the lowest AIC is selected for estimation and forecasting. As we can see from Table 2, that the ARIMA (3, 1, 4), ARIMA (3, 1, 3), ARIMA (2, 1, 3), ARIMA (2, 1, 3), ARIMA (4, 1, 2), and ARIMA (4, 1, 5) models are selected to estimate the parameters of Africa, East Africa, West Africa, Central Africa, North Africa, and Southern Africa, respectively.

Source: own computation, 2021.

Source: own computation, 2021.

Source: own computation, 2021.

Source: own computation, 2021.

Source: own computation, 2021.

Source: own computation, 2021.

3.2.2. Forecasting Using the Selected ARIMA Models

Candidate models were obtained based on the respective spikes observed from the autocorrelation function (ACF) and the partial autocorrelation function (PACF). Table 3 presents the prediction results of daily confirmed new COVID-19 cases in Africa for the period between 17/9/2020 and 16/10/2020 as predicted by the ARIMA (3,1,4) model, and the result demonstrates that the daily confirmed new cases trend in Africa as the continent may become stable or it will become stagnant (see Figure 2). However, this is the weighted average of the five African regions, and there is significant variation in the spread and trend of COVID-19 among African regions.

Unlike the cases in Africa, the prediction results of daily confirmed COVID-19 cases in East Africa as predicted by the ARIMA (3, 1, 3) model show an increasing trend in the spread of daily cases of the COVID-19 pandemic (as shown in Table 4 and Figure 3). Similarly, Table 5 presents the prediction results of daily confirmed COVID-19 cases in Northern Africa for the period between 17/9/2020 and 16/10/2020 as predicted by the ARIMA (1,1,4) model. The prediction results on the plot of actual confirmed cases with the prediction result of the ARIMA model depict the increasing trend and spread of COVID-19 in Northern Africa (see Figure 4).

The prediction results of daily confirmed new COVID-19 cases in West Africa and Central Africa for the next month were estimated by employing the ARIMA (2,1,3) and ARIMA (2,1,3) models, respectively (see Tables 6 and 7). The prediction results indicate a slight increasing trend in the number of daily new confirmed cases of COVID-19 in both West Africa and Central Africa (see Figures 5 and 6). On the other hand, Southern Africa is the most infected region in Africa, but Table 8 and Figure 7 confirmed that Southern Africa had an almost decreasing trend in the spread and daily confirmed cases of the COVID-19 pandemic, and this may be the major reason for the stable trend of the African daily confirmed COVID-19 cases as the increase in the daily confirmed cases in the North, East, West, and Central Africa is counterpoised by the decreasing trend of Southern Africa daily confirmed new COVID-19 cases.

Except for Southern Africa, all African regions show an increasing trend in the predicted daily COVID-19 new cases, and specifically East and Northern Africa show a significant increasing trend in the spread of the COVID-19 pandemic.

3.2.3. The Deviation between the Forecasted and Actual Data on the Spread of Covid-19

As shown in Table 9, the sign of the forecasted values of COVID-19 cases in Africa and the actual values of the daily confirmed COVID-19-19 cases are similar, which is a positive or an increasing trend. Both values portray the increasing trends of the COVID-19-19 pandemic. However, in all cases, the forecasted values are lower than the actual values, or the ARIMA model prediction results underestimate the spread of COVID-19-19 in Africa.

At the subregional level, there is a deviation between the actual and the ARIMA model forecasted values of the daily COVID-19 cases, and the deviation is both on the sign and the size of the actual and forecasted values of the daily COVID-19 cases. Tables 1014 present the deviation between the actual and forecasted values of the daily confirmed COVID-19 cases in East Africa, West Africa, Central Africa, North Africa, and Southern Africa for the period between 17/9/2020–16/10/2020, respectively.

However, we can tolerate the minor difference between the actual and the ARIMA model forecasted values of COVID-19 daily confirmed cases, given the uncertain nature of the current COVID-19 pandemic and the growing interconnected and complex world, which are ultimately demanding flexibility, robustness, and resilience to cope with unexpected future events and scenarios [27].

4. Conclusion

The pattern of daily confirmed new cases of the COVID-19 pandemic is forecasted by using the ARIMA model strategy to help the efforts of the World Health Organization, African Union, Africa Centers for Disease Control and Prevention (CDC Africa), African regional organizations (IGAD, ECOWAS, COMESA, ECCAS, SADC), and other stakeholders to control the further spread of coronavirus in Africa. The prediction of the number of infections would assist policymakers in a specific region to assess their current healthcare capacity and decide which measures need to be taken to curb and control the spread of COVID-19 [28].

Based on the available data, the study found that the best prediction models for forecasting the trend of daily new confirmed COVID-19 cases are ARIMA (3,1,4), ARIMA (3,1,3), ARIMA (2,1,3), ARIMA (2,1,3), ARIMA (1,1,4), and ARIMA (4,1,5). The prediction models are considered best for Africa as a continent, East Africa, West Africa, Central Africa, North Africa, and Southern Africa, respectively. By adopting this model, we were able to predict the daily confirmed new COVID-19 cases for the period between 17/9/2020 and 16/10/2020.

Given the uncertain nature of the current COVID-19 pandemic and the world’s growing interconnectedness and complexity, the proposed method’s experimental results match the actual COVID-19 daily case data to demonstrate the proposed model’s realistic nature. As a result, the ARIMA model’s forecasting will help in prioritizing and promoting regional cooperation in formulating policies to address the COVID-19 pandemic and employing effective strategies to control the virus by incorporating the experience of those regions that have seen a declining trend, such as Southern Africa.

Abbreviations

ARIMA:Autoregressive integrated moving average
AIC:Akaike information criteria
BIC:Bayesian information criterion
IGAD:Intergovernmental authority for development
ECOWAS:Economic community of West African states
COMESA:Common market for Eastern and Southern Africa
ECCAS:Economic community of Central African states
SADC:Southern African development community.

Data Availability

The daily confirmed COVID-19 cases were retrieved from the official website of our world in data. Retrieved from the: https://covid.ourworldindata.org/data/owid-covid-data.xlsx.

Conflicts of Interest

The authors declare that there are no conflicts of interest.

Authors’ Contributions

Both authors contributed equally to this paper.