Abstract
Forecasting the depth of groundwater in arid and semiarid areas is a great challenge because these areas are complex hydrogeological environments and the observational data are limited. To deal with this problem, the grey seasonal index model is proposed. The seasonal characteristics of time series were represented by indicators, and the grey model with fractional-order accumulation was employed to fit and forecast different periodic indicators and long-term trends, respectively. Then, the prediction results of the two were combined together to obtain the prediction results. To verify the model performance, the proposed model is applied to groundwater prediction in Yinchuan Plain. The results show that the fitting error of the proposed model is 2.08%, while for comparison, the fitting error of the grey model of data grouping and Holt–Winters model is 3.94% and 5%, respectively. In the same way, it is concluded that the fitting error of groundwater in Weining Plain by the proposed model is 2.26%. On the whole, the groundwater depth in Ningxia Plain including Yinchuan Plain and Weining Plain will increase further.
1. Introduction
1.1. Motivation
Groundwater is closely related to the development of human civilization. More than 2 billion people worldwide drink groundwater directly. This number is rising as rivers are polluted and depleted. The overuse of groundwater destroys the natural water cycle and causes problems such as ground collapse. China’s groundwater resources have been drastically reduced due to excessive exploitation of groundwater. At present, the accumulated deficit of groundwater deficit in North China is 180 billion cubic meters. In order to reduce the impact of groundwater depth decline, it is meaningful to achieve accurate groundwater prediction. Especially in dry areas, the reliance on groundwater is greater.
Ningxia Hui Autonomous Region (Ningxia) is located in Northwest China and is a typical dry area. Ningxia has the fewest water resources in China. The average annual runoff in Ningxia is 18.3 mm deep, only 1/15 of the national average, with annual rainfall from 150 mm to 600 mm. To solve the problem of farmland irrigation, the ancient people of Ningxia built water conservancy projects to divert the Yellow River. The water conservancy project is still in use today. In 2017, the ancient irrigation area in Ningxia was listed on the world irrigation engineering heritage list. Although irrigation projects have eased the drought on farmland, groundwater is still the main source of water resources in Ningxia. Moreover, with the continuous development of China’s western development strategy, Ningxia’s economy has developed rapidly, which has intensified the demand for water resources. According to Ningxia water resource bulletin, the groundwater consumption in Ningxia has been on the rise from 2014 to 2018. The content comes from Ningxia Hydrological Information Network.
In order to provide reference for groundwater management in Ningxia, the grey seasonal index model (GSIM (1,1)) was established to predict the groundwater depth of Ningxia Plain and compared with the results of Holt–Winters model (Holt–Winters) and grey model of data grouping (DGGM (1,1)).
This paper is organized as follows. The model is introduced in Section 2. Experimental results are given in Section 3. In Section 4, some conclusions and directions for future research are provided.
1.2. Literature Review
Under the background of water resource depletion, groundwater management has gradually developed into a discipline to be studied by scholars [1]. Accurate prediction of groundwater is important for groundwater management. There are many studies on prediction method [2]. The most direct way is through the surface vegetation to predict groundwater depth [3]. However, changes in groundwater are most affected by humans. Industrial water, agricultural water, and domestic water are considered as the main factors affecting groundwater [4]. Based on different influencing factors, stochastic time series and artificial neural network models are considered to have better effect in groundwater depth assessment [5]. Rivers are the main natural way to replenish groundwater and play a regulating role in the water cycle. A groundwater prediction model based river stage was built [6]. Based on groundwater dynamic data and related factors, the quantile regression method was used to predict groundwater depth [7]. Other factors affecting groundwater depth have also been investigated [8]. The development of science and technology provides convenience for the study of groundwater. The conceptual model, the Bayesian network, and artificial network models were applied to groundwater depth prediction [9]. Support vector machines have good performance in groundwater depth prediction [10]. The prediction effects of grey self-memory model, radial basis function network, and adaptive neuro fuzzy inference system models on groundwater were compared [11]. The mathematical models have their own characteristics, and different models are combined and applied to groundwater depth prediction [12]. Hybrid support vector machine regression and artificial neural network models were applied to groundwater depth prediction [13]. A hybrid model using entropy spectral analysis with the optimal input is proved to be effective in predicting groundwater depth [14]. The empirical mode decomposition (EMD) was combined with Elman neural network, and the coupling forecasting model was constructed to groundwater depth forecasting [15]. The EMD is also used in combination with phase space reconstruction, particle swarm optimization, and extreme learning machine [16].
Groundwater is a huge system. Due to the influence of geological conditions, hydrological environment, and other factors [17–20], the information about groundwater is ambiguous. Therefore, groundwater is a complex grey system. Moreover, the above model needs a lot of data to guarantee the accuracy of prediction [21]. Without a large number of training samples, the prediction accuracy of the model cannot reach the best [22]. The grey prediction model has high precision in small sample prediction [23, 24]. Grey models are widely used in air pollution, energy, electricity, biology, computer, water resources, and other fields [25–33]. The fractional-order grey prediction model improves the stability of the model from the perspective of fractional order [34, 35]. On this basis, the seasonal index is introduced into the fractional-order grey prediction model, and the processing ability of the model to nonlinear data is improved. Thus, PSO-GSIM (1,1) is used to predict the groundwater depth of Ningxia Plain.
2. Methodology
2.1. The Holt–Winters Model
The Holt–Winters model is a traditional statistical model and has been widely used since it was proposed by Winters in 1960. Scholars have conducted in-depth research on the model [36]. The Holt–Winters model is used to process data with seasonal characteristics. Time series data are divided into three parts, namely, residual data, trend data, and seasonal data. A prediction model is established based on the Holt–Winters model. There are two kinds of calculation methods for the Holt–Winters model, namely, addition and multiplication. The multiplication Holt–Winters model works better with solving the seasonal time series prediction problem. Therefore, the multiplication Holt–Winters model is used for comparison in this paper. The establishment process of the model is as follows.
For the time series , the multiplication Holt–Winters model iswhere is the seasonal length, such as four quarters, 12 months, and so on, is the seasonal index, and represents the tendency.where represents the average of same month in different years and is general average.
The Holt–Winters model has three parameters, and the application of the grey wolf optimization algorithm (GWO) improves the efficiency of multiparameter optimization. GWO is an optimization algorithm inspired by the hunting behavior of wolves. It is widely used for multiparameter model optimization. The behavior of a pack of wolves is represented digitally as follows:where represents the literation at this point, and are coefficient vectors, and represents the position vector of the prey, i.e. objective vector. is the position of one of the wolves in the pack. Coefficient vectors and are obtained by the following formula:where decreases linearly from 2 to 0 along with iteration and and are randomly generated from .
In GWO, there are three optimization parameters, , , and , corresponding to the three variables in the Holt–Winters model, respectively, where alpha is the dominant player in GWO. Suppose that the three wolves knew where their prey was; therefore, the first three best hunting sites were preserved. Other searcher is forced to update their locations on this basis. The other search location update formula is as follows:where is the position of other searcher, , , and are the search locations of alpha, beta, and delta, and is where the other searchers are in the next iteration. To speed up search efficiency, the maximum number of iterations is set to 30. The value range of parameters is . Detailed description and source codes of GWO can be found in the literature [37].
2.2. Grey Model of Data Grouping Method
The traditional grey prediction model is not applicable to the data of seasonal fluctuation. For the grey model to process the seasonal data, DGGM (1,1) model was proposed [38]. Two more steps are added to the traditional GM (1,1) model. The calculation process of DGGM (1,1) model is described below. Step 1. The time series data are divided into twelve monthly groups and are given by Step 2. GM (1,1) models are calculated based on grouped data separately. An accumulative series is obtained by accumulative calculation: where , and the mean series is The first-order differential equation of a single variable is used as the prediction model, namely, GM (1,1), and the grey differential equation is as follows: The corresponding whitening differential equation is where , and represent the time variable, grey development coefficient, and grey action quantity, respectively. Data matrix and data vector are given by The undetermined coefficient vector is , and the prediction equation is obtained: Step 3. Reductive treatment is conducted. Through a subtraction calculation, the predicted values of original series are obtained: Furthermore, the twelve groups of monthly predictions are combined into a new time series:
2.3. The GSIM (1,1) Model
The data varied substantially among different months, but the traditional grey prediction model is only suitable for time series with an exponential trend, and it is not capable of effectively predicting data with large fluctuations. Thus, in order to enhance the forecasting accuracy, the seasonal data are summarized into annual data in this study. The annual data are predicted by GM (1,1) with fractional-order accumulation (FGM (1,1)) [24]. The prediction results were calculated by seasonal index and annual data.
The time series data on the depth of groundwater are with a cycle of , for example, , and the modeling process for the GSIM (1,1) model is as follows. Step 1. The yearly data are , where . is the number of periods. Step 2. By using , the -order accumulation sequence is where . r is the fractional order, and the optimal value can be obtained through particle swarm optimization (PSO) algorithm. Step 3. The differential equation with one variable of the r-order accumulation sequence (i.e., the FGM (1,1) model) can be defined as where and represent grey development coefficient and grey action quantity, respectively. The solution to equation (14) is The parameters of the model are calculated by the least square method. The calculation process of , is shown in the following equation: where Step 4. and are entered in the time response function: and is the fitting value at time . Step 5. For , the restored sequence is where . Then, the restored value is Step 6. The seasonal index is calculated by using the average method: where Step 7. According to the calculation process in Step 2, the accumulation series is obtained. The accumulation of the seasonal index is no longer carried out in accordance with the time series, but data of the same month are combined into a group for the accumulation. The purpose of this is to reduce the dimension of the data so as to make full use of the grey model to process the small data. At the same time, it is also preparation for the prediction of the next cycle. The next calculation refers to Steps 3–5, and the forecasting value is obtained: Step 8. The fitting value of the groundwater depth is obtained. For , the solution formula is where and are the quotient and remainder of , respectively. Step 9. The mean absolute percentage error (MAPE) is used to test model accuracy, and the calculation process is as follows:
In the paper, PSO is applied to obtain the optimal value of the parameter of the GSIM (1,1) model. PSO has been widely used since it was proposed [39]. Compared with other optimization algorithms, PSO is simpler and easier to implement without too much parameter adjustment. To increase the search effect, the position and velocity of the particle need to be constantly updated. The updating process depends on the values Pbest and gbest, where Pbest represents the current optimal solution and gbest represents the current optimal solution for any particle. The updating process is shown in the following equations:where and represent the velocity and position of the particle at i, respectively, represents the inertia weight value which is generally set as 0.8 [40], and is a random function; the optimal value of r of most experiments is generally within the range of [0, 1]; to ensure the accuracy of the experiment, the value range of GSIM (1,1) parameter r is expanded to [0, 2.5]; therefore, represents the random number generated from [0, 2.5]. and are acceleration constants with equal values and set to 2 [41]. Suppose the number of individuals in the initial population is , the maximum number of iterations is 200. More detailed description of PSO can be found in literature [42].
3. Experimental Results
3.1. Study Area and Data
Ningxia spans 35°14′–39°14′N and 104°17′–109°39′E and includes five cities. The climatic types of Ningxia are temperate, continental, arid, and semiarid. Ningxia Plain is considered as the largest plain in Ningxia. The study region has an area of 66.400 km2 and is bordered by the Helan Mountains and Liupan Mountains to the northwestward and south, respectively. Ningxia is 1100–1200 meters above sea level and the terrain slopes from southwest to northeast. The Yellow River enters from the southwest of Ningxia and leaves from the northeast. Due to the different physiognomy types in northcentral Ningxia and south Ningxia, Ningxia Plain can be divided into Yinchuan Plain and Weining Plain (Figure 1). The recoverable groundwater accounts for 88% of the total groundwater in Ningxia. Ningxia Plain is surrounded by mountains, water flows to the plain, and the water table changes slowly and steadily, creating a predictable gradient of water [43].

The data are from the official website of the Ministry of Water Resources of the People’s Republic of China (https://www.mwr.gov.cn/sj/tjgb/dxsdtyb/). There are a total of 47 groundwater monitoring wells in Ningxia Plain. Yinchuan Plain contains three cities and 28 groundwater monitoring wells. Weining Plain contains one city and 11 groundwater monitoring wells (by the end of the 2019). The groundwater depth data for Yinchuan Plain and Weining Plain are the average result of multiple groundwater monitoring wells.
3.2. Forecasting the Groundwater Depth of Yinchuan Plain
The groundwater depth of Yinchuan Plain is predicted in this section. The depth of groundwater is one of the contents of hydrology. Groundwater is subject to changes in rainfall or in rivers. As we all know, changes in rainfall and rivers are cyclical and seasonal. In recent years, the seasonal variation of groundwater depth is more obvious under the influence of human production activities. For example, the original groundwater depth data of Yinchuan Plain are summarized in Figure 2. Figure 2 shows that the groundwater depth changes with time, and groundwater depth fluctuations are similar at the same time in different years. Therefore, the seasonal model is used to study the variation of groundwater depth.

3.2.1. The Calculation Results of the Holt–Winters Model
The groundwater depth data of Yinchuan Plain from 2014 to 2018 were fitted to three models. The models do not fit the data of the first period, so the data of the first period are omitted. The fitting results of the Holt–Winters model are listed in Table 1. The calculation process is completed by MATLAB 2016b. The GWO was used to find optimal parameters , and , and the optimization result of parameters is [0.51639, 0.05160, 0.00752]. The fitting error (MAPE) of the Holt–Winters model is 5.00%. It can be seen that the fitting accuracy of the traditional model is lower than 10%, and the fitting results are accepted. As the first cycle, the data of 2014 were omitted.
3.2.2. The Calculation Results of the DGGM (1,1) Model
The DGGM (1,1) model was applied to the data calculation on a monthly basis. The data grouping method is adopted to improve the fitting accuracy of the traditional GM (1,1) model, and the model can be applied to process seasonal fluctuation data. The fitting results of DGGM (1,1) model are listed in Table 2. MAPE is 3.94%. Compared with the Holt–Winters model, the new model has higher fitting accuracy. Moreover, there are no fitting results for the first period of the two models, increasing contrast between models.
The parameters generated by the DGGM (1,1) model in the fitting process are shown in Table 3. The time response function of each set of parameters can be obtained from equation (16).
3.2.3. The Calculation Results of the GSIM (1,1) Model
The seasonal index in this paper refers to the index calculated on a monthly basis. The seasonal index reflects the fluctuation of the data in the time series. Most models do not deal well with fluctuating data. For the purpose of stabilizing the sequence, scholars used seasonal factors to eliminate seasonal changes in time series. This method improves the ability of the model to process the fluctuation data. However, the time series that eliminate seasonal effects may lose a lot of information. Its accuracy is also difficult to improve. To reduce information loss in the sequence, the seasonal index was extracted and studied separately. The results of yearly data are listed in Table 4. The MAPE is 0.38%, and the fitting accuracy is high.
Then, the seasonal index of monthly data is fitted. During the fitting process, the GSIM (1,1) model is used sometimes, so the multiple sets of parameters are generated. Each set of parameters in Table 5 can be substituted into equation (20) to obtain the corresponding time response function.
The fitting values of the seasonal indices are listed in Table 6. The average MAPE from 2015 to 2018 is 2.01%. Groundwater is consumed more in February, March, April, May, October, and November. Affected by dry and wet seasons, the groundwater depth varies seasonally throughout the year. The GSIM (1,1) model can not only predict accurately but also quantify the periodic variation of data within the time span. With the allowable error, it is beneficial to understand the variation law of groundwater depth.
The groundwater depth fitting results from 2014 to 2018 are shown in Table 7. The MAPE of fitting results is 2.08%. By calculating the seasonal index and annual fitting value of groundwater depth, the monthly fitting value is obtained. As the trend parameter, the annual fitting value controls the change direction of groundwater. The seasonal index breaks down the units of time from years into months.
In the calculation process of the above three models, the GSIM (1,1) model extracts the most information, and the seasonal variation of the data is shown in the prediction process. The prediction accuracy of the GSIM (1,1) model is also higher than that of the GWO-Holt–Winters model and the DGGM (1,1) model. The MAPE of the GSIM (1,1) model is 2.08%, the MAPE of the Holt–Winters and the DGGM (1,1) models is 5% and 3.94%, respectively (Table 8). This shows that the GSIM (1,1) model is more practical for seasonal fluctuation data.
The fitting process of the three models is visualized in Figure 3. The consumption of groundwater in Ningxia keeps increasing, and seasonal fluctuation trend is obvious. The fitting line of the GSIM (1,1) model is close to the original data line. The three models did not fit the data of the first period. It can be seen that the original calculation of the three models has the same characteristics, and the three models are comparable.

3.2.4. Prediction of Groundwater Depth in Yinchuan Plain
In the fitting results of groundwater depth in Yinchuan Plain, the GSIM (1,1) model has the smallest error. Therefore, the GSIM (1,1) model is selected to predict the groundwater depth of Yinchuan Plain from 2019 to 2020 (Table 9). From the prediction results, groundwater consumption in Yinchuan Plain will increase further. The seasonal law of groundwater depth remains unchanged. Ningxia is inland in Northwest China. The inland areas are short of water. Yinchuan Plain is the most economically dynamic area in Ningxia. Continued economic growth has increased water consumption. In addition, the tradition farmland irrigation model consumes most of the water resources. Strengthening water resources management is crucial to the economic development and agricultural production of Yinchuan Plain.
The GSIM (1,1) model is the best among three models in Yinchuan Plain. This model can also predict the groundwater depth of Weining Plain. These results in Weining Plain and Yinchuan Plain help us to know the groundwater situation in the whole Ningxia Plain. The prediction of groundwater depth in Weining Plain is described in the next section.
3.3. Forecasting the Groundwater Depth of Weining Plain
Yinchuan, the capital of Ningxia, is located on the Yinchuan Plain. Zhongwei city is located on the Weining Plain. The Yellow River enters Ningxia from Zhongwei city. From a geographical point of view, Weining Plain has more water resources than Yinchuan Plain. The Yellow River flows from Weining Plain to Yinchuan Plain. Weining Plain needs to coordinate the local water resources to support the water shortage of Yinchuan Plain. Therefore, the accurate prediction of groundwater in Weining Plain is conductive not only to the regulation of local water resources but also to the diversion of water to Yinchuan Plain.
The annual groundwater depth in Weining Plain changes slowly. The annual data of groundwater depth were fitted by the FGM (1,1) model. The fitting results are listed in Table 10, and the MAPE of fitting value is 0.10% and has high precision. Since 2016, the groundwater depth of Weining Plain has gradually increased. It shows that water consumption in Weining Plain has increased in recent years.
The variation law of groundwater consumption in Weining Plain is known by seasonal index. The parameters generated by the GSIM (1,1) model during the fitting process are listed in Table 11. The fitting results of the seasonal index are listed in Table 12, and the MAPE of fitting value is 2.23%. The groundwater consumption in Weining Plain is large from January to May and November to December. The seasonal index in December showed a downward trend. It indicates that the groundwater consumption in December was reduced. But the seasonal index in October was above 1 several times. Changes in groundwater consumption are also dynamic, and it is important to predict the next year based on the index of same season.
The MAPE of the GSIM (1,1) model for the groundwater depth of Weining Plain is marked in Figure 4. The error is obtained by averaging the monthly data fitting error of a year. The original value of groundwater depth in Weining Plain is represented by the line in Figure 4. The seasonal variation of groundwater depth is obvious. The largest error occurred in 2016, which is 5.6%. The overall level of error is low.

From the fitting value of the year and the seasonal index, the fitting value of the month from 2014 to 2018 is calculated (Table 13). The prediction section in Table 13 is visualized in Figure 5. It can be seen that the predicted value has a small increase compared with the original data. The groundwater depth in 2019 to 2020 will still fluctuate seasonally. The groundwater consumption in the first half is more than the second half. Government departments should focus on water management in the first half of the year.

The groundwater depth of Yinchuan Plain and Weining Plain is calculated and analyzed. It shows that the groundwater depth in Ningxia Plain is increasing year by year. Maintaining groundwater stability is important in dry areas. Water resources are coordinated between the two plains by the Yellow River. Weining Plain in the upper reaches of the Yellow River can allocate more water resources to Yinchuan Plain. The concept of coordinated development is also applied to water resources management.
4. Conclusions
In this paper, the GSIM (1,1) model, the Holt–Winters model, and the DGGM (1,1) model were established and applied to groundwater depth prediction in Ningxia Plain. By comparison, the GSIM (1,1) model is proved to be an effective one, and its prediction accuracy is higher than that of other models. The following conclusions were obtained.
The GSIM (1,1) model is more suitable for dealing with seasonal fluctuation data. In the process of calculation, the seasonal variation of data is presented. The GSIM (1,1) model provides a new choice for studying the data with the characteristics of seasonal variation.
GSIM (1,1) retains the characteristics of the traditional grey model for processing small sample data and has high prediction accuracy. Seasonal variations in the data were quantified and used for the first time to predict research. Data for the same month for different years are used as the object of the GSIM (1,1) model, and the processing method of traditional series data is changed.
In the paper, the GSIM (1,1) model parameter r is reserved for two decimal places. During the experiment, we found that the more the bits reserved for parameter r, the higher the accuracy of the model. Scholars can adjust parameter r according to the experimental needs to meet higher accuracy requirements.
Data Availability
The data used to support the findings of this study are included within the article.
Conflicts of Interest
The authors declare that they have no conflicts of interest.
Acknowledgments
This study was supported by the National Natural Science Foundation of China (U20A20316).