Abstract

Carbon price fluctuation is affected by both internal market mechanisms and the heterogeneous environment. Moreover, it is a complex dynamic evolution process. This paper focuses on carbon price fluctuation trend prediction. In order to promote the accuracy of the forecasting model, this paper proposes the idea of integrating network topology information into carbon price data; that is, carbon price data are mapped into a complex network through a visibility graph algorithm, and the network topology information is extracted. The extracted network topology structure information is used to reconstruct the data, which are used to train the model parameters, thus improving the prediction accuracy of the model. Five prediction models are selected as the benchmark model, and the price data of the EU and seven pilot carbon markets in China from June 19, 2014, to October 9, 2020, are chosen as the sample for empirical analysis. The research finds that the integration of network topology information can significantly improve the price trend prediction of the five benchmark models for the EU carbon market. However, there are great differences in the accuracy improvement effects of China’s seven pilot carbon market price forecasts. Moreover, the forecasting accuracy of the four carbon markets (i.e., Guangdong, Chongqing, Tianjin, and Shenzhen) has improved slightly, but the prediction accuracy of the carbon price trend in Beijing, Shanghai, and Hubei has not improved. We analyze the reasons leading to this result and offer suggestions to improve China’s pilot carbon market.

1. Introduction

At present, global climate change and dioxide emission reduction become hot topics of global concern. Effectively reducing gas emissions and curbing the trend of worldwide warming have become common challenges for all countries within the world. The carbon emission commerce market originates from the theory of environmental property rights and the theory of ecological modernization, which advocate for market means to solve environmental problems. It is a good policy tool to regulate and reduce greenhouse gas emissions by a market mechanism and an important institutional innovation to push green and low-carbon transformation of economic development mode. Since the Kyoto Protocol came into force, the international carbon market has seen rapid growth. By the end of 2018, there have been 20 carbon markets operating around the world, covering 8% of the world’s gas emissions, and the countries and regions where the carbon markets are located account for 37% of the global economy [1]. As the world’s largest emitter and the country with the greatest potential to reduce emissions, China has a path of emission reduction that has drawn close attention from the international community. In its own unique way, it has demonstrated its determination and actions to actively tackle climate change. At the Paris Climate Change Conference 2015, the Chinese government made a solemn commitment to reduce carbon dioxide intensity per unit of GDP by 60–65% by 2030 from the 2005 level, hit peak carbon emissions around 2030, and raise the share of nonfossil energy in primary energy consumption to 20%. In 2020, at the General Debate of the 75th Session of the United Nations General Assembly, President Xi Jinping said that the country should strive to achieve carbon neutrality by 2060, which means that China must reduce its carbon emissions from 16 billion tons per year to almost no emissions during the 40-year period from 2020 to 2060. At present, China is steadily promoting the construction of a national carbon emission trading market. As of November 15, 2019, the cumulative spot trading volume of the carbon market in the pilot areas had reached 364 million tons, and the trading volume had reached 7.985 billion yuan. The total carbon emission and intensity within the pilot range had both reduced, and the low-carbon awareness of enterprises in the pilot market had been continuously improving [2].

The aim of the carbon market is to guide enterprises to create emission-reduction selections through the mechanism of supply and demand to form an effective carbon commerce price. Carbon price fluctuations are influenced not only by internal market mechanisms but also by heterogeneous environments [3]. Carbon price fluctuation is a complex dynamic evolution process. Accurate prediction of carbon price is very important for grasping the fluctuation law of carbon price and avoiding investment risk [4]. Existing carbon price or energy price forecasting models can be roughly divided into three categories. The first is the forecasting model using econometric methods. This kind of model can be further divided into two types. The first is the structural model, such as the error correction model (ECM), vector error correction model (VECM), and vector autoregressive (VAR) model. Such models process data by means of linear regression, which not only includes historical carbon price data but also can include corresponding explanatory variables [39]. The second is the time series prediction models, including the autoregressive integrated moving average (ARIMA) model, generalized autoregressive conditional heteroskedasticity (GARCH) model, and their extended models. Such models take no account of other factors and rely only on historical price for forecasts [1013]. Forecasting models based on econometrics have the advantage of capturing the time-varying fluctuation characteristics of carbon price, but they cannot accurately describe the nonlinear characteristics in the process of carbon price fluctuation. The second category is the prediction model based on machine learning, such as the neural network, random forest, support vector machine, and neurofuzzy control model [1418]. Compared with the first category model, the second category model has a stronger ability to deal with nonlinear data. However, this kind of model often contains a large number of parameters, and it is prone to defects such as overfitting or poor convergence in the process of use. The third category is a combination prediction model based on multiple methods, such as an integrated model of group method of data handling (GMDH), particle swarm optimization (PSO), and least squares support vector machines (LSSVM), that is, GMDH-PSO-LSSVM [19], the integrated model of empirical mode decomposition (EMD), particle swarm optimization (PSO), and support vector machines (SVM), that is, EMD-PSO-SVM [20], the hybrid ARIMA and LSSVM methodology [21], the hybrid approach with exogenous variables [22], the combination of the model based on phase space reconstruction (PSR) and the least squares support vector regression (LSSVR) model [23], the multiscale nonlinear ensemble leaning paradigm [24], the variational mode decomposition and optimal combined model [25], the model based on secondary decomposition algorithm and optimized back propagation neural network [26], the particle swarm optimization (PSO) and radial basis function (RBF) algorithm model [27], the prediction model based on extremum point symmetric mode decomposition, the extreme learning machine and Grey Wolf optimization algorithm [28], and the hybrid method based on empirical wavelet transform (EWT) and Gated Recursive Unit Neural Network (GRU) [29]. This kind of model effectively integrates and utilizes the advantages of the single prediction model, and its prediction accuracy is significantly higher than that of the single prediction model. However, the structure of this kind of model is more complex than the single prediction model, and a large number of model parameters need to be determined, which brings a lot of inconvenience to the practical application. Some typical prediction models are shown in Table 1.

The carbon market has its own unique market characteristics, such as great differences in the price and policy mechanism between the EU and China’s pilot carbon markets. The factors affecting the carbon market are numerous, and the structures are complex. Besides, within the background of the current carbon market link, various uncertainties inside and outside the market have become important driving factors affecting the price trend of the carbon market. The price mechanism and fundamental characteristics of the carbon market have changed, and the traditional research framework cannot explain the driving mechanism and characteristics of carbon price fluctuation under uncertain conditions. The existing carbon price prediction models have not fully explored the historical carbon price fluctuation information or the influence of financial high-frequency data, network information, and market uncertainties on carbon price fluctuation. There are still several issues worth discussing within the field of carbon price prediction. Recently, the technology of building complex networks for data has attracted extensive attention from scholars at home and abroad, and a variety of methods to map nonlinear data into complex networks have been proposed [3033]. In the field of energy economy research, complex network construction technology was used to construct an energy price network based on the energy price data. The fluctuation characteristics of price were studied, and the fluctuation mechanism of price and the dynamic characteristics of the energy price network were revealed [34, 35]. Based on the data of energy import and export trade, the energy trade network was constructed to reveal the interaction pattern and the evolution characteristics of trade relations between energy import and export countries [36]. Based on the data between the energy market, carbon market, and financial market, the information transmission network among multiple markets was constructed, and the volatility spillover effect of the energy market was studied [37]. The above research used the network science to study the energy economic system and to obtain many valuable results. The rise and development of complex network science give an alternative perspective and methodology for energy economic data mining and carbon price trend prediction.

In fact, higher prediction accuracy depends on the suitability of the prediction model and the quality of the data used to train the model. Therefore, reconstructing the collected data and using the reconstructed data to train the prediction model are effective means to improve the prediction accuracy. Therefore, this paper puts forward the idea of integrating network topology information into carbon price data. Compared with previous research works, the contribution and innovation of this paper lie in the following aspects: (1) With respect to the research objects, previous studies mostly focused on the real value of carbon price, while this paper focuses on the fluctuation trend, because in many cases, such as policy-making and scenario analysis, it is enough to know only the information about the rise and fall trend of carbon price. Therefore, it is of great practical significance to predict the rise and fall trend of carbon price. (2) From the perspective of improving the prediction algorithm of carbon price fluctuation trend, a new prediction paradigm is proposed based on complex network and the classical discriminant analysis algorithm. This paper proposes the idea of integrating network topology information into carbon price data. The classical discriminant analysis algorithm is trained with the data set, and then the carbon price trend prediction model is constructed. The degree of improvement of the prediction accuracy of the benchmark model after incorporating the topological structure information of the carbon price network is quantitatively analyzed, which was not found in previous studies. (3) In terms of sample data selection, previous studies mostly analyzed the price data of a specific carbon market, but this paper selects the price data of eight carbon markets for comparative analysis and discusses the effect of different carbon market price data on the prediction accuracy.

The structure of the rest of this paper is as follows. The second part details the methods used in this paper. The third part describes how we build the carbon price visibility graph network and analyze the network structure. In the fourth part, we make a comparative analysis of the predicted results. In the fifth part, we provide the conclusion.

2. Methodology

2.1. Visibility Graph Algorithm

A visibility graph is a method to map time series into a complex network. As proposed by Lacasa et al., the basic idea is to treat each data point of time series as a network node and to set up an edge between nodes if the “visual condition” is satisfied. The specific mathematical description is as follows: Let be a time series containing N data, and there is an edge between node and node if and only if the following visual criteria are met:

According to the visibility graph algorithm, a time series containing N data can be mapped into a network containing N nodes. The principle of a visibility graph is simple, and the visibility graph network obtained has connectivity properties such as undirected affine invariance and limited information loss. It can also effectively distinguish a random sequence from a chaotic sequence. The complexity of the visibility graph algorithm is ; thus, it needs a lot of computation time in practical application, which restricts its application in practice. To reduce the time complexity of the algorithm, Luque et al. (2010) modified the visual criterion as follows [38]:

This algorithm is called the horizontal visibility graph algorithm (HVG). This algorithm not only maintains the related properties of the visibility graph algorithm but also carries out theoretical analysis on specific time series. The complexity of the algorithm is , which greatly reduces the computational complexity compared with the visibility graph algorithm and has a higher application value.

2.2. Network Topology Index

Many concepts and methods have been proposed to characterize the statistical characteristics of complex network structures, such as degree distribution, average path length, and clustering coefficient. In the following, we only introduce the topological indicators used in the reconstruction of data.

2.2.1. Clustering Coefficient

The clustering coefficient is a coefficient that measures the degree of the cluster of nodes in the network [39]. The local clustering coefficient of node is expressed aswhere represents the number of connecting edges between nodes in the neighborhood of node , is the degree of node , and .

2.2.2. Network Assortativity Coefficient

In the nonzero graph , and represent the degree of two nodes on some edge , and is the number of edges in graph . Then, the assortativity coefficient of the network iswhere the assortativity coefficient is a degree-based Pearson correlation coefficient. When is positive, nodes with a high degree of connection tend to other nodes with a high degree of connection. When is negative, nodes with a high degree tend to nodes with a low degree of connection [40].

2.3. Classification Algorithm

Classification is an important research area in data mining, machine learning, and pattern recognition. By analyzing the data of training samples of known categories, one can find classification rules and predict categories of new data. The classification algorithms involved in this paper include linear discriminant analysis (LDA), naive Bayes (NB), K-nearest neighbor (KNN), random forest (RF), and support vector machine (SVM).

2.3.1. LDA

LDA is a dimension-reduction technology with supervised learning. Its idea is to project data that are not easy to classify in a certain direction so that intraclass variance can be minimized and interclass variance can be maximized after the projection. That is, after the projection, the projection points of each kind of data are expected to be as close as possible, while the distance between the data category centers of different categories is as large as possible [41].

Let be the number of data samples, be the number of classes, be the number of class samples, and be the sample of class . Then, the linear discriminant analysis can be expressed by the following optimization function:where V is the projection matrix, is the divergence matrix between classes, and is the divergence matrix within the following class:where .

2.3.2. Naive Bayes (NB) Model

Although LDA is simple and intuitive, it does not consider that the observation value of each category is different and the opportunity of each category is different, nor does it take into account the difference of losses caused by misjudgment. Naive Bayes discrimination can overcome the above shortcomings, and its calculation process is as follows. The possible estimate of each sample that may belong to a certain population (category) is called the “prior probability,” denoted as . The value of the prior probability can be obtained from experience or can be estimated using the percentage of each group of samples in the total sample [42]. The score of each sample can be calculated according to the discriminant function, and the conditional probability of the discriminant score under the condition that it belongs to the category is . The probability of the sample being judged as a certain category according to the discriminant function is called the postprobability. According to the Bayesian formula, the posterior probability can be calculated as

Each sample is classified according to the posterior probability of being assigned to a category.

In summary, the basic idea of Bayesian discrimination is as follows. For each sample, the score of the discriminant function is first calculated. Then, according to the prior probability, , and the conditional probability, , of the discriminant score, , the posterior probability of each category is calculated for the sample to be judged. The category in which the posterior probability is the largest is judged, and the sample is judged as the category.

2.3.3. K-Nearest Neighbor (KNN)

The basic idea of KNN is as follows. If most of the k-nearest neighbors or most similar samples of a sample belong to a category, then the sample also belongs to that category. In this method, the nearest neighbor samples selected by a sample are all samples that have been correctly classified. This method of discrimination or classification is one of the simplest methods in data mining classification technology [43].

RF is an advanced algorithm of the decision tree algorithm. Its advantage is that it does not produce overfitting. Its theoretical basis is the law of large numbers, which is simply described as follows. Let be the sample vector, be the label vector of correct classification, and RF consist of a series of single plant classifiers , where is an independent identically distributed random variable. Then, the marginal function is defined aswhere is the indicator function and is the average value function. The larger the marginal function, the higher the confidence of classification.

2.3.4. Support Vector Machine (SVM)

SVM is a linear classifier with the largest interval in the feature space. Its learning strategy is to maximize the interval, and it is ultimately transformed into a convex quadratic programming problem to solve [44]. Based on the existing training set,where , is the input space, each point in the input space is composed of attribute features, and .

Find real valued function on , so that the classification function can be used to infer the corresponding value of for any pattern . The mathematical structure expression of vector machine optimization problem is as follows:

The local duality theorem is needed to solve the optimization model shown in (10). The solution process is as follows. First, (10) is expressed as follows:

If is the feature mapping, (10) can also be expressed as

If soft separation is considered and the relaxation variable is introduced, the mathematical structure of the (10) support vector machine optimization problem can be modified as

2.4. Prediction Algorithm Integrating Network Topology Information

The carbon price trend prediction model integrating network topology information proposed in this paper includes three modules:

Step 1. Construct data sets. Let be carbon price series; then the volatility series can be calculated bywhere , . The following data matrix can be got from the volatility series,where is the length of the sliding window, is the step, and is the number of the sliding windows. The independent variable data set is , and the dependent variable data set is

Step 2. Construct the visibility graph network and extract carbon price volatility information. We transform each row of the independent variable data set into a complex network using a visibility graph algorithm and get visibility graph networks, denoted as . The mean clustering coefficientand the network assortativity coefficientcan be obtained by using (3 and 4). Then, the new independent variable data set is

Step 3. Divide training data and test data for prediction. The training data and test data are divided into independent variable data sets , and corresponding label sets are obtained from data set .
The algorithm flowchart is shown in Figure 1.

2.5. Precision Index

The carbon price trend prediction problem can be transformed into a binary classification problem, and based on the prediction result, we can get the following confusion matrix:

Predictive valueReal valuePositiveNegative
PositiveTP (true positive)FN (false negative)
NegativeFP (false positive)TN (true negative)
2.5.1. Accuracy

The percentage of the total number of correctly predicted samples in the total number of samples, abbreviated as AC, is calculated by the following formula:

2.5.2. Precision

This index is an evaluation index for the prediction results. Among the results predicted by the model as positive samples, the percentage of the truly positive samples, abbreviated as PR, is calculated by the following formula:

2.5.3. Recall

This index is an evaluation index for the original sample. Among the actual positive samples, the percentage that is predicted to be positive samples is abbreviated as RE. The calculation formula is as follows:

2.5.4. F1 Score

This index is the harmonic mean of PR and RE, taking into account the advantages of both. The calculation formula is as follows:

3. Data Selection and Network Feature Analysis

The carbon price data of the EU carbon market (EUETS) and seven pilot carbon markets of China from June 19, 2014, to October 9, 2020, that is, Beijing (BJ), Guangdong (GD), Chongqing (CQ), Shanghai (SH), Tianjin (TJ), Shenzhen (SZ), and Hubei (HB), are selected as sample data. For the convenience of comparative analysis, we select the time points from June 19, 2014, to October 9, 2020, when data are available in all the eight carbon markets. Therefore, 1525 data samples are selected for each carbon market. The selected sample data of carbon price are shown in Figure 2(a), the statistical image of the sample is shown in Figure 2(b), the kernel density distribution image of the sample is shown in Figure 2(c), and the statistical relationship diagram among the eight sample data of carbon market price is shown in Figure 2(d). The calculation results of descriptive statistics of selected sample data are shown in Table 2 (upper part).

As Figures 2(a) and 2(b) show, the carbon price in China’s pilot carbon market is low. The average carbon price in the EU carbon market is 12.8107 EUR/t, and the highest carbon price is 30.44 EUR/t. However, for China’s pilot carbon market, Beijing’s carbon market has the highest average carbon price, at 59.5397 CNY/t, while TJ carbon market has the lowest average carbon price, at only 16.8758 CNY/t. According to the coefficient of variation, shown in Table 2, the coefficient of variation of the EU carbon market is greater than the average of China’s pilot carbon market, indicating that the fluctuation range of the carbon price in the EU carbon market is higher than the average fluctuation range of China’s pilot carbon markets. According to the measurement of the distribution symmetry of the data in Figures 2(c) and 2(d), the EU carbon price data present a right-skewed distribution, indicating that, for the EU carbon market price, the distribution of the carbon price above the mean is more dispersed. Except for Shanghai, China’s six pilot carbon markets present a right-skewed distribution. From the perspective of flat data distribution, the EU, Beijing, Chongqing, Shanghai, Shenzhen, and Tianjin carbon markets show flat distribution, while the Guangdong and Hubei carbon market price data show a sharp distribution. As the correlation calculated, there is a significant positive correlation between the carbon price in the EU carbon market and that in Beijing, Hubei, and Shanghai, with Pearson correlation coefficients of 0.8025, 0.7175, and 0.5535, respectively. Moreover, there is a negative correlation between the carbon price in the EU carbon market and that in Shenzhen, with a correlation coefficient of −0.4862.

The eight carbon market price data samples are mapped into a complex network using the visibility graph algorithm, and the corresponding network structure and degree distribution are obtained, as shown in Figure 3. The corresponding network topology index calculation results are shown in Table 2 (the bottom part).

As shown in Figure 3 and Table 2, from the perspective of network topology, the price networks of the EU carbon market and the seven China’s pilot carbon markets both have a large degree and high clustering coefficient, and the node degree distribution follows power law distribution. However, the power index is different. The power law index of the EU carbon market is obviously higher than the average value of China’s carbon market, indicating that the heterogeneity of the EU carbon market price network is higher than that of China’s carbon market price network. From the perspective of network assortativity, the EU carbon market price network assortativity coefficient is greater than 0, indicating that the network is in assortative, while China’s seven carbon markets are on the contrary, indicating that the price network is an assortativity network. Thus, the EU carbon market has some similarities with China’s seven pilot carbon markets in terms of statistical characteristics and network topology, but there are significant differences in specific indicators such as the assortativity coefficient. These differences are mainly caused by differences in policies, systems, distribution patterns, and ranges of industries covered by each carbon market.

4. Model Construction and the Results Analysis

4.1. Data Reconstruction

We first group the carbon price data of each carbon market at N = 1525 time points. We set the length of the sliding window as L and the sliding step size as 1. Then, the price data of 1✕N dimension of each carbon market are transformed into the price data of (N-L+1)✕L dimension, the price data of the first L-1 day of each carbon market are taken as the independent variable data set X, and the price of the Lth day is taken as the dependent variable Y. In this case, the price of the Lth day can be predicted by using the data of the previous L-1 days. To integrate network topology information into independent variables, we use the visibility graph algorithm to convert the price data of the first L-1 days of each carbon market into a complex network, calculate the average clustering coefficient and assortativity coefficient of each network, and add the calculated results to the independent variable data set X. Then, the dimension of the newly constructed data set X is (N-L+1) ✕ (L+1), and we use this data set to predict the price trend of the carbon market. To test the robustness of the model, we randomly divide the reconstructed data set into a training set and test set. The sample ratio of the training set and test set is set at 9 : 1. To illustrate the overall prediction effect of the model, we make several predictions on the test set to calculate the average value of the AC, PR, RE, and F1 score. The numerical simulation software used in this work is Python.

4.2. Analysis of the Forecast Results of the EU Carbon Market

According to the carbon price data of the EU ETS, we use the model to predict the fluctuation trend. We randomly divide the training set and the test set, train the model parameters on the training set, make predictions on the test set by using the trained model, and repeat calculations of 500, 1000, …, 20,000 times. The evolution image of the prediction accuracy index (AC and F1), along with the calculation times, is obtained, as shown in Figure 4.

As Figures 4(a)4(e) show, the prediction accuracy of the five benchmark prediction models, LDA, KNN, NB, RF, and SVM, and the corresponding CN-LDA, CN-KNN, CN-NB, CN-RF, and CN-SVM, all tend to be stable with the increase in iteration times. The prediction models incorporating the network topological information are superior to the corresponding benchmark models in terms of prediction accuracy, precision, recall, and F1 value.

Next, we calculate the average prediction accuracy of various models under different iterations, and the results are shown in Table 3.

We analyze the accuracy and F1 value of the model prediction. From Figure 4(f) and Table 3, we find that the average prediction accuracies of the five benchmark prediction models are 0.53822, 0.49710, 0.53325, 0.52835, and 0.52342, and the variation coefficients are 0.00092, 0.00118, 0.00087, 0.00092, and 0.00073. Moreover, the average prediction accuracies of CN-LDA, CN-KNN, CN-NB, CN-RF, and CN-SVM are 0.54253, 0.52877, 0.53618, 0.52952, and 0.52905, respectively, and the coefficients of variation were 0.00089, 0.00102, 0.00068, 0.00089, and 0.00072, respectively. The average predicted F1 values of the five benchmark prediction models are 0.60250, 0.53035, 0.55835, 0.58243, and 0.68631, and the variation coefficients are 0.00080, 0.00091, 0.00104, 0.00094, and 0.00049. Furthermore, the average predicted F1 values of the corresponding CN-LDA, CN-KNN, CN-NB, CN-RF, and CN-SVM prediction models are 0.61668, 0.56807, 0.57451, 0.58434, and 0.69200, respectively. The coefficients of variation are 0.00077, 0.00088, 0.00089, 0.00092, and 0.00046. Thus, the prediction accuracy of the CN-LDA model is improved by 0.79952%, and the F1 value is improved by 2.35392% compared with the LDA model. Compared with the KNN model, the prediction accuracy of the CN-KNN model is increased by 6.37085%, and the F1 value is increased by 7.11328%. Compared with the NB model, the prediction accuracy of the CN-NB model is increased by 0.54876%, and the F1 value is increased by 2.89425%. The prediction accuracy of the CN-RF model is 0.22106% higher than that of the RF model, and the F1 value is 0.32818% higher than that of the RF model. Moreover, compared with the SVM model, the prediction accuracy of the CN-SVM model is increased by 1.07535%, and the F1 value is increased by 0.82907%. Furthermore, from the results of the coefficient of variation calculation, it can be seen that the stability of the prediction results of the CN-LDA, CN-KNN, CN-NB, CN-RF, and CN-SVM prediction models is better than that of the corresponding benchmark models.

In conclusion, for the prediction of the EU carbon market price trend, the prediction model incorporating the network topology information can effectively improve the prediction effect of the benchmark model. Next, we compare the prediction effects of the CN-LDA, CN-KNN, CN-NB, CN-RF, and CN-SVM prediction models. From the perspective of prediction accuracy, the order of improvement of prediction accuracy from high to low after integrating network topology information is as follows: CN-KNN > CN-SVM > CN-LDA > CN-NB > CN-RF. The prediction accuracy of the five prediction models from high to low is CN-LDA > CN-NB > CN-RF > CN-SVM > CN-KNN. From the perspective of the F1 value of prediction accuracy, the order of improvement of the F1 value of prediction accuracy from high to low is as follows: CN-KNN > CN-NB > CN-LDA > CN-SVM > CN-RF. The F1 value of the five prediction models from high to low order is as follows: CN-SVM > CN-LDA > CN-RF > CN-NB > CN-KNN. Thus, after integrating network topology information, the prediction accuracy and F1 value of the KNN prediction model are improved most significantly.

4.3. Analysis of the Forecast Results of China’s Pilot Carbon Market

The price fluctuation trend of seven pilot carbon markets in China is predicted by using five benchmark forecasting models and the forecasting model incorporating network topology information. The prediction accuracy, precision, recall, and F1 value of each prediction model are calculated, and the results are shown in Figures 5(a)5(g). Comparison results of the average precision of carbon price prediction in seven pilot carbon markets are shown in Table 4.

As Figure 5(a) shows, for Beijing’s carbon market, after integrating the network topology information, only the forecasting accuracy of CN-LDA and the forecasting accuracy, precision, and F1 score of CN-NB are improved compared with the benchmark models LDA and NB. As Figure 5(b) illustrates, for Guangdong’s carbon market, after integrating the network topology information, only the forecasting precision of CN-KNN and the forecasting accuracy and forecasting precision of CN-RF and CN-SVM are improved compared with the benchmark models KNN, RF, and SVM. Moreover, as Figure 5(c) shows, for Chongqing’s carbon market, after integrating the network topology information, the prediction effects (including prediction accuracy, precision, recall, and F1 score) of the CN-KNN, CN-RF, and CN-SVM models are significantly improved, and the forecasting precision of CN-NB is improved compared with the benchmark model. However, the prediction effects of CN-LDA are not improved compared with the benchmark model. From Figure 5(d), for Shanghai’s carbon market, after integrating the network topology information, the prediction accuracy of CN-RF and the prediction precision of CN-NB are improved compared with the benchmark model. However, other forecasting models fail to improve their performance. As shown in Figure 5(e), for Tianjin’s carbon market, after integrating the network topology information, the prediction effects (including prediction accuracy, precision, recall, and F1 score) of the CN-NB, CN-RF, and CN-SVM are significantly improved compared with the benchmark model, while the CN-LDA and CN-KNN fail to improve their performance. What needs to be pointed out here is that when the benchmark NB prediction model is used to predict the trend of carbon price fluctuation in Tianjin’s carbon market, the prediction accuracy, recall, and F1 value are all very small, at 0.24492, 0.15168, and 0.25402, respectively. This situation indicates that the NB model is invalid in predicting the trend of carbon price fluctuation in Tianjin’s carbon market at this time. However, the CN-NB prediction model obtained by integrating the network topology information significantly improves the prediction accuracy of the NB prediction model, enabling the accuracy, recall rate, and F1 value to reach 0.56708, 0.52440, and 0.67349, respectively. This situation also indicates that the previously invalid prediction model can become effective after integrating the network topology information. As shown in Figure 5(f), for Shenzhen’s carbon market, after integrating the network topology information, the prediction effects (including prediction accuracy, precision, recall, and F1 score) of the CN-LDA, CN-KNN, and CN-RF are significantly improved compared with the benchmark model, while the CN-NB and CN-SVM fail to improve their performance. As Figure 5(g) shows, for Hubei’s carbon market, after integrating the network topology information, all the prediction models fail to improve their performance. In summary, for the price trend prediction of seven domestic pilot carbon markets, the prediction model integrated with network topology cannot always effectively improve the prediction accuracy of the benchmark model.

In the following, we compare and analyze the performance of different forecasting models in seven pilot carbon market price forecasts in China from the perspective of the forecasting effect. We calculate the average prediction accuracy of each prediction model and obtain the following results.

From Table 4, we find that, for Beijing’s carbon market, the prediction effect of the models from the highest to the lowest is SVM (0.8301), CN-SVM (0.8300), LDA (0.8296), CN-LDA (0.8288), RF (0.8268), CN-RF (0.8257), CN-NB (0.8060), NB (0.8041), KNN (0.7995), and CN-KNN (0.7873). For Guangdong’s carbon market, the prediction effect of the models from the highest to the lowest is CN-SVM (0.7463), SVM (0.7419), LDA (0.7413), CN-LDA (0.7365), CN-RF (0.7275), RF (0.7270), NB (0.7016), CN-NB (0.6972), KNN (0.6877), and CN-KNN (0.6713). For Chongqing’s carbon market, the prediction effect of the models from the highest to the lowest is CN-RF (0.9112), RF (0.9083), CN-SVM (0.9074), SVM (0.9072), LDA (0.9070), CN-KNN (0.9051), CN-LDA (0.9043), KNN (0.8985), NB (0.8445), and CN-NB (0.8433). For Shanghai’s carbon market, the prediction effect of the models from the highest to the lowest is SVM (0.8305), CN-SVM (0.8305), LDA (0.8286), CN-LDA (0.8280), CN-RF (0.8275), RF (0.8271), NB (0.8045), CN-NB (0.7981), KNN (0.7950), and CN-KNN (0.7933). For Tianjin’s carbon market, the prediction effect of the models from the highest to the lowest is CN-SVM (0.9121), SVM (0.9092), LDA (0.9078), CN-LDA (0.9075), CN-RF (0.9062), RF (0.9055), KNN (0.8971), CN-KNN (0.8901), CN-NB (0.6782), and NB (0.3689). For Shenzhen’s carbon market, the prediction effect of the models from the highest to the lowest is CN-LDA (0.9087), LDA (0.9030), SVM (0.9026), CN-SVM (0.9025), RF (0.9003), CN-RF (0.9002), CN-KNN (0.8956), KNN (0.8924), NB (0.8519), and CN-NB (0.8480). For Hubei’s carbon market, the prediction effect of the models from the highest to the lowest is SVM (0.6915), CN-SVM (0.6914), LDA (0.6624), RF (0.6594), CN-LDA (0.6586), CN-RF (0.6570), NB (0.6415), CN-NB (0.6410), KNN (0.6190), and CN-KNN (0.6082). To sum up the statistical results, the models with the best forecasting effect for the seven pilot carbon markets in Beijing, Guangdong, Chongqing, Shanghai, Tianjin, Shenzhen, and Hubei are SVM (0.8301), CN-SVM (0.7463), CN-RF (0.9112), SVM (0.8304), CN-SVM (0.9121), CN-LDA (0.9087), and SVM (0.6915), respectively. According to the model with the best carbon market prediction effect, after integrating the topology information of the carbon price network, the price trend prediction effect of the four carbon markets in Guangdong, Chongqing, Tianjin, and Shenzhen is significantly improved by 0.5931%, 0.3193%, 0.3190%, and 0.6312%, respectively. For the Beijing, Shanghai, and Hubei carbon markets, the prediction accuracy of the benchmark model has almost no change after the topological structure information of the carbon price network is integrated, and the prediction accuracy varies from 0.0095% to 0.0145%.

4.4. Comparative Analysis of Carbon Price Trend Prediction Results between the EU Carbon Market and Seven Domestic Pilot Carbon Markets

In the following, we make a horizontal comparison of the effect of carbon price fluctuation trend prediction in seven domestic pilot carbon markets and calculate the average accuracy, precision, recall, and F1 value of carbon price fluctuation trend prediction in the EU and seven Chinese pilot carbon markets. The results are shown in Figure 6.

From Figure 6, we can reach the following conclusions: the prediction effect of carbon price in seven domestic pilot carbon markets is significantly higher than that in the EU ETS. The average accuracy of carbon price prediction in the EU ETS is 0.52864, the average precision is 0.54177, the average recall is 0.77535, and the average F1 score is 0.59955. The average accuracy of carbon price prediction in the seven domestic pilot carbon markets is 0.72336, the average precision is 0.75039, the average recall is 0.94293, and the average F1 score is 0.81929. The forecast accuracy, precision, recall, and F1 score of China’s pilot carbon markets are 36.8346%, 38.5079%, 21.6134%, and 36.6508% higher than those of the EU carbon market, respectively. Furthermore, when we look at the comparison of individual carbon markets, we find that the accuracy of price trend prediction of Chongqing’s carbon market ranks first and is 58.0795% higher than that of EU ETS. The accuracy of the price trend prediction of Hubei’s carbon market ranks last and is 4.5587% higher than that of the EU carbon market. The precision of carbon price trend prediction in Tianjin’s carbon market ranks first and is 60.9486% higher than that of EU ETS. The precision of price trend prediction in Hubei’s carbon market ranks last and is 3.6619% higher than that of EU ETS. The recall of price trend prediction in Beijing’s carbon market ranks first and is 26.0111% higher in accuracy than that of the EU ETS. The recall of carbon price trend prediction in Hubei’s carbon market ranks last, with a 9.3414% higher accuracy than that of the EU carbon market. The F1 score of price trend prediction of Chongqing’s carbon market ranks first, with a 51.408% higher accuracy than that of the EU ETS, and the F1 score of carbon price trend prediction of Hubei’s carbon market ranks last, with a 7.6807% higher than that of the EU ETS.

5. Discussion

China’s pilot and EU carbon markets have huge differences in carbon price fluctuations. From the structural characteristics of the carbon price network, the EU carbon price network is an assortativity network, while the seven pilot carbon market price networks in China are all disassortativity networks. Price fluctuations in China’s seven pilot carbon markets also vary widely; the carbon market price in Shanghai presents a left-skewed, flat distribution; the carbon market price in Beijing, Chongqing, Shenzhen, and Tianjin presents a right-skewed, flat distribution; and the carbon market price data in Guangdong and Hubei present a right-skewed, sharp distribution. It can be seen from the prediction results that the prediction effect of the Chinese carbon market price trend using the same model is significantly higher than that of the EU carbon market, indicating that the price fluctuation complexity of China’s pilot carbon market is lower than that of the EU ETS, and the price fluctuation of China’s carbon market is more regular. In fact, in this paper, we predict the future trend of carbon’ price based on the historical data of each carbon market. The more effective the information that can reflect the future trend of price fluctuation in the process of historical price fluctuation, the higher the precision of carbon price trend prediction.

This paper concludes that the prediction accuracy of carbon price trend in China’s pilot carbon market is significantly higher than that in the EU ETS, which indicates that the price fluctuation in China’s pilot carbon market can be explained by historical carbon prices that are significantly higher than those in the EU carbon market. This conclusion is consistent with previous studies [45, 46]. The EU carbon market is currently considered to be a relatively mature carbon market, while the carbon price signal of China’s pilot carbon market has not been fully played. The main reasons for the above problems lie in the inactive trading, imperfect laws, and low marketization degree of the domestic pilot carbon market. Although the market efficiency of China’s pilot carbon market shows a trend of continuous improvement, on the whole, the construction of China’s carbon market is still in its infancy and will face some uncertainties. There is still a long way to go before the carbon market will explore, iterate, and mature.

The importance of low-carbon development is being raised to an unprecedented level after China presented the world with a carbon peak and carbon neutral timetable, which will become the fundamental driving force for the development of the national carbon market. On January 1, 2021, the first implementation cycle of China’s national carbon market was officially launched, involving 2,225 key emitters in the power generation sector. Making the national carbon market a more efficient and active market will be a major challenge. In the process of carbon market construction, industries including steel, cement, chemical, electrolytic aluminum, and paper should be included as soon as possible. More diversified trading parties, including asset operating institutions and financial institutions, should be added. Diversified participants will promote the formation of an active market. To achieve a reasonable price in the carbon market, the total amount should be determined first, and the carbon quota should be traded on the premise of the total amount being determined to establish regular conditions for the price formation. How do we determine and calculate the total amount of carbon? Economic growth, industrial restructuring, energy structure optimization, coordinated control of air pollutant emissions, and other factors need to be taken into account in formulating the total carbon emission quota. As for carbon price, we should give full play to the role of the market itself and reduce the unreasonable intervention of the government in the market as much as possible. However, the government should still establish rules and strict supervision, and at the same time, it should strengthen the connection between the carbon market and the financial market and strengthen the interaction with the carbon finance and capital market, which will increase the size of the carbon market and contribute to the formation of the carbon price. Through the above measures, the carbon financial market can be active, improve the efficiency of the market, and promote the carbon market as an important tool to cope with climate change and achieve carbon neutrality.

The main purpose of this paper is to discuss whether incorporating the topology of the complex transformed from the original carbon price data into the original carbon price data can help improve the prediction accuracy of the existing prediction model, rather than focusing on whether the constructed prediction model has the highest prediction accuracy. In this paper, we select six classical prediction models as benchmark models, and empirical analysis using carbon market price data shows that the prediction accuracy of the benchmark model can be significantly improved after the integration of network topology information, indicating that the idea proposed in this paper is effective. Of course, we can also choose other classical prediction models as benchmark models, such as neural networks and extreme learning machines. Although the neural network model may achieve better prediction accuracy, the neural network model itself contains many parameters such as the number of neurons, the number of hidden layers, activation function, and learning rate, and the choice of parameters has a significant impact on the prediction accuracy of the model. Therefore, in order to reduce the process of parameter tuning and verify the prediction ideas proposed in this paper more quickly, we did not select the neural network model as the benchmark model. In fact, we have built a crude oil price prediction model based on a complex network and neural network in literature [47] and verified that the hybrid model has high prediction accuracy.

In practical application, the core step of the prediction model proposed in this paper is to transform the original data into a complex network. In this paper, the transformation method we use is the visibility graph algorithm, which has the advantage of no parameters, that is, a parameterless mapping algorithm. In addition to the visibility graph algorithm, the complex network transformation algorithm of time series data also includes recurrence network, coarse-grained, and other methods, which need to determine some additional parameters. Therefore, in practical application, we recommend using the visibility graph method to transform time series data into a complex network and then use network topology to extract data information. In the following research, we will continue to study the time series prediction algorithm based on a multilayer complex network.

Data Availability

The carbon price data of EU carbon market (EUETS) and seven pilot carbon markets of China from June 19, 2014, to October 9, 2020, that is, Beijing (BJ), Guangdong (GD), Chongqing (CQ), Shanghai (SH), Tianjin (TJ), Shenzhen (SZ), and Hubei (HB), were selected as sample data.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

The research was supported by the following foundations: National Key R&D Program of China under Grant No. 2020YFA0608602, Qing Lan Project of Jiangsu Province (2021), Six Talent Peaks Project in Jiangsu Province (JY-055), and China Postdoctoral Science Foundation Funded Project (2021M691312).