Abstract

This paper studies a Stackelberg differential game between an upstream region and a downstream region for transboundary pollution control and ecological compensation in a river basin and increases the number of pollutants assumed in the model to multiple. Emission and green innovation investment between upstream and downstream regions in the same basin is a Stackelberg game, and the downstream region provides economic compensation for green innovation investment in the upstream region. The results show that there is an optimal ecological compensation rate, and a Pareto improvement result can be obtained by implementing ecological compensation. Increasing the proportion of ecological compensation can improve the nonvirtuous chain reaction between green innovation investment cost, pollutant transfer rate, and ecological compensation rate. Therefore, it is necessary to establish a joint mechanism composed of the government and the market and formulate a reasonable green innovation subsidy scheme according to the actual situation of the basin, so as to restrict the emergence of this “individual rational” behavior. For river basin areas that can establish a unified management department and organize the implementation of decision-making, the cooperative game is a very effective pollution control decision.

1. Introduction

Since the late 1970s, the application of differential game theory has flourished in management science and economics. Relevant research covers economics, industrial organization, oligarchy theory, resource and environmental economics, labor economics, marketing, production and operation management, finance, innovation, and many more. Dockner et al. [1] comprehensively introduced the basic principles and applications of differential game theory. Starting from the differential game theory, this paper analyzes the basic knowledge needed to establish the model and then expounds the practical operation of the theory in the field of economics and management. Sana [2] dealt with an inventory model to determine the retailer’s optimal order quantity for similar products. Jorgensen and Zaccour [3] reviewed the literature on the differential game theory method used in cooperative advertising in marketing channels (supply chain). Cabo et al. [4] analyzed the sustainability of economic growth in a dynamic two country trade model. Janssens and Zaccour [5] studied how to determine the optimal price, subsidy path, and strategic decision of emerging industries such as solar energy under budget constraints. Shib [6] established a differential game model to study the optimal solution and pricing of drug sales team. Andres Domenech et al. [7] simulated the impact of forest depletion on carbon dioxide accumulation in the atmosphere, considering that forest owners use forest resources to obtain wood and deforestation to obtain economic benefits in the form of agriculture, while causing negative externalities to nonowners. Petrosyan and Zaccour [8] believed that it is individual and collective rationality for the participants to sign long-term cooperation agreements. Bhattacharyya and Sana [9] dealt with a mathematical model of the production inventory system of green products in a green manufacturing industry. Then, Roy et al. [10] studied a two-echelon supply chain comprising of one manufacturer and two competing retailers with sales price dependent demand and random arrival of the customers. Sana [11] dealt with a newsvendor inventory model in light of green product marketing of corporate social responsible firms and analyzed including subsidy and tax implementation by the Government. Sana [12] used differential game theory to analyze two green supply chain system models.

The problem of transboundary pollution arises with the development of industrialization and has become a research focus since the 1990s. For example, Ploeg et al. [13] assumed that participants use a linear Markov perfect strategy and developed a differential game model to probe into the transboundary pollution issue, by which the cooperative and noncooperative Nash equilibrium results are reached. Long [14] investigated an international transboundary pollution game and found that the symmetric open-loop Nash equilibrium yields more pollution than in the case of the players use cooperative strategy. Extending the research of Long [14], Dockner and Long [15] showed that, in the transboundary pollution games, noncooperative behavior will result in welfare loss for both countries. Petrosyan and Zaccour [16] devoted the fair distribution of the total cost resulting in the cooperation in transboundary pollution control. Following the idea of Petrosyan and Zaccour [16], Jørgensen and Zaccour [17] investigated the distribution of cooperative residual benefits, and a payment distribution mechanism supporting the subgame consistent solution is given. Yeung [18] first derived the time-consistent solutions in a cooperative differential game and studied the pollution management in a stochastic differential game framework. Immediately afterwards, in paper [19], a cooperative stochastic differential game of transboundary industrial pollution is presented, and a payment distribution mechanism is derived to maintain the subgame consistency. Benchekroun and Martín-Herran [20] explored the effects of foresight in a transboundary pollution game, and they showed an interesting result that, in the case where one myopic country becomes farsighted, the welfare levels will not necessarily increase. Li [21] developed a transboundary pollution differential game model in which the emission permitted trading is taken into account. Huang et al. [22] developed a model in which there is a Stackelberg game between the industrial firms and their local government, while the governments can cooperate in transboundary pollution control. Chang et al. [23] investigated the optimal pollution abatement strategies of two countries involved in transboundary pollution under cooperative and noncooperative games. Additionally, there are several published studies of transboundary pollution problems from other views, such as renewable resources, clean technologies, harmonization of international and domestic law, abatement cost, and R&D spillovers (for instance, [2430]).

Ecological compensation refers to a case that the investment cost of environmental governance is shouldered in part or in whole by the one who enjoys the benefits of environmental improvement or government [31]. It is estimated that ecological compensation is an efficient instrument in cooperative environmental pollution control [32]. However, the research literatures above mentioned on transboundary pollution control do not consider the ecological compensation issue. Until recently, Jiang et al. [33] investigated a transboundary pollution control gaming with ecological compensation in a river basin. In the model of Jiang et al. [33], the upstream region as the leader of the Stackelberg game is responsible for pollution abatement, and the downstream region as the follower compensates the upstream region for her abatement investment in part. In fact, this paper can be viewed as an extension of Jiang et al. [33]. Differences from Jiang et al. [33] are as follows: (i) it is not just the upstream region; in our model, both the upstream region and downstream region in a river basin can invest in pollution abatement (green innovation). (ii) Different from the model of Jiang et al. [33], in our model, the leader of the game is the downstream region who determines the ecological compensation rate firstly, then the follower, e.g., the upstream region, decides the abatement investment under the given compensation rate. (iii) In our model, we also consider the learning effect of two regions through green innovation investment to improve efficiency and reduce costs. The positive effects of learning by doing have been widely studied since Arrow [34]. He thought that a good index of the stock of knowledge is the cumulative investment. Recently, Li and Pan [35] investigated the pollution abatement issue in which the experience using pollution abatement technology is measured by the cumulative abatement from time 0 to ; this may be the first time that the cost-reducing learning by doing in cumulative investment is taken into account. To expand the thinking of Li [21], in this paper, we examine the effects of efficiency-improving and cost-reducing learning by doing in green innovation simultaneously. The reason for taking efficiency-improving learning by doing into account is that the phenomenon of efficiency-improving learning by doing is real, and it is widely accepted [36]. To some extent, it can be called “skill comes by exercise” or “practice makes perfect.” Grosse et al. [37] argued that performance improvements of individuals, groups, or organizations over time are a result of accumulated experience.

However, the research on the treatment of transboundary water pollution based on river basins and the search for effective methods to solve the integrated management have also become the goal pursued by many researchers. How to scientifically and effectively coordinate the interests of the participating entities in the river basin, optimize the water environment planning, and fully solve the problem of water pollution has become the key to the prevention and control of transboundary water pollution. Studying the problem of transboundary water pollution prevention, control from the level of cross-regional coordination, and cooperation in the river basin and seeking solutions to the problem of water pollution control have important practical significance for solving regional water environmental problems.

The rest of this paper is organized as follows. In Section 2, we will establish our basic dynamic general equilibrium model. The strategies of both the upstream region and the downstream region are achieved in Section 3 and Section 4. Some discussions are provided with several numerical examples and policy implications in Section 5. Finally, Section 6 concludes the paper.

2. The Necessary Assumptions and Establishment of the Model

2.1. The Basic Assumptions of the Model

Let us start out with a simple but widely used assumption in a related study (Li and Guo [38]) that a river basin consists of only two regions, upstream and downstream, denoted by subscripts , . In addition, both regions produce an identical consumed good with an identical and given fixed endowment of production factors and an identical and given technology over continuous time . Moreover, in order for the model to accommodate the study of multipollutant transboundary pollution, it is assumed that the production process in the two regions is accompanied by the emission of two kinds of pollutants identified by subscripts , . Further, there is a close relationship of both regions’ revenue and the amount of the pollutants given aswhere , , and and are constant which measure the marginal contribution of the amount of pollutants of region . From (1), one can see that and , which signify that an increase in any one pollutant will increase the revenue of the region but may be because may be . So, the amount of each pollutant should be weighed by the region , . In addition, (1) shows , which means the marginal contribution of each pollutant to returns is diminishing.

Next, consider both regions’ stock evolution of the two pollutants and with time . Assume they are given bywhere and indicate the natural decay rate of emissions of both regions. stands for the transfer coefficient of pollution from the upstream region to the downstream region. Dynamic equations (2) and (3) imply that the pollution can be abated linearly by the green innovation investment and carried out by the upstream region and downstream region, respectively. Further, inquire the abatement investment cost . Assume there are following equations:where and are constant. (4) shows that and which mean that investment in each pollutant reduction incurs an increase in total cost. In addition, the second derivative and indicate that the marginal cost of abatement investment is rising.

Following Lambertini and Mantovani [39] and Guiomar et al. [40], assume that both regions are damaged by the stock of pollution, and we assume that there exists the following pollution damage function:where and measure the marginal impact of the stock of pollution and . In addition, we assume and . means that the two pollutants have different properties, so the marginal effects of the two pollutants on the same region are different. says that the marginal effects of the same pollutant on different regions are different due to the difference of the economic development level. It is generally assumed that the more developed the economy, the greater the marginal effects of pollution damage on the region.

A final but crucial assumption is that, in order to encourage the upstream region to invest in green innovation for reducing the quantity of transferring pollution from the upstream region to the downstream region, we suppose that the downstream region is willing to compensate the upstream region for her investment in green innovation with compensation rate .

2.2. The Establishment of the Differential Game Model

According to differential game theory, the objective of the upstream region is to find the optimal green innovation investment , and the optimal quantity of emissions quantity , to maximize following discounted revenue flow over the continuous time , :

Similarly, the optimization problem of the downstream region can be given as

Visibly, the game between the upstream region and the downstream region in the river basin is a Stackelberg game in which the downstream region as the leader decides the compensation rate at first and then the upstream region as the follower determines her green innovation investment under the given compensation rate. In the next section, we will apply the backward induction to solve the optimization problems of both regions.

3. Decision-Making Choice of Noncooperative Game in Upstream and Downstream

3.1. The Upstream Region’s Optimal

The upstream as the follower of the Stackelberg game determines her optimal strategy under the given compensation rate decided by the leader, e.g., the downstream region. From (6), the following current-value Hamiltonian can be obtained:where and are dynamic costate variables measuring the shadow prices of the associated state equations and , respectively.

From the first-order conditions, costate conditions, and state equation of current-value Hamiltonian (8), following dynamic system (9) can be written out:

To obtain the general managerial insight, we devote our mind to the steady state equilibrium. If there do exists a steady state equilibrium, let us apply the superscript “” to identify the noncooperative equilibrium results. Solving (9) under state equilibrium conditions, we getwhere represents the optimal strategy selection of pollutant 1 emission rate in the upstream region of the basin in the noncooperative game decision model, represents the optimal emission rate of pollutant 2, and represent the optimal green innovation investment choices for pollutants 1 and 2 in the upstream region, respectively, represents the stock level of upstream pollutant 1 under the condition of noncooperative game decision, and represents the stock level of pollutant 2. The above results reflect the optimal strategy choice of noncooperative game decision-making in the basin. That is to say, in practice, the upstream of the river basin adopt the optimal Nash equilibrium strategy for themselves after receiving a certain amount of green innovation subsidies from the downstream.

3.2. The Downstream Region’s Optimal

As the leader, the downstream region sets the compensation rate and decides her green innovation investment and to maximize her benefits. Note, when the downstream region is set the compensation rate, she must take into account the response of the upstream region. We obtain current-value Hamiltonian from (7) as follows:

Using the first-order conditions, costate conditions, and state equation of current-value Hamiltonian (11), the following dynamic system is achieved:

Under steady state equilibrium conditions, solving dynamic system (12), the following noncooperative equilibrium results of the downstream region are reached:where represents the optimal strategy selection of pollutant 1 emission rate in the downstream region of the basin in the noncooperative game decision model, represents the optimal emission rate of pollutant 2, and represent the optimal green innovation investment choices for pollutants 1 and 2 in the downstream region, respectively, represents the stock level of downstream pollutant 1 under the condition of noncooperative game decision, and represents the stock level of pollutant 2. The above results reflect the optimal strategy choice of noncooperative game decision-making in the basin. That is to say, in practice, the downstream of the basin, after subsidizing a certain amount of green innovation in the upstream, adopt the optimal Nash equilibrium strategy for themselves.

Next, we solve the optimal compensation rate . From (7), we get downstream region’s value function (14) under steady state equilibrium conditions:

Substituting the results reached in (8) and (9) and applying the first-order conditions , we obtain following equation:

Here, represents the optimal green innovation subsidy rate, that is, the benefit compensation rate when the upstream and downstream regions of the basin can reach the optimal Nash equilibrium.

4. Decision-Making Choice of Cooperative Game in Upstream and Downstream

Under a cooperative framework, the common goal of the two regions is to maximize the discounted stream of net present joint returns by controlling the quantity of emissions , , , and as well as the investment level of green innovation , , , and over continuous time , , namely,

There is the following current-value Hamiltonian:where , , , and are the costate variables which correspond to the state variables , , , and .

Through the first-order conditions, costate conditions, and constraint equations of current-value Hamiltonian (17), we yield the following dynamic systems:

Solving dynamic differential system (18) under the steady state condition and characterizing the steady state equilibrium solutions by superscript “,” following results are obtained:

5. Numerical Examples and Policy Implications

Through the analysis mentioned above, we have obtained the optimal emission, green innovation investment, and pollution stock under noncooperative game and cooperative game.

In this section, we will simulate and analyze the influencing factors and changes of the green innovation compensation rate, the change trajectory of each variable under different decision-making models, and further analysis. The parameters used in the numerical examples are presented in Table 1 and we use version 7.0 of Wolfram Mathematical MATLAB to obtain the numerical solutions. The parameters are as follows [4143].

5.1. Green Innovation Compensation Rate

There is a positive relationship between the rate of ecological compensation and the coefficient of green innovation investment cost. As shown in Figure 1, the higher the green innovation investment cost, the greater the proportion of ecological compensation. For polluting enterprises or polluted areas, higher green innovation investment costs are not conducive to the improvement of pollution control technologies, and they are more inclined to accept investment ecological compensation from others. When the level of green innovation in the upstream region is low, a certain amount of investment ecological compensation is required from the downstream so that both the upstream and downstream of the basin can reach the optimal solution of the Nash equilibrium.

The proportion of ecological compensation increases as the transfer of pollutants increases. As shown in Figure 2, when the proportion of pollutants transferred from the upstream of the basin to the downstream increases, the proportion of investment compensation from the downstream to the upstream region will increase accordingly. Downstream regions try to reduce the pollution stock by transferring pollution control investment. Figure 3 shows the relationship between the transfer rate of the two pollutants and the ecological compensation rate, which also reflects this positive correlation. In addition, we can also see that the rate of change of this positive correlation is gradually decreasing. In the initial stage, when the amount of pollutants transferred is small, the amount of ecological compensation will increase rapidly. However, when the proportion of pollutant transfer in the upstream basin gradually increases to a certain value, the rate of increase in ecological compensation will slow down. In other words, when a large amount of upstream pollutants are transferred to the downstream regions, downstream regions will no longer expect to reduce the stock of pollutants in the basin by transferring pollution control investment compensation, but may continuously increase the pollution control efforts in the region to change this situation.

There is a chain reaction relationship among the investment cost of green innovation, pollutant transfer rate, and ecological compensation rate. The increase of green innovation investment cost will promote the downstream region to increase the amount of ecological compensation to the upstream. Increasing the compensation rate of the downstream to the upstream will make the upstream increase the pollution control, which will reduce the amount of pollutants transferred to a certain extent. Conversely, the reduced transfer of pollutants will reduce the amount of compensation from the downstream to the upstream region. After such a game, a nonvirtuous circle is formed, which leads to the failure of cooperation in watershed pollution control, and ultimately is not conducive to the treatment of water pollution in the transboundary river basin.

Therefore, it is necessary to establish a joint mechanism composed of the government and the market and formulate a reasonable green innovation subsidy scheme according to the actual situation of the basin, so as to restrict the emergence of this “individual rational” behavior.

5.2. Comparative Analysis of Variables

Figures 4 and 5 show the change trajectory of the emission rate in upstream and downstream regions under the noncooperative decision-making model. It can be found that, in a certain period of time, the pollution control game of green innovation compensation is relatively stable, and the pollution emission rate of each region is relatively stable. With the development of noncooperative decision-making game, the game between the two sides tends to be not cooperative. The investment level of pollution control will also gradually increase, as shown in Figures 6 and 7, which will significantly increase the investment cost of pollution control in various regions. This result has caused the gradual accumulation of pollutants in the upstream and downstream regions of the basin. As shown in Figures 8 and 9, the water pollution situation continues to deteriorate. However, as the level of green innovation investment continues to increase, in the long term, the pollutant stock in the upstream and downstream of the basin will be restricted and tend to stabilize. It shows that the effect of green innovation investment is beginning to appear.

Under the model of cooperative decision-making, the upstream and downstream regions of the basin reach some agreement, set up a joint decision-making department or unified decision-making by a higher management department, and jointly coordinate the pollution discharge strategies of the two regions with the goal of maximizing the total net present value of long-term income. The cooperative decision-making model is different from noncooperative in that it can internalize environmental externalities and maximize welfare, as shown in Figures 1015. The emission rate of pollutants in the river basin has been drastically reduced, the level of abatement investment in pollution control has also been reduced, and the level of pollution stock has been significantly reduced. Finally, the water pollution environment in the basin has been greatly improved.

The level of green innovation investment has not changed much, but the emission reduction effect brought by the continuous strengthening of green innovation investment is very obvious. As shown in Figures 14 and 15, the stock level of pollutants in the river basin becomes stable in a short period of time, and the stable state level is far below the stock under noncooperative game conditions.

5.3. Comprehensive Analysis

In order to analyze the change trajectory of each variable more intuitively, we compare the results of two kinds of decision-making, namely, noncooperative and cooperative game decision-making, as shown in Figures 16 and 17. It can be seen that the investment governance level under the cooperative game decision-making model is slightly lower than that of the noncooperative game, as shown by the solid line in Figure 16. The pollution stock level, as shown by the solid line in Figure 17, is significantly lower than that of noncooperative game decision-making. This shows that although the cooperative game decision-making reduces the investment level of pollution abatement, it greatly reduces the pollution stock level in the basin and effectively improves the water environment quality.

5.4. Further Analysis

From the above comparative analysis, we found that the cooperative decision-making model is an effective transboundary water pollution control model. On further analysis, we can also find that, in the steady state, the pollutant stock is inversely proportional to the ecological compensation rate, and this rate of change gradually becomes faster as the ecological compensation rate increases. It shows that the proportion of ecological compensation has a law similar to the increasing marginal utility. In Section 3, we obtain the Nash equilibrium solution of the ecological compensation rate in the noncooperative decision-making model; the optimal value is 0.41 by simulation, but it seems that this is not the optimal value of the ecological compensation rate when the pollution stock can be reduced to a relatively low level. Therefore, we further calculate that if the pollution stock is reduced to the level of the cooperative decision-making model, as shown in Figure 18, the value of the ecological compensation rate at this time is about 0.8589. In other words, if the compensation rate under the noncooperative decision-making model is 0.8589, the pollution stock in the basin can also drop to a relatively low level.

6. Conclusions and Policy Recommendations

6.1. Main Conclusions

This article mainly studies the decision-making of transboundary water pollution control under the premise of multiple pollutants. Among them, it also includes the determination of the green innovation compensation rate and the analysis of influencing factors. We established a differential game model to solve the optimal Nash equilibrium solution, used numerical simulation to measure and analyze the change trajectory of various variables in cooperative and noncooperative decision-making models, and obtained the optimal green innovation compensation rate value in the noncooperative model. In addition, we further discussed the improvement of the compensation rate.

The investment cost of green innovation is directly proportional to the ecological compensation rate. In the game of transboundary river basins, the investment cost of pollution control in the downstream is the key factor to determine the compensation to the upstream region. The higher the investment cost of green innovation, the greater the ecological compensation rate; otherwise, the lower. In addition, the rate of ecological compensation is also affected by the amount of pollutants transferred. When the proportion of pollutants transferred from the upstream to downstream becomes larger, the proportion of investment compensation from the downstream to the upstream region will increase accordingly. On the contrary, increasing the proportion of compensation to the upstream in the downstream area will make the upstream increase the pollution control, which will reduce the amount of pollutants transferred to a certain extent. The nonvirtuous chain reaction between green innovation investment cost, pollutant transfer rate, and ecological compensation rate is not conducive to the orderly progress of transboundary water pollution control in the river basin.

6.2. Policy Recommendations

Regarding the direct proportional relationship between the green innovation subsidy rate, the green innovation investment cost, and the pollution transfer coefficient, local regulatory agencies in the basin must consider both the cost of innovation investment and the amount of pollutants transferred. In order to achieve the optimal management goals, it is necessary to ensure that the upstream and downstream of the basin can reach the optimal Nash equilibrium.

In order to deal with the nonvirtuous chain reaction between green innovation investment cost, pollutant transfer rate, and ecological compensation rate and to ensure that the transboundary water pollution of the river basin can be effectively and cooperatively controlled, this requires the use of regulatory policies to intervene to encourage the downstream regions to increase the proportion of ecological compensation and form a stable transfer mechanism. To a certain extent, the downstream will not reduce the amount of ecological compensation to the upstream region due to the reduction of pollutant transfer. Therefore, it is necessary to establish a joint mechanism composed of the government and the market and formulate a reasonable green innovation subsidy scheme according to the actual situation of the basin, so as to restrict the emergence of this “individual rational” behavior.

For two adjacent regions with transboundary water pollution problems, if cross-regional unified cooperation cannot be achieved, ecological investment compensation is also a good choice that can effectively alleviate regional pollution. The level of innovation investment under noncooperative decision-making is also relatively high, and to a certain extent, it can also reduce pollutant emissions and facilitate the improvement of clean technology. For river basin areas that can establish a unified management department and organize the implementation of decision-making, cooperative game is a very effective pollution control decision. It can use less innovation investment to reduce the emission of pollutants. At the same time, in a steady state, the pollution stock level of the basin drops a lot compared with noncooperative decision-making.

In addition, the central government should strengthen the modern management of the river basin, improve the transboundary water pollution prevention and control system, establish a new cooperation mechanism for the prevention and control of transboundary water pollution, promote the construction of the upstream and downstream coordination platform, and promote the organic integration and cooperation in the operation process of the coordination mechanism of various river basin management agencies, so as to reduce the transaction cost. Relevant government departments should speed up the market-oriented construction of river basin transboundary water pollution control, innovate the market-oriented mode of transboundary ecological compensation, improve the river basin emission permit system and trading system, strengthen the multiparticipation mechanism of the river basin, and strengthen information disclosure and public participation. River basin management organizations should strengthen publicity and education, enhance the awareness of environmental protection of citizens in the river basin, and advocate resource-saving and environment-friendly production and living technologies.

Data Availability

The data used to support the findings of this study are included within the article.

Conflicts of Interest

The authors declare that they have no conflicts of interest regarding the publication of this paper.

Acknowledgments

This work was supported by the National Social Science Foundation of China (Grant no. 18ZDA040), Humanities and Social Science Foundation of the Ministry of Education of China (Grant no. 17JJD790017), and the Hunan Provincial Social Science Outcomes Review Committee Major Projects (Grant no. XSP20ZDA007).