Impact of Tree Cover Loss on Carbon Emission: A Learning-Based Analysis

Butt, Abdul Haleem; Jamshed, Muhammad Ali; Ur Rahman, Ata; Alam, Faiz; Shakya, Manoj; Almadhor, Ahmad S.; Ur-Rehman, Masoor

doi:https://doi.org/10.1155/2023/8585839

Computational Intelligence and Neuroscience

On this page

Abstract Introduction Results Conclusion Data Availability Conflicts of Interest Acknowledgments References Copyright Related Articles

Special Issue

Computational Overhead vs. Learning Speed and Accuracy of Deep Networks

View this Special Issue

Research Article | Open Access

Volume 2023 | Article ID 8585839 | https://doi.org/10.1155/2023/8585839

Impact of Tree Cover Loss on Carbon Emission: A Learning-Based Analysis

Abdul Haleem Butt,¹Muhammad Ali Jamshed,²Ata Ur Rahman,¹Faiz Alam,¹Manoj Shakya,³Ahmad S. Almadhor,⁴and Masoor Ur-Rehman²

Academic Editor: Muhammad Ahmad

Received21 Jul 2022

Revised09 Aug 2022

Accepted11 Aug 2022

Published01 Mar 2023

Abstract

Describing the processes leading to deforestation is essential for the development and implementation of the forest policies. In this work, two different learning models were developed in order to identify the best possible model for the assessment of the deforestation causes and trends. We developed autoregressive integrated moving average (ARIMA) model and long short-term memory (LSTM) independently in order to see the trend between tree cover loss and carbon dioxide emission. This study includes the twenty-year data of Pakistan on tree cover loss and carbon emission from the Global Forest Watch (GFW) platform, a known platform to get numerical data. Minimum mean absolute error (MAE) for the prediction of tree cover loss and carbon emission obtained through ARIMA model is 0.89 and 0.95, respectively. The minimum MAE given by LSTM model is 0.33 and 0.43, respectively. There is no such kind of study conducted in order to identify the increase in carbon emission due to tree cover loss most specifically in Pakistan. The results endorsed that one of the main causes of increase in the pollution in the environment in terms of carbon emission is due to tree cover loss.

1. Introduction

Global warming is a burning issue causing catastrophic changes and calamities around the world. The increasing occurrence of climate dissipation has been noticed with current global warming that had several biophysical impacts worldwide [1]. One of the main causes of global warming is carbon emission. Sources of carbon emission are both natural and human. Human sources include a variety of man-made actions such as tree logging, forest fires, burning of fossil fuels, cement production, natural gas production, and so on [2, 3].

Hence, the degradation of forests that play an essential role in maintaining a balance in the ecosystem directly adds to global warming. It results in rapid environmental degradation, not only leading to a scarcity of natural resources, decline in quality of life, and long-term public health issues but also inflicting economic losses [4]. Therefore, controlling deforestation can significantly minimize carbon emissions and help improve the environment. Recent studies revealed that the deforestation rate increases with extreme drought and wet years. In another study [5], the authors examined the carbon dioxide emission and climatic effects on major agricultural crop production in Pakistan. The results revealed that the crops such as wheat, maize, sugarcane, cotton, and so on have a constructive association with carbon dioxide emissions. Combating deforestation is being evaluated by cost-effective means of reducing greenhouse gas emissions [6].

Due to the negative impact of carbon emissions on the environment, there is a lot of ongoing research work to find out the efficient methods that have the ability to predict carbon emissions and determine their causes [7–10]. Learning-based techniques have provided new approaches to prediction problems that represent interactions between variables in a deep and layered hierarchy. ML-based techniques like support vector machines (SVMs) and random forest (RF), as well as DL-based algorithms like recurrent neural network (RNN) and LSTM, have attracted lots of attention in recent years because of their applications in a variety of fields [11–14]. In time series forecasting, DL approaches are capable of identifying data structure and pattern, such as non-linearity and complexity [15, 16]. LSTM has been extensively utilized in time series prediction in [17–21]. Autoregressive integrated moving average (ARIMA) is also another forecasting model [22] that predicts the future values based on the past values. ARIMA is the best model for one-step out-of-sample forecasting and is good for the data which consist of linear and short-term dependency (weekly or hourly) [23].

The rapid advancement in the Internet of Things (IoT) would be a future enhancement of this system. There are multiple privacy-related challenges in IoT architecture during communication that can be addressed using blockchain-enabled IoT architecture [24]. Moreover, there is a need for a decentralized auction-based resource allocation mechanism in edge computing-enabled IoT, which would be helpful to make computer resources closer to the devices [25]. To imporved the data quality during the data communication, few studies foucsed on the age of information (AOI) from the prespective of game theory [26]. Game theory is a useful tool to optimize wireless networks by assisting scarce wireless resource allocation, e.g., bandwidth and channels.

Objective of this study is to develop an accessible methodological approach that allows for rapid evaluation of statistical relationships and trends in forest monitoring data using both ML and DL. In this study, two independent learning models were developed using the autoregressive integrated moving average (ARIMA) model and long short-term memory (LSTM). Moreover, this study also focused on understanding the correlation between tree cover loss and carbon emission by taking Pakistan as a case study. Since, there is no such comprehensive effort found in open literature by the authors, it is a novel attempt in this direction. The remaining part of the paper is arranged as follows. The methodology is discussed in Section 2. In Section 3, results are discussed. Section 4 describes the conclusion and future work.

2. Methodology

2.1. Data Collection

Two time series datasets, i.e., tree cover loss and carbon emission in Pakistan, were taken from the GFW platform [27]. Each dataset consists of past 20 years’ information on carbon values added to the environment and the tree cover loss from year 2001 to 2020. It should be noted that a more organized dataset comprising other variables of consideration for this study is not available in the open literature.

Figure 1 shows the proposed methodology of the framework. Following the collection of the dataset, the preprocessing stage is carried out to make the data stationary, as shown in Figure 2. The dataset is then divided into training and testing portions in order to train and evaluate the models. The training component of the dataset is used to train the models, and the testing portion is used for evaluation. The MAP and MAPE error evaluation metrics are evaluated for final analysis of the result.

2.2. Data Preprocessing

Figure 2 shows the preprocessing framework. Firstly, we deal with all missing values of the dataset. For any time series forecasting, being stationary is a mandatory property for a statistical model. A series is called stationary if its statistical property does not change with time. To verify this feature in our dataset, we have used the augmented Dickey–Fuller (ADF) test. After applying ADF, it was found that our dataset taken from GFW platform [27] does not fulfill this condition.

In order to make our dataset stationary, we performed a series of transformations such as power log transformation and differencing before applying (ADF) again for the verification. Figure 1 illustrates our adopted methodology in the form of a flowchart. After preprocessing, the dataset is divided into test and training subsets. The training set is used to train the prediction model and the test set is used to evaluate it. The split between the training and test data is kept at 70% and 30%, respectively.

For forecasting, the ARIMA and LSTM models are used to make predictions and are applied on both datasets.

2.3. ARIMA and LSTM Models

The ARIMA model is a generalization of the simpler autoregressive moving average that incorporates the concept of integration. The ARIMA model parameters are as follows:(i)p: lag order (previously predicted values).(ii)d: degree of difference.(iii)q: order of moving average.

An ARIMA model is a time series forecasting model. It incorporates the properties of two autoregression and moving average models, where in autoregression, lags or previously predicted values are known as “autoregression” while lag or previously predicted error is known as “moving average.” “Difference” is to make time series stationary (also known as integrated stationary time series version).

First step is to determine appropriate hyperparameters of ARIMA, p, h, and q, accurately to predict the behavior of the time series. Then, these hyperparameters are fitted into the training data. Finally, the model fitting residuals are analyzed to check whether the model assumptions are satisfied [6].

This study also utilizes LSTM model, which is a special type of RNN and is able to deal with long-term time dependencies [28]. There are many types of LSTM models that can be used for specific type of time series forecasting problem. In univariate LSTM, single series of observation is required to learn from the past values.

On the contrary, multivariate LSTM makes use of two or more kinds of parallel time series information to learn from the past observations. The basic architecture of univariate LSTM is shown in Figure 3. Basic LSTM network consists of cells that store the data. These cells resemble a transport line that connects one module to another conveying data from the past and gathering the present values [29]. For LSTM, three-layer-based architectures is proposed in this study with a dropout probability of 0.3 and zero non-trainable parameter. The lagged value of time series is used to predict future value with 40,901 learnable parameters.

The LSTM is applied on both of the parameters of carbon emission and tree cover loss with same layers and parameters, and the model for each dataset is trained with 50 epochs. Two separate univariate LSTM models are applied to two different time series variables, i.e., tree cover loss and carbon emission. The models are not only validated based on the difference between observed and predicted values also known as residuals but also exploited for future prediction of tree cover loss association with carbon emission for next three years with upper confidence level of 80% and lower confidence level of 90%.

3. Results

3.1. Stationary Time Series

Being stationary is an essential condition in time series analysis. Most of the time series models assumed that each point is independent of one another. To check this feature in our time series, ADF is applied. Moreover, to make the time series information stationary, differencing and log transformation are performed. The result of ADF test is shown in Table 1. It can be noted that the value of is less than 0.05. This clearly indicates that after applying log transformation and differencing, we have obtained a stationary time series suitable for further prediction analysis.

3.2. Tree Cover Loss with ARIMA

ARIMA results for tree cover loss are shown in Figures 4 and 5. The model is trained with multiple order of ARIMA. The minimum MAE for tree cover loss is 0.95 with ARIMA (1, 1, 1). Similarly, the minimum MAE for tree cover loss is 1.4 and 1.2 with ARIMA (1, 2, 1). Values of p, d, and q are cross-checked using auto ARIMA function in Python. Figure 4 depicts the train (actual) and test (predicted) data showing that the data predict some trends and are at a right scale. In Figure 5, data for past twenty years (2001–2020) are utilized to train the model for prediction of tree cover loss for next five years. The results indicate an increase in tree cover loss in the coming years.

3.3. Carbon Emission with ARIMA

Figures 6 and 7 show the carbon emission results using ARIMA. The model is trained with multiple order of ARIMA. The minimum MAE of 1.20 and mean absolute percentage error (MAPE) of 1.24 are obtained for carbon emission forecasting with ARIMA order of (1, 1, 1) and (1, 2, 1), respectively. The twenty-year data, from 2001 to 2020, are used for the model training. The same model is further exploited to carry out forecasting carbon emission over next five years. The results indicate an increase in the carbon emission for this coming period.

The two results also exhibit that increase in the carbon emission has significant and similarly directed impact on the tree cover loss.

3.4. Tree Cover Loss with LSTM

LSTM has also been applied on the same dataset to compare the results with ARIMA model for tree cover loss. Figures 8 and 9 demonstrate the results for this study. Training and testing loss decay per epoch is shown in Figure 8 while Figure 9 illustrates the tree cover loss prediction for next three years, i.e., 2023–2025.

3.5. Carbon Emission with LSTM

Figures 10 and 11 show the results of LSTM-based carbon emission. The LSTM model is trained up to 50 epochs for both tree cover loss and carbon emission. It is observed from Figure 10 that the train and testing loss is decreasing over time after each epoch while using LSTM. The results indicate that a linear correlation exists between the carbon emission and tree cover loss. Future predictions of the LSTM model also reflect an increasing pattern of the carbon emission.

3.6. Comparison of ARIMA and LSTM

The ARIMA model works by filtering high-frequency noise from data, detecting local patterns based on linear dependencies, and predicting future trends [30]. In addition, the ARIMA model converts tree cover losses and carbon emission features into special temporary variables before matching them and only considers the linear portion of the series [31]. The ARIMA model is simple and forthright and only requires to adjust the values of p, d, and q. The ARIMA model, however, is unable to deal with the non-linear relationship between the tree cover loss and carbon emissions.

On the contrary, the neural network such as LSTM can deal with both linear and non-linear patterns [32]. LSTM is a type of RNN that is meant to learn temporal patterns, capture non-linear dependencies, and preserve relevant memory for a longer period of time, resulting in achieving more accurate predictions [33].

Working of the two approaches, ARIMA and LSTM, for tree cover loss and carbon emission prediction is investigated using mean average precision (MAP) and MAPE evaluation matrices. Results are summarized in Table 2. It is observed that MAP obtained through LSTM is 0.33 while MAPE is 0.25. For carbon emission, LSTM gives MAP of 0.43 and MAPE of 0.40.

The results clearly show that LSTM has performed better than the ARIMA model in estimating and predicting tree cover loss and carbon emission for the analyzed data, hence ratifying that LSTM architecture is more suitable for time series prediction than ARIMA. The results also reflect the strong reliance and linear relationship between the tree cover loss and carbon emission.

4. Conclusion

In this work, a detailed analysis of tree cover loss and carbon emission data is carried out using ARIMA and LSTM techniques. 20-year data, from 2001 to 2020, are utilized to train and test the models and get predictions for next 5 years. The relationship between the two environmental factors is also established. The results have shown that temporal variations in the trend component of both carbon emission and tree loss cover are remarkably associated with each other. It has established that increase in the tree cover loss directly affects carbon emission in the atmosphere. Carbon emission could be one of the significant causes of the tree cover loss and deforestation.

Working of LSTM is found to be more vigorous in these prediction studies. Though very significant, this study is limited by the limited data availability. Both ARIMA and LSTM models showed the same trends. However, LSTM is a model that can learn the long-term dependencies, and it can remember the information that is processed in the model for a very long time [22]. In terms of computational time, the ARIMA models consume more time when using the rolling forecast method, and it is unfeasible to train new models when the orders of p, d, and q increase [34]. LTSM models take significantly less time to train, and once trained, constant predictions can be obtained, while ARIMA models need to be retrained.

Future aspects of this study include better understanding of carbon emission impact and control by considering more factors such as wood fuel, fire, and timber harvest. Inclusion of more factors and parameters can improve the overall prediction accuracy of the models while providing a broader understanding of causes of carbon emission. If more variables that contribute to carbon emissions and deforestation are taken into account, the work described in this paper will be more sophisticated.

Data Availability

The data used to support the findings of this study are available publicly on Global Forest Watch.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This study was supported by Innovative and Collaborative Research Grant under Pakistan-UK Education Gateway (ICRG-2020) (project no. 310366) (Deforestation in Pakistan: Combating through Wireless Sensor Networks (DePWiSeN)).

References

X. Yuan, R. Hamdi, F. U. Ochege, A. Kurban, and P. De Maeyer, “The sensitivity of global surface air temperature to vegetation greenness,” International Journal of Climatology, vol. 41, no. 1, pp. 483–496, 2021.
View at: Publisher Site | Google Scholar
P. Prosperi, M. Bloise, F. N. Tubiello et al., “New estimates of greenhouse gas emissions from biomass burning and peat fires using modis collection 6 burned areas,” Climatic Change, vol. 161, no. 3, pp. 415–432, 2020.
View at: Publisher Site | Google Scholar
M. A. Jamshed, K. Ali, Q. H. Abbasi, M. A. Ur-Rehman, and M. Ur-Rehman, “Challenges, applications, and future of wireless sensors in internet of things: a review,” IEEE Sensors Journal, vol. 22, no. 6, pp. 5482–5494, 2022.
View at: Publisher Site | Google Scholar
W. Qiu, J. Zhang, H. Wu, M. Ahmad, and M. Ahmad, “The role of innovation investment and institutional quality on green total factor productivity: evidence from 46 countries along the “belt and road”,” Environmental Science and Pollution Research, vol. 29, no. 11, pp. 16597–16611, 2022.
View at: Publisher Site | Google Scholar
A. Rehman, H. Ma, I. Ahmad, and M. I. Ahmad, “Examining the carbon emissions and climate impacts on main agricultural crops production and land use: updated evidence from Pakistan,” Environmental Science and Pollution Research, vol. 29, no. 1, pp. 868–882, 2022.
View at: Publisher Site | Google Scholar
X. Liu, Z. Song, and Z. Song, “A comparative study of the data-driven day-ahead hourly provincial load forecasting methods: from classical data mining to deep learning,” Renewable and Sustainable Energy Reviews, vol. 119, Article ID 109632, 2020.
View at: Publisher Site | Google Scholar
M. Usman, R. Kousar, M. S. A. Makhdum, M. R. Yaseen, A. M. Nadeem, and A. M. Nadeem, “Do financial development, economic growth, energy consumption, and trade openness contribute to increase carbon emission in Pakistan? an insight based on ardl bound testing approach,” Environment, Development and Sustainability, pp. 1–30, 2022.
View at: Publisher Site | Google Scholar
S. Ullah, N. . M. Syed, T. Gang et al., “Recent global warming as a proximate cause of deforestation and forest degradation in northern Pakistan,” PLoS One, vol. 17, no. 1, Article ID e0260607, 2022.
View at: Publisher Site | Google Scholar
S. Grunwald, “Artificial intelligence and soil carbon modeling demystified: power, potentials, and perils,” Carbon Footprints, vol. 1, no. 1, p. 6, 2022.
View at: Publisher Site | Google Scholar
N. L. Harris, . E. Goldman, C. Gabris et al., “Using spatial statistics to identify emerging hot spots of forest loss,” Environmental Research Letters, vol. 12, no. 2, Article ID 024012, 2017.
View at: Publisher Site | Google Scholar
A. Y. Dawod and M. A. Sharafuddin, “Assessing mangrove deforestation using pixel-based image: a machine learning approach,” Bulletin of Electrical Engineering and Informatics, vol. 10, no. 6, pp. 3178–3190, 2021.
View at: Publisher Site | Google Scholar
A. Zulfiqar, M. M. Ghaffar, M. Shahzad et al., “AI-ForestWatch: semantic segmentation based end-to-end framework for forest estimation and change detection using multi-spectral remote sensing imagery,” Journal of Applied Remote Sensing, vol. 15, no. 02, Article ID 024518, 2021.
View at: Publisher Site | Google Scholar
K. Taghipour, M. Heydari, Y. Kooch, H. Fathizad, B. Taghizadeh-Mehrjardi, and R. Taghizadeh-Mehrjardi, “Assessing changes in soil quality between protected and degraded forests using digital soil mapping for semiarid oak forests, Iran,” Catena, vol. 213, Article ID 106204, 2022.
View at: Publisher Site | Google Scholar
D. Dominguez, L. d. J. del Villar, O. González-Rodríguez, and M. González-Rodríguez, “Forecasting amazon rain-forest deforestation using a hybrid machine learning model,” Sustainability, vol. 14, no. 2, p. 691, 2022.
View at: Publisher Site | Google Scholar
R. V. Maretto, L. M. G. Fonseca, N. Jacobs, T. S. Korting, H. N. Parente, and L. L. Parente, “Spatio-temporal deep learning approach to map deforestation in amazon rainforest,” IEEE Geoscience and Remote Sensing Letters, vol. 18, no. 5, pp. 771–775, 2021.
View at: Publisher Site | Google Scholar
S. Birim, I. Kazancoglu, S. K. Mangla, A. Kazancoglu, and Y. Kazancoglu, “The derived demand for advertising expenses and implications on sustainability: a comparative study using deep learning and traditional machine learning methods,” Annals of Operations Research, pp. 1–31, 2022.
View at: Publisher Site | Google Scholar
J. Brownlee, “Time series prediction with lstm recurrent neural networks in python with keras,” Machine Learning Mastery, vol. 2016, 2016.
View at: Google Scholar
F. A. Gers, J. Cummins, and F. Cummins, “Learning to forget: continual prediction with LSTM,” Neural Computation, vol. 12, no. 10, pp. 2451–2471, 2000.
View at: Publisher Site | Google Scholar
S. Schmidhuber and J. Schmidhuber, “Long short-term memory,” Neural Computation, vol. 9, no. 8, pp. 1735–1780, 1997.
View at: Publisher Site | Google Scholar
J. Schmidhuber, “Deep learning in neural networks: an overview,” Neural Networks, vol. 61, pp. 85–117, 2015.
View at: Publisher Site | Google Scholar
T. R. R. D. S. R. M. L. K. B. S. S. S. Gadekallu, D. S. Rajput, M. P. K. Reddy et al., “A novel PCA-whale optimization-based deep neural network model for classification of tomato plant diseases using GPU,” Journal of Real-Time Image Processing, vol. 18, no. 4, pp. 1383–1396, 2021.
View at: Publisher Site | Google Scholar
R. Zhang, H. Song, Q. Chen, Y. Wang, S. Wang, and Y. Li, “Comparison of ARIMA and LSTM for prediction of hemorrhagic fever at different time scales in China,” PLoS One, vol. 17, no. 1, Article ID e0262009, 2022.
View at: Publisher Site | Google Scholar
A. K. a. H. D. A. A. S. Muhammad, “Forecasting pet food item stock using ARIMA and LSTM,” in Proceedings of the 4th International Conference on Computer and Informatics Engineering (IC2IE), 2021.
View at: Google Scholar
L. Zhang, Y. Zou, W. Wang, Z. Jin, Y. Su, and H. Chen, “Resource allocation and trust computing for blockchain-enabled edge computing system,” Computers & Security, vol. 105, Article ID 102249, 2021.
View at: Publisher Site | Google Scholar
T. Wang, Q. Yang, X. Shen, T. R. Gadekallu, W. Wang, and K. Dev, “A privacy-enhanced retrieval technology for the cloud-assisted internet of things,” IEEE Transactions on Industrial Informatics, vol. 18, no. 7, pp. 4981–4989, 2022.
View at: Publisher Site | Google Scholar
Z. Han, Y. Yang, W. Wang, L. Zhou, T. N. Nguyen, and C. Su, “Age efficient optimization in UAV-aided VEC network: a game theory viewpoint,” IEEE Transactions on Intelligent Transportation Systems, vol. 2022, pp. 1–10, 2022.
View at: Publisher Site | Google Scholar
G. f. watch, Global Forest Watch Pro, Forest Watcher and forest Atlases, 2014, [Online].
F. Succetti, A. Rosato, R. Araneo, and M. Panella, “Deep neural networks for multivariate prediction of photovoltaic power time series,” IEEE Access, vol. 8, pp. 211490–211505, 2020.
View at: Publisher Site | Google Scholar
S. Siami-Namini, N. Tavakoli, and A. S. Namin, “A comparison of arima and lstm in forecasting time series,” in Proceedings of the 17th IEEE international conference on machine learning and applications (ICMLA), pp. 1394–1401, 2018.
View at: Google Scholar
Y.-w. Wang, Z.-z. Jiang, and Y. Jiang, “Comparison of autoregressive integrated moving average model and generalised regression neural network model for prediction of haemorrhagic fever with renal syndrome in China: a time-series study,” BMJ Open, vol. 9, no. 6, Article ID e025773, 2019.
View at: Publisher Site | Google Scholar
T. Petukhova, D. Ojkic, B. McEwen, R. Poljak, and Z. Poljak, “Assessment of autoregressive integrated moving average (arima), generalized linear autoregressive moving average (glarma), and random forest (rf) time series regression models for predicting influenza a virus frequency in swine in ontario, Canada,” PLoS One, vol. 13, no. 6, Article ID e0198313, 2018.
View at: Publisher Site | Google Scholar
X. Wang and M. Meng, “A hybrid neural network and arima model for energy consumption forecasting,” Journal of Computers, vol. 7, no. 5, pp. 1184–1190, 2012.
View at: Publisher Site | Google Scholar
J. Gu, L. Liang, H. Song et al., “A method for hand-foot-mouth disease prediction using geodetector and lstm model in guangxi, China,” Scientific Reports, vol. 9, no. 1, pp. 17928–18010, 2019.
View at: Publisher Site | Google Scholar
P. K. A. Y. K. R. Kumar, “Three stage fusion for effective time series forecasting using Bi-LSTM-ARIMA and improved DE-ABC algorithm,” Neural Computing & Applications, vol. 2022, 2022.
View at: Publisher Site | Google Scholar

Copyright

Copyright © 2023 Abdul Haleem Butt et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies