Intelligent Identification of Coal Crack in CT Images Based on Deep Learning

Yu, Jinxia; Wu, Chengyi; Li, Yingying; Zhang, Yimin

doi:https://doi.org/10.1155/2022/7092436

Computational Intelligence and Neuroscience

On this page

Abstract Introduction Related Work Conclusions Data Availability Conflicts of Interest Acknowledgments References Copyright Related Articles

Special Issue

Lightweight Computational Intelligence for Sequential Data Analysis in Edge Computing

View this Special Issue

Research Article | Open Access

Volume 2022 | Article ID 7092436 | https://doi.org/10.1155/2022/7092436

Intelligent Identification of Coal Crack in CT Images Based on Deep Learning

Jinxia Yu,¹Chengyi Wu,^1,2Yingying Li ,^2,3and Yimin Zhang³

Academic Editor: Le Sun

Received30 Jun 2022

Revised30 Jul 2022

Accepted22 Aug 2022

Published23 Sept 2022

Abstract

Automatic segmentation of coal crack in CT images is of great significance for the establishment of digital cores. In addition, segmentation in this field remains challenging due to some properties of coal crack CT images: high noise, small targets, unbalanced positive and negative samples, and complex, diverse backgrounds. In this paper, a segmentation method of coal crack CT images is proposed and a dataset of coal crack CT images is established. Based on the semantic segmentation model DeepLabV3+ of deep learning, the OS of the backbone has been modified to 8, and the ASPP module rate has also been modified. A new loss function is defined by combining CE loss and Dice loss. This deep learning method avoids the problem of manually setting thresholds in traditional threshold segmentation and can automatically and intelligently extract cracks. Besides, the proposed model has 0.1%, 1.2%, 2.9%, and 0.5% increase in Acc, mAcc, MioU, and FWIoU compared with other techniques and has 0.1%, 0.8%, 2%, and 0.4% increase compared with the original DeepLabV3+ on the dataset of coal CT images. The obtained results denote that the proposed segmentation method outperforms existing crack detection techniques and have practical application value in safety engineering.

1. Introduction

Coal is an important energy source for human society. The phenomenon of deformation and damage of coal and rock mass under load is common, which has a huge impact on the safety of mining engineering. The research on digital core technology based on industrial CT scanning technology is of great significance for the mining safety, and its basis is the high-precision segmentation of cracks in industrial CT scanning images. As the key technology of digital core, 3D reconstruction needs high-precision segmentation results to reflect the original topology of cracks. However, artificial segmentation of coal crack CT images undoubtedly takes a lot of time and energy. And, most of the existing auxiliary software is based on traditional threshold segmentation methods which are still impossible to get rid of the interference of noise even working with some image preprocessing methods. Therefore, intelligent and automated segmentation of coal crack CT images is particularly important.

Digital images contain a lot of important information, which can be extracted in different ways in different fields. For example, it can be used in the field of encryption technology [1, 2], information security [3–6], in the field of industrial engineering [7], in the field of agriculture [8], and so on. Digital image processing technology includes many categories [9, 10], and image segmentation is one of them. Noise is one of the difficulties in the segmentation of coal crack images. In order to reduce the noise and enhance images, methods that were applied in the segmentation of crack include morphological filter [11], wavelet transforms [12, 13], anisotropic diffusion filter [14], and so on. However, many noises cannot be fundamentally removed by the traditional methods. Machine learning algorithms can achieve automatic crack detection and segmentation to a certain extent including structured forests [15, 16], minimal path selection [17], support vector machine [18], etc. Nevertheless, most features in machine learning need to be identified by experts and hand-coded. Deep learning models have powerful learning ability which can automatically complete the tasks of classification, detection, and segmentation after training. Starting from FCN [19], many high-performance semantic segmentation models have emerged such as U-net [20], SegNet [21], and PSPNet [22]. These models are based on convolution operations, apply feature extraction networks as backbones, and incorporate multiscale semantic information to achieve pixel-by-pixel segmentation of images. Deep learning methods have been applied in different crack segmentation fields [23–25] nowadays.

In this work, we present an end-to-end coal crack CT image segmentation method based on the deep learning model DeepLabV3+ [26]. Compared with existing deep learning methods, postprocessing is unnecessary for our method. Besides, our method achieves better performance on some evaluation metrics. These advantages are meaningful for the subsequent 3D reconstruction work and the establishment of digital cores.

All data in this paper comes from the tomograms of high-precision industrial CT during the fracturing experiment of coal samples. The CT scanning equipment is from the Nation Key of Natural Gas Geology and Natural Gas Control of the Henan University of Technology Laboratory which is a phoenix v|tome|xm high-resolution industrial X-ray μCT scanner [27]. The equipment diagram and CT imaging principle are shown in Figure 1. Images collected by this equipment have many noise points in the coal matrix, and different samples may have different colors. CT image samples of coal cracks are shown in Figure 2. The datasets used for training in this experiment are cut from CT images obtained by the aforementioned platform at different sizes. The diversity of the crack structure is fully considered in the interception process to adapt to the segmentation of different images.

(a)

(b)

The high performance of deep learning in computer vision was first demonstrated in classification tasks. Many CNN models can provide good classification accuracy such as Vgg [28], ResNet [29], Xception [30], and so on. Some of them are applied as feature extractors in segmentation models. FCN replaces the fully connected layer in the classification model with deconvolution to upsample the pooled feature map to its original size, pioneered semantic segmentation. The application of deep learning in crack detection can be roughly divided into three types, methods based on classification [31], object detection [32], and semantic segmentation [33, 34]. Xue et al. [35] modified the last few deconvolution modules of FCN to adapt to the needs of crack segmentation. However, this FCN-based method may not be able to guarantee the accuracy of segmentation and maintain the original topological structure of the crack when facing the crack of complex structures.

DeepLabV3+ is a high-performance semantic segmentation model derived from DeepLabV1, V2, and V3 [36–38]. In view of the adverse effect of excessive downsampling on segmentation accuracy, this model proposed to use atrous convolution to reduce downsampling and enlarge the receptive field simultaneously. This model also applied atrous spatial pyramid pooling to capture and fuse multi-scale semantic information which is beneficial to improving the accuracy of segmentation. Besides, Encoder-Decoder architecture is used to recover pixels of features better. DeepLabV3+ achieves new state-of-the-art performance on PASCAL VOC 2012 dataset. However, compared with the public semantic segmentation dataset, the crack image dataset has the characteristics of smaller targets and unbalanced positive and negative samples. So we have made some improvements to the original model for these characteristics. The coal crack CT image segmentation method that we proposed has the following contributions:(1)Given that there are no publicly available datasets for research in this field, we established a dataset of coal crack CT images for our research. All original pictures come from a professional coal sample fracturing experimental platform and all labels are made by hand marking.(2)We modified DeepLabV3+ to adapt to the need for coal crack CT images by adjusting the OS of the backbone, adjusting the encoder-decoder module, and changing the rates of the ASPP module. The modified model achieves better performance than the original model under some authoritative evaluation indicators commonly used in semantic segmentation: PA, MPA, MIoU, and FWIoU.(3)A new loss function is defined by combining the CE loss and Dice loss. While adding contour factors to the prediction, the curve fluctuation of the Dice function in the training is alleviated.

3. Methodologies

3.1. Dataset

Since there is no open-source dataset for CT segmentation images of coal crack, we established a coal crack dataset manually. All these images were taken from the original coal fracturing experimental images in different sizes and different length-width ratios. All data were captured in images acquired by high precision industrial CT introduced before. It consists of 437 RGB images and their segmentation labels, including different crack shapes, complexities, and different background colors. Some representative images and their annotations are shown in Figure 3. These samples can reflect the complexity of crack morphology, noise situation, and background differences in the dataset to a certain extent.

Data augmentation is a technique widely used in deep learning. In supervised learning, fine data annotation is a time-consuming and energy consuming work. Data augmentation can expand the dataset so that the parameters learned during model training are more reliable and can effectively avoid overfitting. So we enhanced the coal crack dataset to 5000 in different ways: rotation, flip and zoom. Angles of rotation were limited to −30 to 30 degrees, the flip direction is horizontal and the ratios of zoom were limited to 80% to 120%. Finally, after data augmentation, the training set contains 3500 images and the test set contains 1500 images.

3.2. Atrous Spatial Pyramid Pooling

Atrous convolution can be used to capture multiscale contextual information. The parameter can set different dilation rates of atrous convolution which can be regarded as the stride of the input signal we sample. The output of atrous convolution of a one-dimensional input signal with a filter of length is defined as follows:

Combined with spatial pyramid pooling, ASPP is applied as a Multiscale information fusion module. The structure which is applied in the DeepLabV3+ model achieved multi-scale information collection using four different rates of atrous convolutions (including image-level pooling). Different from the rates of (1, 6, 12, and 18) used in the original DeepLabV3+, more kinds of combinations of rates were tried using to make the feature extractor more suitable for crack segmentation. As the OS (Output Stride) of the backbone was adjusted to 8 to reduce downsampling, a larger receptive field is required. We tried to make the enlargement of the receptive field follow the size of the feature map output from the backbone. And, the experiment proved that rates of (1, 12, 24, 36) can achieve a better performance. A more intuitive situation about ASPP can be seen in Figure 4.

3.3. Encoder-Decoder

The encoder-decoder structure is widely applied in the field of computer vision. As for the semantic segmentation field, the encoder gains semantic information from images with feature maps reducing as a feature extraction module. DeepLabV3+ model uses DeepLabV3 as the encoder block with some effective improvement. The decoder is applied to reconstruct the segmentation result by restoring the pixel and size of the feature map, at the same time, keeping the details of the original image as much as possible. DeepLabV3+ proposed a simple decoder as shown in Figure 4 and obtained a good effect practically. The first upsampling rate was adjusted to 2 as the OS of the backbone was changed to 8.

3.4. Adjusted Xception as Backbone

Xception, as a high-performance convolution neural network is applied as the feature extractor of ordinary DeepLabV3+. This deep structure is developed based on the Inception model and based entirely on depthwise separable convolution. Unlike conventional convolution, in depthwise separable convolution, each feature map channel only needs to perform an operation with each channel of the convolution kernel. This kind of convolution can effectively reduce the number of parameters and computing costs, and by using this, Xception expanded the scale of the model and became state-of-the-art CNN architecture in classification tasks. The ordinary Xception has an OS = 32 so that it can adapt to the needs of classification tasks. But excessive pooling makes the feature maps too small so that the detailed information can be damaged. In order to get dense feature maps, the OS of 16 or 8 is desirable.

Different from the OS of 16 which performed better in natural scene datasets, crack images need a denser way to extract features because the targets of these images are tiny in most cases. For these small targets, downsampling has a particularly serious loss of accuracy. So the OS of 8 was used in this model, at the same time, the ASPP was adjusted to get a larger receptive field and the decoder also made corresponding adjustments. To achieve this goal, compared with the Xception structure in the DeepLabV3+ original text, we adjusted the stride of the third block of entry flow to 1, and correspondingly doubled the rate of the atrous convolution in the middle flow and the exit flow. The adjusted Xception structure is shown in Figure 5.

3.5. Loss Function

DeepLab series model apply the cross-entropy (CE) loss function which is widely applied in classification tasks to classify every single pixel. This loss function checks each pixel separately and compares the class prediction (the pixel vector in the depth direction) with the hot encoding target vector. The cross-entropy function can be formulated as follows:where refers to the number of categories, refers to the sign function (0 or 1), and refers to the predicted probability that the observed sample belongs to category . Thus, we can consider that the pixels in the image are learned equally with the cross-entropy loss function, and this kind of equality does not apply to the situation where the sample is extremely uneven. In coal crack CT images, the number of pixels corresponding to the crack is much smaller than that of the background. Taking the dataset we established as an example, the proportion of crack pixels in the whole image is less than 5%. Dice Loss [39] was proposed in 2016, designed to deal with scenarios where positive and negative samples are strongly imbalanced in semantic segmentation. Different from distribution-based cross-entropy loss, the Dice function is based on region and is used to calculate the similarity between two images. The Dice coefficient and Dice loss function can be formulated as follows:where and refer to two different samples, they are ground truth and predict mask in segmentation tasks. In a different way, the Dice coefficient and the loss function can be formulated as follows: Where , refer to true positive, false positive, and false negative. However, although Dice loss can calculate the similarity of two contours, it may cause the gradient to change drastically, and the training is difficult so it is not credible to a certain extent sometimes. In this experiment, we did a weighted additive combination of CE loss and Dice loss to add contour features to the classification of pixels and avoid the shock of loss in training. The new loss function is formulated as follows:where is a weight coefficient for adjusting the proportion of CE function. It is a constant in the range [0, 1], and the value of this article is 0.5. The experiment proved that this new loss function effectively improves the accuracy of crack segmentation compared to using the cross-entropy loss function alone.

4. Experiments

All experiments were done in the following environment: Intel (R) Xeon(R) Bronze 3204 CPU @ 1.90 GHz, 32 GB RAM, GPU Tesla V100, CentOS Linux release 7.6.1810. And experiments related to deep learning are completed under PyTorch 1.10.0. We compare the proposed method with existing representative algorithms to the performance of the model on the dataset we established and also compare the visual effects of these segmentation results.

4.1. Metrics

In order to evaluate our work, in addition to the visual effects of segmentation images, we introduced four authoritative evaluation indicators commonly used in semantic segmentation. All experiments are performed on the dataset we established.

Suppose is the number of categories (background is excluded), indicates that the total number of pixels that are mispredicted. means, means and means . Four evaluations are(1)PA, which means the rate of the number of predicted right pixels to total pixels. It can be expressed as follows:(2)MPA, which means the average pixel accuracy of each category. It can be expressed as follows:(3)MIoU, which represents the IoU of each category. It can be expressed as follows:(4)FWIoU, which sets the weight for IoU of each class according to the frequency of its appearance. It can be expressed as follows:

4.2. Ablation Experiments

To scrutinize the effectiveness of the methods we proposed, we conduct experiments with two different backbones which are used in the original DeepLabV3+. Hyperparameters used by these methods are shown in Table 1. When using the ResNet101 as the backbone, the model is trained by three following strategies: (1) DeepLabV3+-res, which is an unmodified DeepLabV3+ model applying ResNet101 [40] as the backbone. (2) DeepLabV3+-res-8 changes OS to 8 and ASPP rates to (1, 12, 24, 36) on the basis of DeepLabV3+-res. (3) DeepLabV3+-x-8-NL changes loss function to the new loss on the basis of DeepLabV3+-x-8. And, the experiment results are shown in Table 2. When using the Xception as the backbone, the model is trained by three following strategies: (1) DeepLabV3+-x, which is an unmodified DeepLabV3+ model applying Xception as the backbone. (2) DeepLabV3+-x-8 changes OS to 8 and ASPP rates to (1,12,24,36) on the basis of DeepLabV3+-x. (3) DeepLabV3+-x-8-NL changes loss function to the new loss on the basis of DeepLabV3+-x-8. Results are shown in Table 3. In order to more vividly reflect the advantages of our method, histograms were drawn in Figures 6 and 7.

It can be seen in Tables 2 and 3, adjusted the OS to 8 and using new loss improve all evaluation metrics. When the backbone is RseNet101, by our methods, the Acc, mAcc, MIoU, and FWIoU improved by 0.1%, 1.1%, 2.3%, and 0.4%. When the backbone is Xception, by our methods, the Acc, mAcc, MIoU, and FWIoU improved by 0.1%, 0.8%, 2.0%, and 0.4%. Experimental results confirm the effectiveness of the proposed method.

4.3. Comparing with Exiting Methods

We compare the proposed method in this paper with other three typical methods: (1) FCN, the most classic semantic segmentation network. (2) U-net, the most widely used segmentation network in the medical field. (3) PSPNet, a very efficient model which applies a pyramid pooling module to fusion features on different levels. The feature extractors of all networks apply transfer learning techniques and are fine-tuned on our augmented dataset. Besides, all these models have trained 100 epochs with regular hyperparameters, and the convergence of these models was guaranteed. We also implement two different threshold segmentation methods on our test set to compare the segmentation effects between traditional methods and deep learning methods: (1) Otsu [41] (2) Max Entropy [42]. The two methods represent different automatic threshold determination methods.

A comparison of evaluation metrics of all these methods is shown in Table 4. As we can see, since the proportion of cracks in the images is very low, and the judgment error rate of image background pixels is low so that the total pixel accuracy of every method is not very different. However, the performance of different methods can still be judged from the remaining evaluation indicators. PSPNet and FCN may have good performance in semantic segmentation under natural conditions, but they do not perform well on the coal crack CT image dataset. U-net is designed to deal with medical images which have similarities with the images we used, so this model can have a nice performance. As the best performing comparison method, U-net achieved an Acc of 98.5%, mAcc of 94.2, MIoU of 86.5%, and a FWIoU of 97.2% which are 0.1%, 1.2%, 2.9%, and 0.5% lower than proposed method. A histogram comparison of the experimental results is shown in Figure 8.

Figure 9 shows a visual effect comparison of segmentation results of all methods. It can be seen that the method we proposed has a certain improvement in the dataset in this paper. Compared with the original DeepLabV3+, the ability to capture details has been improved and cracks whose pixel values are close to the background can be identified. Many locations that should be connected become disconnected during the segmentation process of other models, this problem is also alleviated by our method. Other deep learning methods even have a large number of separation cracks sticking together, which is caused by the insufficient segmentation performance. In addition, the shape and structure of cracks is not guaranteed. Experiments show that less downsampling and the addition of the Dice loss function allow the details to be effectively preserved and recovered. Comparing deep learning methods and traditional threshold segmentation methods, the noise problem is difficult to solve for threshold even though different thresholding methods are used. Although some deep learning methods are rough for object segmentation, they often do not misidentify noise.

5. Conclusions

In this paper, we propose a deep learning method to complete the CT image segmentation task of coal crack CT images. Since the target in the crack image is small, and downsampling can lose the accuracy to some extent, the OS of the backbone was proposed to be reasonably adjusted to reduce the loss of accuracy and adjusted the structure of ASPP to adapt to this adjustment. In order to solve the problem of uneven sample distribution, CE loss and Dice loss were combined to define a new loss function. The experimental results show that our method is effective and has practical application value.

Nevertheless, the presented method can be improved in the following directions. First, scale-up datasets to accommodate more complex environments. And, more data will be added to this dataset which has more complex topologies and tiny targets. Moreover, we will define a new loss function according to the specific target proportion in the data set combined with probability mathematics which may be more adaptable to the needs of the field than the loss function in this paper.

Data Availability

The experimental data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

The authors would like to thank the grants supported by the Support Plan of Scientific and Technological Innovation Team in Universities of Henan Province (Grant no. 20IRTSTHN013), the State Key Laboratory Cultivation Base for Gas Geology and Gas Control (Henan Polytechnic University) (Grant no. WS2020B13).

References

Z. Qu, S. Chen, and X. Wang, “A secure controlled quantum image steganography algorithm,” Quantum Information Processing, vol. 19, no. 10, pp. 380–425, 2020.
View at: Publisher Site | Google Scholar
Z. Qu, H. Sun, and M. Zheng, “An efficient quantum image steganography protocol based on improved EMD algorithm,” Quantum Information Processing, vol. 20, no. 2, pp. 53–29, 2021.
View at: Publisher Site | Google Scholar
Y. Tang, H. Lian, L. Li, X. Wang, and X. Yan, “A randomness detection method of ZigBee protocol in a wireless sensor network,” Sensors, vol. 18, no. 11, p. 3962, 2018.
View at: Publisher Site | Google Scholar
Y. Tang, J. Li, X. Yan, and Q. Zhao, “Edge-cloud-assisted multiuser forward secure searchable encryption (EMFSSE) scheme in the P2P networking environment,” Mobile Information Systems, vol. 2021, pp. 2021–2114, 2021.
View at: Publisher Site | Google Scholar
Z. Qu, Y. Huang, and M. Zheng, “A novel coherence-based quantum steganalysis protocol,” Quantum Information Processing, vol. 19, no. 10, pp. 362–419, 2020.
View at: Publisher Site | Google Scholar
Z. Qu, X. Liu, and S. Wu, “Quantum identity authentication protocol based on three-photon quantum error avoidance code,” in Proceedings of the 2019 International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom) and IEEE Smart Data (SmartData), pp. 471–475, IEEE, Atlanta, GA, USA, July 2019.
View at: Google Scholar
Z. Luo, Y. Zhang, K. Wang, and L. Sun, “Detection of pine cones in natural environment using improved YOLOv4 deep learning algorithm,” Computational Intelligence and Neuroscience, vol. 2021, pp. 2021–12, 2021.
View at: Publisher Site | Google Scholar
F. Dong and Y. Ma, “Railway traffic emergency management relying on image recognition technology in the context of big data,” Computational Intelligence and Neuroscience, vol. 2022, pp. 1–12, 2022.
View at: Publisher Site | Google Scholar
Y. Tang, M. Zhao, and L. Li, “Secure and efficient image compression-encryption scheme using new chaotic structure and compressive sensing,” Security and Communication Networks, vol. 2020, pp. 1–15, 2020.
View at: Publisher Site | Google Scholar
D. Khan, M. A. Muneer, Z. U. Nisa et al., “Effect of climatic factors on stem biomass and carbon stock of Larix gmelinii and Betula platyphylla in Daxing’anling Mountain of Inner Mongolia, China,” Advances in Meteorology, vol. 1, no. 10, 2019.
View at: Publisher Site | Google Scholar
H. Oliveira and P. L. Correia, “Automatic road crack segmentation using entropy and image dynamic thresholding,” in Proceedings of the 2009 17th European Signal Processing Conference, Glasgow, UK, 24-28 August 2009.
View at: Google Scholar
S. Wu and Y. Liu, A Segment Algorithm for Crack Dection, Electrical & Electronics Engineering, 2012.
J. Zhou, “Wavelet-based pavement distress detection and evaluation,” Optical Engineering, vol. 45, no. 2, pp. 027007–027411, 2006.
View at: Publisher Site | Google Scholar
D. Andrushia, N. Anand, and P. Arulraj, “Anisotropic diffusion based denoising on concrete images and surface crack segmentation,” International Journal of Structural Integrity, vol. 11, no. 3, pp. 395–409, 2019.
View at: Publisher Site | Google Scholar
Y. Shi, L. Cui, Z. Qi, F. Meng, and Z. Chen, “Automatic road crack detection using random structured forests,” IEEE Transactions on Intelligent Transportation Systems, vol. 17, no. 12, pp. 3434–3445, 2016.
View at: Publisher Site | Google Scholar
B. Nair, “Fast edge detection using structured forests,” International Journal of Emerging Trends & Technology in Computer Science, vol. 34, 2016.
View at: Google Scholar
R. Amhaz, S. Chambon, J. Idier, and V. Baltazart, “Automatic crack detection on two-dimensional pavement images: an algorithm based on minimal path selection,” IEEE Transactions on Intelligent Transportation Systems, vol. 17, no. 10, pp. 2718–2729, 2016.
View at: Publisher Site | Google Scholar
G. Li, X. Zhao, K. Du, F. Ru, and Y. Zhang, “Recognition and evaluation of bridge cracks with modified active contour model and greedy search-based support vector machine,” Automation in Construction, JUN, vol. 78, pp. 51–61, 2017.
View at: Google Scholar
E. Shelhamer, J. Long, and T. Darrell, “Fully convolutional networks for semantic segmentation,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 39, no. 4, pp. 640–651, 2017.
View at: Publisher Site | Google Scholar
O. Ronneberger, P. Fischer, and T. Brox, U-net: Convolutional Networks for Biomedical Image Segmentation, Springer International Publishing, 2015.
V. Badrinarayanan, A. Kendall, and R. Cipolla, SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation, Entry, 2017.
H. Zhao, J. Shi, X. Qi, X. Wang, and J. Jia, Pyramid Scene Parsing Network, IEEE Computer Society, 2016.
Y. Liu, J. Yao, X. Lu, R. Xie, and L. Li, “DeepCrack: a deep hierarchical feature learning architecture for crack segmentation,” Neurocomputing, vol. 338, no. 21, pp. 139–153, 2019.
View at: Publisher Site | Google Scholar
S. Wang, X. Wu, Y. Zhang, X. Liu, and L. Zhao, “A neural network ensemble method for effective crack segmentation using fully convolutional networks and multi-scale structured forests,” Machine Vision and Applications, vol. 31, no. 7-8, p. 60, 2020.
View at: Publisher Site | Google Scholar
E. Protopapadakis, A. Voulodimos, A. Doulamis, N. Doulamis, and T. Stathaki, “Automatic crack detection for tunnel inspection using deep learning and heuristic image post-processing,” Applied Intelligence, vol. 343, 2019.
View at: Google Scholar
L. C. Chen, Y. Zhu, G. Papandreou, F. Schroff, and H. Adam, Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation, Springer, Cham, 2018.
Y. Li, H. Cui, P. Zhang, D. Wang, and J. Wei, “Three-dimensional visualization and quantitative characterization of coal fracture dynamic evolution under uniaxial and triaxial compression based on μCT scanning,” Fuel, vol. 262, p. 116568, 2020.
View at: Publisher Site | Google Scholar
K. Simonyan and A. Zisserman, “Very deep convolutional networks for large-scale image recognition,” Computer Science, vol. 22, p. 12866, 2014.
View at: Google Scholar
K. He, X. Zhang, S. Ren, and J. Sun, Deep Residual Learning for Image Recognition, IEEE, 2016.
F. Chollet, “Xception: deep learning with depthwise separable convolutions,” in Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21-26 July 2017.
View at: Publisher Site | Google Scholar
Z. Lei, F. Yang, D. Zhang, and J. Z. Ying, “Road crack detection using deep convolutional neural network,” IEEE International Conference on Image Processing, 2016.
View at: Google Scholar
G. Suh and Y. J. Cha, “Deep faster R-CNN-based automated detection and localization of multiple types of damage,” Sensors and Smart Structures Technologies for Civil, Mechanical, and Aerospace Systems, 2018.
View at: Google Scholar
Y. Liu, J. Yao, X. Lu, R. Xie, and L. Li, “DeepCrack: a deep hierarchical feature learning architecture for crack segmentation,” Neurocomputing, vol. 338, no. 21, pp. 139–153, 2019.
View at: Publisher Site | Google Scholar
E. Protopapadakis, A. Voulodimos, A. Doulamis, N. Doulamis, and T. Stathaki, “Automatic crack detection for tunnel inspection using deep learning and heuristic image post-processing,” Applied Intelligence, vol. 76, pp. 28–86, 2019.
View at: Google Scholar
X. Xue, “FCN-based intelligent identification and fractal reconstruction of pore-fracture network in coal by micro CT scanning,” Chinese Journal of Rock Mechanics and Engineering, vol. 39, no. 6, p. 19, 2020.
View at: Google Scholar
L. C. Chen, G. Papandreou, I. Kokkinos, K. Murphy, and A. L. Yuille, “Semantic image segmentation with deep convolutional nets and fully connected CRFs,” Computer Science, vol. 4, pp. 357–361, 2014.
View at: Google Scholar
L. C. Chen, G. Papandreou, I. Kokkinos, K. Murphy, and A. L. Yuille, “DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 40, no. 4, pp. 834–848, 2018.
View at: Publisher Site | Google Scholar
L. C. Chen, G. Papandreou, F. Schroff, and H. Adam, Rethinking Atrous Convolution for Semantic Image Segmentation, IEEE, NEW JERSEY, 2017.
F. Milletari, N. Navab, and S. A. Ahmadi, “V-net: fully convolutional neural networks for volumetric medical image segmentation,” 2016 Fourth International Conference on 3D Vision (3DV), vol. 76, p. 65654, 2016.
View at: Google Scholar
K. He, X. Zhang, S. Ren, and J. Sun, Deep Residual Learning for Image Recognition, IEEE, NEW JERSEY, 2016.
N. Otsu, “A threshold selection method from gray-level histograms,” IEEE Transactions on Systems, Man, and Cybernetics, vol. 9, no. 1, pp. 62–66, 1979.
View at: Publisher Site | Google Scholar
T. Pun, “A new method for grey-level picture thresholding using the entropy of the histogram,” Signal Processing, vol. 2, no. 3, pp. 223–237, 1980.
View at: Publisher Site | Google Scholar

Copyright

Copyright © 2022 Jinxia Yu et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Computational Intelligence and Neuroscience

Lightweight Computational Intelligence for Sequential Data Analysis in Edge Computing

Intelligent Identification of Coal Crack in CT Images Based on Deep Learning

Abstract

1. Introduction

2. Related Work

3. Methodologies

3.1. Dataset

3.2. Atrous Spatial Pyramid Pooling

3.3. Encoder-Decoder

3.4. Adjusted Xception as Backbone

3.5. Loss Function

4. Experiments

4.1. Metrics

4.2. Ablation Experiments

4.3. Comparing with Exiting Methods

5. Conclusions

Data Availability

Conflicts of Interest

Acknowledgments

References

Copyright