Segmentation and Classification of White Blood Cells Using the UNet

Alharbi, Amal H.; Aravinda, C. V.; Lin, Meng; Venugopala, P. S.; Reddicherla, Phalgunendra; Shah, Mohd Asif

doi:https://doi.org/10.1155/2022/5913905

Contrast Media & Molecular Imaging

On this page

Abstract Introduction Literature Review Methods Conclusion Data Availability Conflicts of Interest Acknowledgments References Copyright Related Articles

Special Issue

Automated Interpretable and Lightweight Deep Learning Models for Molecular Images

View this Special Issue

Research Article | Open Access

Volume 2022 | Article ID 5913905 | https://doi.org/10.1155/2022/5913905

Segmentation and Classification of White Blood Cells Using the UNet

Amal H. Alharbi,¹C. V. Aravinda,²Meng Lin,³P. S. Venugopala,²Phalgunendra Reddicherla,⁴and Mohd Asif Shah⁵

Academic Editor: Avleen Malhi

Received07 Apr 2022

Revised04 May 2022

Accepted24 Jun 2022

Published11 Jul 2022

Abstract

In the bone marrow, plasma cells are made up of B lymphocytes and are a type of WBC. These plasma cells produce antibodies that help to keep bacteria and viruses at bay, thus preventing inflammation. This presents a major challenge for segmenting blood cells, since numerous image processing methods are used before segmentation to enhance image quality. White blood cells can be analyzed by a pathologist with the aid of computer software to identify blood diseases accurately and early. This study proposes a novel model that uses the ResNet and UNet networks to extract features and then segments leukocytes from blood samples. Based on the experimental results, this model appears to perform well, which suggests it is an appropriate tool for the analysis of hematology data. By evaluating the model using three datasets consisting of three different types of WBC, a cross-validation technique was applied to assess it based on the publicly available dataset. The overall segmentation accuracy of the proposed model was around 96%, which proved that the model was better than previous approaches, such as DeepLabV3+ and ResNet-50.

1. Introduction

The disorders in the blood cells are diagnosed in the laboratory using blood microscopy. The accuracy of the results depends on the pathologist/hematologist. Leukocytes, RBCS, and platelets are the three main components of blood. Doctors will be able to identify blood diseases such as blood cancer, anemia, and malaria using micrographs recorded using computer-aided techniques. An important feature of white blood cells is the type and count of leukocytes, which determine the type and severity of infection. WBCs that are having small granules are further categorized as neutrophils, basophils, and eosinophils. There are three types of leukocytes that make up the immune system. The traditional method of counting white blood cells is laborious and requires medical expertise. Cell segmentation and feature extraction are required for the analysis of WBC. In the process of white blood cell segmentation, leukocytes are extracted from blood smear images to find the necessary features for further processing to distinguish them from others. There are two types of segmentation techniques: boundary-based and region-based. In image processing, segmentation helps to identify the objects in an image. Region-based techniques segment images use properties like those of boundaries; boundary-based techniques and segmented images are based on changes in intensity at the boundary. To distinguish WBC from the background, we applied the region-based segmentation. Due to the irregularity of the shape of leukocytes, it is a challenging task for the segmentation process. For example, Figures 1 and 2 show various types of white blood cells along with their ground truth images. Literature indicates that few automated systems are presently available to analyze blood smear images for leukocytes. Research is in progress to design a system that automatically segments leukocytes with maximum accuracy in a short amount of time. In some studies, the authors use algorithms to segment merely WBC, while in others, they also publish methods for segmenting the nucleus and the cytoplasm [2, 3].

(a)

(b)

(c)

The two parts of WBC are the unsupervised learning model and supervised learning model. Thresholding [4, 5], clustering-based technique [6], fringe-situated [7], zone-positioned [8], and fuzzy models [9, 10] are used in unsupervised learning. Based on the supervised algorithm, every pixel was categorized into two in the ROI and nonregion of interest. To enhance detection, DLN was used in segmenting the images. DLN was found to be a useful concept in medical image analysis due to its performance. Deep learning-based models require a large amount of memory and processing resources for training and testing data. The system is trained repeatedly faster on GPUs than on CPUs [11]. Deep learning-based segmentation consists of instance segmentation and semantic segmentation. During instance-based segmentation, the mask is used to detach different ROIS and utilizes masks to classify the images. To characterize WBCs as shown in Figure 3, based on the mononuclear cells, it can be divided into either mononuclear or polynuclear cells [12]. The thresholding method has been used in several prior studies to analyze blood cell frequency to detect cancer. However, finding the optimal threshold value is laborious and complex; nevertheless, previous studies have used this methodology for cancer detection. CNN models can extract thousands of features from images automatically; therefore, we used it, and so, they can extract thousands of features from images automatically. It takes a long time to manually extract many features, whereas CNN models can extract thousands of features quickly and accurately. The segmentation of leukocytes was accomplished using the more accurate model developed to determine the boundaries of internal cells at the cellular level. This implementation can transfer information from encoder to decoder with no loss of detail while avoiding losing the microdetails. The focus of this research is to build a system that can accomplish the task of segmenting and classifying WBC, enabling physicians to easily determine the exact position of a myeloma cancer cell.

2. Literature Review

This section covers the research articles of various researchers who worked on the method of detecting cancer cells in WBCs by using an image processing algorithm to detect myeloma, which is developed as discussed in [13]. A research article by Joshi et al. covered the portion to classify and segment WBC for the diagnosis of leukemia. They applied the Otsu algorithm to enhance and segment WBC. The K-nearest neighbors (KNN) classifier has been used to identify myeloma from normal B cells [14]. Liu et al. worked towards classifying and recognizing using the ML algorithm to extract data. Artificial intelligence utilizes image recognition technology. Intelligence is based on the concept of analyzing data using digital images that can be processed using a computer and then extracting data from them [15]. Martinez et al. illustrated low-dose CT images for detecting myeloma in the bone marrow [16]. Xu et al. worked on image classification as well as the type and position of objects [17], VGG [18], inception [19], fuzzy logic [20], Faster R-CNN [21], SSD [19], and YOLO [11], which are all methods for segmentation and object detection. They concluded that for image classification, they had been using a statistical deep learning framework that emphasized object categorization inside the image [22, 23]. DL segmentation is broadly classified into instance and semantic segmentation, identical masks are used for ROI, and semantic is used for a single image. This approach recognizes individual cells instead of classifying four cells as one instance. The proposed research work will be carried out on semantic segmentation using the CNN to segment WBC cells.

From the above literature, it is found that the existing models suffer from hyperparameters tuning [1, 24–26], gradient vanishing [27, 28], and overfitting [29–31] problems. Therefore, an efficient model is required that can overcome these issues.

3. Resources and Methods

ISBI C-NMC 2019 data were used in this study. Approximately sixty cancer patients’ datasets were collected and forty-one individuals were added to this dataset. In total, around ten thousand six hundred cells are with a train, validation, and test split of 80%, 10%, and 10%, respectively [12]. To maintain integrity, the first steps of preprocessing are standardization and normalization. These images were first calculated to find the global mean and standard deviation (STD). The following equation is normalized equation , where indicates the global mean Yi of the image equation, the STD, and . The training dataset underwent data augmentation.

WBC segmentation is carried out using the MASK-RCNN network for blood images through the semantic segmentation process; meanwhile, these WBCs relate to positive classes and their pixel relates to negative, which is shown in Figure 4; for the input images, the initial process to train the model, ResNet [1, 24] was scaled to 224 ∗ 224 to maintain stability and to scale the output of the network. Later, for the ground truth, sample labels were applied to each pixel by making them to the chosen label/class. A variety of convolutional algorithms [25, 26] followed by postprocessing techniques were applied to enhance future extraction for classification [27, 28] during the decoder phase.

3.1. Process of Semantic Segmentation

This is a type of pixel-level image classification; during the segmentation process, every pixel present in the image will be categorized into one predetermined group. In the blood sample, there will be several leukocytes; while doing segmentation, each WBC pixel will be labeled as an object, such that it will help in the recognition and classification of leukocyte types present in the sample. However, during feature extraction through the deep layer network, due to large layers, this will lead to gradient descent. To overcome these issues, the ResNet model was used, the advantage of using this model was to prevent the colloidal occurring in the layers and combine their outputs. To calculate the loss that was the expected value and the gained value, the backpropagation algorithm technique was implicated. The complexity of this technique will consume a large amount of memory. To solve these issues, the encoder and decoder were combined for upsampling and downsampling. The traditional CNN algorithm [29, 30] will calculate the probability for each class label, but upsampling will change the output dimensions in such a way that matches the input dimensions. This proposed architecture for segmentation tasks has several advantages. To start with, residual units assist in deep learning. A second advantage of accumulating features with recurrent residual convolutional layers is that they enable better feature representation. Third, it enables us to design better UNet [31] architecture with the same number of network parameters, while obtaining better performance for medical image segmentation.

3.2. UNet Architecture

In this architecture, there are three main features, namely, feature reconstruction, feature extraction, and feature fusion. During the process of feature extraction, the location-based feature encoder was mainly used to extract multiscale features based on convolutional blocks and residual blocks. Figure 5 shows that to modify the feature maps of the given image size, convolutional and deconvolutional techniques were applied to the sample for reconstructing the feature stage. The developing and expansive paths consist of three layers in each path of blocks, where each layer will follow 2 ∗ 2 max pooling on the developing path. In the convolution process, the two layers of upsampling are concatenated with the two layers of merging layers. To generate the pixel-by-pixel value scores, a 1 ∗ 1 layer was activated with the sigmoid function to be used as the final output layer. In every layer of the block like 1, 2, and 3, the layer consists of 112, 224, and 448 filters, whereas the expansive path consists of 224, 122, and 122, respectively. Original UNet architecture and proposed CNN dropouts were used on the expansive path, as given in Table 1.

3.3. Layers of the Proposed Model

The layers of the proposed model used are given in Table 1.

3.4. Process of the ResNet Architecture Model

This model consists of 50 layers, and CNN blocks were being used several times in this architecture. In every layer, it consists of a batch normalization of 2D. The advantage of the ResNet algorithm was used to solve the issue of diminishing gradient problems by passing the connection. Figure 6 shows that in this type of a network, the hidden layers will drop to zero after passing several multiple layers.

4. Performance Measure

The following parameters were used to measure the performance of system efficiency: mean, intersection-over-union (IoU), and dice similarity coefficient (DSC).

4.1. Mean

The mean reflects the percentage of true positive in each group of pixels as shown in the following equation:

4.2. IoU

To identify the difference between the predicted and target output, IoU is applied, which is mentioned in the following equations:

4.3. D.S.C

The following equation is used to assess the actual and predicted values that are like each other. These values will range from zero to one where one is closer to the actual and predicted value.

5. Implementation Details

The experiments were run on a server with the windows 10 operating system, 2.30 GHz processor, 128 GB RAM, and NVIDIA Lenovo Think station. The samples were trained by a series of data argumentation, image rotation, zooming, and scaling training samples. The cross-validation process was applied to analyze the proposed model performance.

5.1. Database Creation

The freely available datasets were used for experiment purposes. Set 1 and set 2 consist of cropped WBC samples and a set of three raw samples. Jiangxi Tecom Science Corporation, China, provided set 1 containing 300 WBC samples with a size of 120 ∗ 120 and 24 bit color. Set 2 consists of a hundred samples of 300 ∗ 300 size. Set 3 consists of 720 ∗ 576 samples.

5.2. Results and Discussion

Figure 7 shows that different intensities are caused due to different illuminations. Hence, the size and shape of the leukocyte vary from each other. The experiment was conducted on the proposed model on publicly available datasets. The six-fold cross-validation was performed on the samples to make sure that all samples were tested properly. The segmentation accuracy of 96% was achieved. The overall comparison of the existing model of various researchers’ results was compared with the proposed model as given in Table 2 and the loss function as given in Table 3. The output sample of the proposed sample is shown in Figure 8.

(a)

(b)

(c)

(d)

(a)

(b)

(c)

(d)

6. Conclusion

In this work, the method for identifying and classification of WBC was discussed in detail with the experimental results. The proposed method performed robustness in segmenting WBC, peripheral blood, and bone marrow images with a mean accuracy of 96%. This method can also be applied to the nucleus and cytosol separation. The proposed method can be further continued for better results by using YOLO architecture to identify the objects and classify the same. More object identification and classification techniques can be studied as an extension of this proposed effort to attain better results [32].

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This research was funded by Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2022R120), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

References

H. Kaushik, D. Singh, M. Kaur, H. Alshazly, A. Zaguia, and H. Hamam, “Diabetic retinopathy diagnosis from fundus images using stacked generalization of deep models,” IEEE Access, vol. 9, pp. 108276–108292, 2021.
View at: Publisher Site | Google Scholar
L. Yang, P. Meer, and D. J. Foran, “Unsupervised segmentation based on robust estimation and color active contour models,” IEEE Transactions on Information Technology in Biomedicine, vol. 9, no. 3, pp. 475–486, 2005.
View at: Publisher Site | Google Scholar
J. Prinyakupt and C. Pluempitiwiriyawej, “Segmentation of white blood cells and comparison of cell morphology by linear and naïve Bayes classifiers,” BioMedical Engineering Online, vol. 14, no. 1, p. 63, 2015.
View at: Publisher Site | Google Scholar
S. Mohapatra, D. Patra, and K. Kumar, “TBlood microscopic image segmentation using rough sets,” in Proceedings of theInternational Conference on Image Information Processing, pp. 53–58, Shimla, India, November-2011.
View at: Publisher Site | Google Scholar
N. M. Salem, “Segmentation of white blood cells from microscopic images using K-means clustering,” in Proceedings of the National Radio Science Conference, pp. 371–376, airo, Egypt, April 2014.
View at: Publisher Site | Google Scholar
Z. Liu, J. Liu, X. Xiao et al., “Segmentation of white blood cells through nucleus mark watershed operations and mean shift clustering,” Sensors, vol. 15, no. 9, pp. 22561–22586, 2015.
View at: Publisher Site | Google Scholar
F. Sholeh, “White blood cell segmentation for fresh blood smear images,” in Proceedings of the International Conference on Advanced Computer Science and Information Systems (ICACSIS), pp. 425–429, Sanur Bali, Indonesia, September 2013.
View at: Publisher Site | Google Scholar
S. Arslan, E. Ozyurek, and C. Gunduz-Demir, “A color and shape based algorithm for segmentation of white blood cells in peripheral blood and bone marrow images,” Cytometry, Part A, vol. 85, no. 6, pp. 480–490, 2014.
View at: Publisher Site | Google Scholar
M. Ghosh, D. Das, C. Chakraborty, and A. K. Ray, “Automated Leukocyte Recognition Using Fuzzy Divergence,” Micron, vol. 41, no. 7, pp. 840–846.
View at: Publisher Site | Google Scholar
T. Chaira, “Accurate segmentation of leukocyte in blood cell images using Atanassov’s intuitionistic fuzzy and interval Type-II fuzzy set theory.,” Micron, vol. 61, pp. 1–8.
View at: Publisher Site | Google Scholar
Y. Lei, B. Du, J. Qian, and Z. Feng, “Research on-ear recognition based on SSD MobileNet v1 network,” in Proceedings of the Chinese automation Congress (CAC), pp. 4371–4376, IEEE, Shanghai, China, November-2020.
View at: Publisher Site | Google Scholar
A. Gupta and R. Gupta, ISBI 2019 C-NMC Challenge: Classification in Cancer Cell Imaging; Select Proceedings, Springer, Singapore, 2019.
View at: Publisher Site
P. P. Banik, R. Saha, and K.-D. Kim, “An automatic nucleus segmentation and CNN model based classification method of white blood cell,” Expert Systems with Applications, vol. 149, Article ID 113211, 2020.
View at: Publisher Site | Google Scholar
S. Bourouis, R. Alroobaea, S. Rubaiee, M. Andejany, F. M. Almansour, and N. Bouguila, “Markov chain Monte Carlo-based Bayesian inference for learning finite and infinite inverted beta-Liouville mixture models,” IEEE Access, vol. 9, pp. 71170–71183, 2021.
View at: Publisher Site | Google Scholar
M. D. Joshi, A. H. Karode, and S. R. Suralkar, “white blood cells segmentation and classification to detect acute leukemia,” International Journal of Emerging Trends & Technology in Computer Science (IJETTCS), vol. 2, pp. 147–151.
View at: Google Scholar
L. Liu, Y. Wang, and W. Chi, “Image recognition technology based on machine learning,” IEEE Access, vol. 1, p. 99, 2020.
View at: Publisher Site | Google Scholar
L. Xu, G. Tetteh, J. Lipkova et al., “Automated whole-body bone lesion detection for multiple myeloma on 68Ga-pentixafor PET/CT imaging using deep learning methods,” Contrast media & molecular imaging, vol. 2018, Article ID 2391925, p. 11, 2018.
View at: Publisher Site | Google Scholar
A. Bozorgpour, “Multi-scale regional attention Deeplab3+: multiple myeloma plasma cells segmentation in microscopic images,” MICCAI Computational Pathology (COMPAY) Workshop, vol. 156, 2021.
View at: Google Scholar
X. Xia, C. Xu, and B. Nan, “Inception-v3 for flower classification,” in Proceedings of the 2017 2nd international conference on image, vision, and computing (ICIVC), pp. 783–787, Chengdu, June 2017.
View at: Publisher Site | Google Scholar
A. Mahajan and S. Chaudhary, “Categorical image classification based on representational deep network (RESNET),” in Proceedings of the 3rd International Conference on Electronics, Communication, and Aerospace Technology (ICECA), pp. 327–330, IEEE, Coimbatore, India, June 2019.
View at: Publisher Site | Google Scholar
C. V. Aravinda, L. Meng, K. R. Uday Kumar Reddy, and A. Prabhu, “Signature recognition and verification using multiple classifiers combination of hu’s and hog feature,” in Proceedings of the 2019 International Conference on Advanced Mechatronic Systems (Icamechs), pp. 63–68, Kusatsu, Japan, August 2019.
View at: Publisher Site | Google Scholar
P. Jana, A. Biswas, and Mohana, “YOLO based detection and classification of objects in video records,” in Proceedings of the 3rd IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT), pp. 2448–2452, IEEE, Bangalore, India, May 2018.
View at: Publisher Site | Google Scholar
M. Lin, B. Lyu, Z. Zhang, C. V. Aravinda, N. Kamitoku, and K. Yamazaki, “Oracle bone inscription detector based on SSD,” New Trends in Image Analysis and Processing – Iciap 2019, pp. 126–136, 3rd ed edition.
View at: Google Scholar
S. C. Mondal, P. L. C. Marquez, and M. O. Tokhi, “Analysis of mechanical adhesion climbing robot design for wind tower inspection,” Journal of Artificial Intelligence Technology, vol. 1, no. 4, pp. 219–227, 2021.
View at: Publisher Site | Google Scholar
A. Balakrishna and P. K. Mishra, “Modelling and analysis of static and modal responses of leaf spring used in automobiles,” International Journal of Hydromechatronics, vol. 4, no. 4, pp. 350–367, 2021.
View at: Publisher Site | Google Scholar
M. Kaur, D. Singh, V. Kumar, B. B. Gupta, and A. A. Abd El-Latif, “Secure and energy efficient-based E-health care framework for green internet of things,” IEEE Transactions on Green Communications and Networking, vol. 5, no. 3, pp. 1223–1231, 2021.
View at: Publisher Site | Google Scholar
P. K. Singh, “Data with non-Euclidean geometry and its characterization,” Journal of Artificial Intelligence Technology, vol. 2, pp. 3–8, 2022.
View at: Publisher Site | Google Scholar
D. Jie, G. Zheng, Y. Zhang, X. Ding, and L. Wang, “Spectral kurtosis based on evolutionary digital filter in the application of rolling element bearing fault diagnosis,” International Journal of Hydromechatronics, vol. 4, no. 1, p. 27, 2021.
View at: Publisher Site | Google Scholar
D. Singh, V. Kumar, M. Kaur, M. Y. Jabarulla, and H. N. Lee, “Screening of COVID-19 suspected subjects using multi-crossover genetic algorithm based dense convolutional neural network,” IEEE Access, vol. 9, pp. 142566–142580, 2021.
View at: Publisher Site | Google Scholar
Y. Xu, Y. Li, and C. Li, “Electric window regulator based on intelligent control,” J. Artif. Intell. Technol., vol. 1, pp. 198–206, 2021.
View at: Publisher Site | Google Scholar
T. V. Hahn and C. K. Mechefske, “Self-supervised learning for tool wear monitoring with a disentangled-variational-autoencoder,” International Journal of Hydromechatronics, vol. 4, no. 1, pp. 69–98, 2021.
View at: Publisher Site | Google Scholar
R. M. Roy and P. M. Ammer, “Segmentation of leukocyte by semantic segmentation model: a deep learning approach,” Biomedical Signal Processing and Control, vol. 65, Article ID 102385, 2021.
View at: Publisher Site | Google Scholar

Copyright

Copyright © 2022 Amal H. Alharbi et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies