Abstract

Classification of the morphology of red blood cells (RBCs) plays an extremely important role in evaluating the quality of long-term stored blood, as RBC storage lesions such as transformation of discocytes to echinocytes and then to spherocytes may cause adverse clinical effects. Most RBC segmentation and classification methods, limited by interference of staining procedures and poor details, are based on traditional bright field microscopy. In the present study, quantitative phase imaging (QPI) technology was combined with deep learning for automatic classification of RBC morphology. QPI can be used to observe unstained RBCs with high spatial resolution and phase information. In deep learning based on phase information, boundary curvature is used to reduce inadequate learning for preliminary screening of the three shapes of unstained RBCs. The model accuracy was 97.3% for the stacked sparse autoencoder plus Softmax classifier. Compared with the traditional convolutional neural network, the developed method showed a lower misclassification rate and less processing time, especially for RBCs with more discocytes. This method has potential applications in automatically evaluating the quality of long-term stored blood and real-time diagnosis of RBC-related diseases.

1. Introduction

During long-term storage of blood, the morphology of red blood cells (RBCs) undergoes a deterioration process wherein discocytes are transformed into echinocytes and then into spherocytes [1]. The biochemical and biophysical changes in this process are termed as RBC storage lesions [2]. Such changes are also detected in cell aging [3], suggesting a relevance between RBC morphological changes and cell aging [4]. In the process of RBC aging, the deformability of RBCs is weakened, and the fragility of RBCs is enhanced [5, 6], which may cause changes in RBC function, leading to adverse clinical outcomes such as increased postoperative infection, deep vein thrombosis, and multiorgan failure [7]. Therefore, statistical analysis of the distribution of RBC morphology is crucial to evaluating the quality of stored RBCs. Based on cell morphology analysis, this RBC classification method can also provide diagnostic information on blood diseases [8, 9].

With the development of computer language and artificial intelligence, the classification algorithms based on machine learning [1017] have enabled rapid, efficient, and automatic classification of RBCs. However, in the majority of previous studies, the extraction of morphological features of RBCs is manually screened, which is time-consuming and requires prior knowledge. Moreover, most RBC automatic classification methods are based on traditional bright field imaging technologies. In these imaging methods, due to the small optical absorption coefficient and poor imaging contrast of RBCs, the microscopic image of unstained RBCs is not clear enough to provide enough details to distinguish among discocytes and spherocytes. As a result, the quantitative measurement methods based on morphological analysis of RBCs [18, 19] have been applied to increase the staining-free RBC automatic classification accuracy.

Quantitative phase imaging (QPI) is a widely applied quantitative measurement method of analyzing staining-free translucent or transparent samples [20]. It can be realized by digital holography microscopy (DHM) [21], quantitative differential phase contrast (DPC) [22], or transport of intensity equation (TIE) methods [23]. As a quantitative phase imaging technology, the hologram obtained by DHM was reconstructed by digital simulation to recover QPI, which can be used to analyze the morphology of RBCs [19, 21, 24]. However, off-axis DHM requires high resolution of recording materials and high position accuracy of reconstructed light. Similarly, TIE imaging requires multiple scans, which is time-consuming and difficult to achieve real-time imaging of biological samples. Different from DHM and TIE, DPC is only calculated from two symmetrical oblique illumination [25] images by phase transfer function, and then, QPI is recovered by deconvolution. In this method, the imaging device is simple, and the imaging process is fast.

In this study, we described an automatic RBC classification approach in which the oblique illumination (OI) images of the RBCs were recorded using coded LED array microscopy and the corresponding phase images were reconstructed from DPC images using phase recovery techniques. Specifically, we introduced the stacked sparse autoencoder (SSAE) that used a full connection model for high-level feature learning, thus to reduce processing time. The SSAE was trained from bottom-up in an unsupervised manner to extract hidden features, and efficient representation of the features in turn enabled more accurate and supervised classification. To increase the accuracy of classification, our proposed method included a boundary descriptor known as boundary curvature (BC), which considered the irregularity and diversity in cell boundaries.

2. Materials and Methods

Figure 1 shows the flowchart of the combination of BC and SSAE to achieve RBC classification. First, microscopic images were obtained by the phase reconstruction method in Section 2.1, and images with only single RBC were obtained by the segmentation and extraction method in Section 2.3. Next, the massive single-cell images were divided into three categories of raw data x automatically according to the dataset preparation method in Section 2.4. In the training phase, the different original datasets x of RBCs, which were automatically divided into different classification categories (discocyte, echinocyte, and spherocyte) by the classification hyperplane, were input into SAE1 of SSAE to obtain the activation value h1 and parameter W. Similarly, h2, which was obtained by inputting h1 and W into SAE2, was input into the Softmax classifier. To increase the accuracy of the model, x, h1, and h2 were reloaded into SSAE for fine tuning [26] to minimize L, the process of which was called as the deep neural network (DNN). In the testing phase, BC in Section 2.5 was used to preliminarily screen the RBC phase images. After this screening, the images were input into the DNN to obtain the morphological features of RBC. According to results from the DNN, their probabilities were mapped by the Softmax classifier, as given in Section 2.6; then, the possibility of RBC classification was achieved. This combination of SSAE and BC can learn the morphological characteristics and facilitate the classification of RBCs.

2.1. Experimental Device

In a commercial inverted fluorescence microscope (Olympus IX73, Tokyo, Japan), the halogen lamp was replaced with an LED array (red, green, and blue primary colors, WS2812B, China). We selected green light (the wavelength at 515–530 nm) as the experimental light source. The distance of adjacent LED within the array is 8.3 mm, and the highest power of the LED array is 4.096 w, as shown in Figure 2(a). The LED array was fixed on a 3D printed bracket. Its lightening pattern was controlled by signals from a programmable system-on-chip (PSoC, CY8CKIT-059, USA). When light rays from the LED array illuminated the sample at a specific angle, the transmitted light passing through an objective (40X, NA = 0.75, Olympus) was projected on a charge-coupled (CCD) camera. The CCD (Mindvision, MV-SUA231GM-T, China) has the resolution of 1920 pixels  1200 pixels.

The imaging modes can be switched by illuminations patterns. Bright field (BF) images can be obtained by lightening the entire LED array. If half of the LED array (top, bottom, left or right) is lightened, as shown at the top of Figure 2(b), the corresponding OI images are IT, IB, IL, and IR, respectively. For a set of OI images (IL and IR) with complementary illumination angles in the vertical direction, the DPC image is calculated by [27]

Then, the DPC image is determined by the pairwise complementary OI images (IT and IB) in the horizontal direction according to equation (1). According to the weak object transfer function (WOTF) and the Tikhonov criterion [23], the phase of the QPI image can be determined. The detailed phase can be described as [27]where denotes the inverse Fourier transform, and and represent the conjugate of phase transfer function on different axes and the Fourier spectrum of DPC image, respectively. The sign of i stands for the amount of axis from multiaxis DPC, where , indicating two axial distributions (horizontal and vertical directions). is a parameter for Tikhonov regularization.

2.2. RBC Sample

RBC samples were collected from healthy donors (aged 18–30 years). To understand the morphological changes of RBCs during storage, a 5 ml whole blood sample was obtained through the vein and stored in the CPDA-1 solution at the ratio of 1.4 : 10, and then, the sample was then preserved in a refrigerator at 4°C ± 2°C. The morphology of RBCs was altered by adjusting the osmotic pressure on phosphate buffer saline (PBS) [28]. For each sample preparation, 10X PBS (Spark Jade, CR0015-500 ML, China), 1X PBS (Spark Jade, CR0014-500 ML, China) with pH 7.1, and deionized water were mixed to obtain a specific PBS with different osmotic pressures. The whole blood sample was diluted with PBS at 1 : 100. The dominated morphologies of RBCs in hypertonic PBS (628 mOsm), isotonic PBS (300 mOsm), and hypotonic PBS (148 mOsm) are the echinocytes, discocytes, and spherocytes, respectively. To obtain the three shapes of RBCs in the same image, we blended the blood stored for 4 weeks with fresh blood in isotonic PBS, where the environmental temperature was kept at 22°C during the experiment.

2.3. RBC Segmentation

The preprocessing step of cell segmentation is essential for RBC classification, recognition, and tracking [29, 30]. The connected domain segmentation algorithm can easily lead to inaccurate segmentation due to cell adhesion [31, 32]. The watershed segmentation algorithm based on connected domain analysis is an extension of the connected domain segmentation algorithm [33], which can separate the target of partial adhesion. First, the images were preprocessed by gray processing and bilateral filtering [34] to reduce the influence of background noise on cell segmentation. Second, the preprocessed images were processed by adaptive threshold, dilation, erosion, and morphological filtering, which could reduce the influence of noncellular substances on segmentation. Finally, the watershed algorithm was applied to the binary images to solve the phenomenon of partial cell adhesion. In the process of extracting RBCs, the preprocessing algorithm can achieve the purpose of edge-preserving denoising due to the consideration of spatial information and gray-scale similarity, and the simple and fast Otsu method [35] is not affected by image brightness and contrast.

2.4. Dataset Preparation

For training the network model to learn the morphological features of RBCs, the datasets of the three shapes of RBCs need to be prepared. A subjective error might generate unavoidably if the dataset is prepared manually [36]. To solve this problem, we used the support vector machine (SVM) [37], which needed only a small sample (the three shapes of RBCs selected by experienced biologist experts in Section 3.3) to determine the classification hyperplane, quickly and efficiently. Figure 3 shows the process of RBC dataset preparation. First, the known single RBCs were trained by SVM to form a classification hyperplane, which was the classification label after feature learning. Second, the unknown single RBCs were then predicted by the classification hyperplane. Finally, the dataset was prepared after manual inspection. Our method not only reduced manual subjective error but also saved resources (Section 3.3). Different shapes of single RBC images rotated 90°, 180°, 270°, and 360° and flipped could be magnified up to six times of the original images [38], which constituted the dataset for training and testing in Section 2.6 and further enhanced the robustness of the classification model to increase classification accuracy.

2.5. Extraction of RBC Features
2.5.1. Boundary Curvature

During the process of cell aging, the morphology of echinocyte changes from disco-echinocyte to echinoco-spherocyte [39], and the irregularity and diversity of RBC morphology may lead to incomplete feature learning [40]. Tangent count can be applied to estimate the change in the concavity and convexity of “contour” of RBCs [41]; however, the tangent point is so important that inaccurate positioning may lead to misjudgment. Accordingly, we introduced an edge-based descriptor, termed BC, which was an extension of tangent count. It is beneficial to reduce the dependence on sampling location with this descriptor because the target is uniformly sampled to judge whether the boundary of the cell is smooth. BC is defined as the rate of change of slope. Generally speaking, when moving clockwise on the boundary, if the slope change rate of point P is not negative, P is a part of the convex line segment; otherwise, P belongs to the concave line segment. To obtain BC of RBC, we need to obtain the boundary coordinates, that is, the coordinates of sampling points are , , , , where , and is the perimeter of the boundary cell. The slope of adjacent points is expressed as follows:

Thus, the number of negative values of BC is given as follows [42]:where and is the number of times when the slope of the last two points is negative. The N (calculated by (3) and (4)) reflects the local “roughness” of cell boundaries. We set N as a characteristic parameter of RBC morphology.

2.5.2. Sparse Autoencoder

It has been reported that the autoencoder can add the features of active learning to the original features [43], which considerably enhance the accuracy and yield better results than the traditional classification algorithm. The autoencoder is essentially a simple data compression process, in which the encoding and decoding method is applied to compress and decompress data. In the coding stage, high-dimensional data are mapped to low-dimensional data to reduce the data size. Conversely, the decoding stage involves the reproduction of data. The sparse autoencoder (SAE), which additionally includes some sparsity constraints on the traditional autoencoder, can suppress most of the output of hidden layer neurons to achieve a sparse effect [44]. The sparsity constraint is added to make SAE better in feature representation and data reconstruction. In each layer of SSAE, the activation value h is the output of the training sample x without any label after encoding. Parameters x and h satisfy the requirement as follows [45]:where is the nonlinear activation function, such as the sigmoid function, represents the weight, and b is the neuron bias. is the parameter set. Next, h is inversely transformed into the reconstructed representation r in the decoding process. h and r meet the requirement as follows [45]:where , , , and satisfy the relation , and is the weight network between the hidden layer and the output layer, which is another characteristic parameter of RBC morphology. We constantly modify and to minimize the average reconstruction error L by the back-propagation algorithm [26], which is calculated as follows [45]:

2.6. Classifier Selection

Because the autoencoder itself is only a feature extractor and has no classification function, it is necessary to include a classifier after feature extraction to achieve classification function. In [37], the authors compared three shallow classifiers (nearest neighbor algorithm, K-nearest neighbor (KNN); back-propagation (BP) neural network; and Softmax classifier) and found that the accuracy and training time of the Softmax classifier were superior. The Softmax classifier is the expansion of the logistic regression model in multiclassification problem. It satisfies the formula as follows [46]:where the probability of class n being taken is as follows:where n is the number of categories, the input value is a vector, and represents the score value of the model in the category of . The function of e is to change the score of to , which will not affect the relative size relationship. In this way, equation (9) enables a group of scores to be transformed into a group of probabilities, and the total probability is 1.

2.7. Performance Metrics

From the evaluation of the RBC classification algorithm, we annotated the true values with one-to-one circles of different colors (red, green, and blue), which represent the results of microscopic observation and RBC classification by individual experts. For quantitative evaluation of the effect of our morphological classification of RBCs, we adopted the following evaluation scheme, as shown in Figure 4:(1)If the observation result of experts is consistent with the algorithm prediction result, it can be considered as a correct prediction result (EE, SS, and DD), where the first E in EE is the real value and the second E is the predicted value of our algorithm. Similarly, we can obtain SS and DD.(2)If the observation result of experts is inconsistent with the algorithm prediction result, it can be considered as a wrong prediction result (SE, DE, ES, DS, ED, and SD), where E represents the echinocyte, S represents the spherocyte, and D represents the discocyte. The upper left corner of Figure 4 shows the possible predictive results of the three shapes of RBCs, where green, red, and blue circles represent the discocyte, echinocyte, and spherocyte detected manually, respectively. In addition, green, red, and blue dots are the predicted cell categories for the RBCs. The confusion matrix [47] in the upper right corner of Figure 4, also known as error matrix, a common method to evaluate the performance of classifiers, depicts the relationship between the real attributes of the sample and the prediction results. Examples of the RBC classification results obtained in this study are shown in Figure 4, which highlights the cases of misclassification.

To quantitatively analyze the quality of the DNN, we applied accuracy and F1 measure to evaluate the classification results, as shown in the model evaluation box in Figure 3. Recall is a statistical measurement to quantify the ability of an algorithm to classify objects automatically. Different categories of recall are expressed as , , and , respectively. Different categories of precision are , , and , respectively, which are used to evaluate the robustness of the algorithm in avoiding misclassification. The F1 measure is the harmonic average value of precision and recall, which satisfies the requirement . In addition, we used the index of accuracy to evaluate this RBC classification model, which is estimated as follows:where Num is the total number of classified RBCs. The index of misclassification rate (MR) [19] is then calculated as follows:

3. Experimental Results

3.1. Morphological Differentiation of RBCs by Bright Field Imaging and QPI

It is difficult to observe the details of unstained RBCs by traditional bright field (BF) imaging technology because of the poor contrast of transparent and translucent samples. To improve the image contrast of RBC samples, we used QPI technology for microscopic imaging of unstained RBCs. First, according to the illumination angles of the left and right half of LED arrays, two oblique illumination images were captured by CCD, as shown in Figures 5(a1) and 5(a2). Second, the left and right complementary oblique illumination images were then transformed into the left-right DPC image (Figure 5(b1)) by equation (1). The top and bottom oblique illumination images are shown in Figures 5(a3) and 5(a4), and the top-bottom DPC image is shown in Figure 5(b2) in the same way. Finally, QPI image (Figure 5(d)) was restored by DPC images using phase transfer function (equation (2)). To further visualize the three-dimensional structure of QPI, mesh of RBC was calculated by the mesh function and view function [48]. To better observe the two-dimensional structure of the mesh image, we set the azimuth and elevation to 0 and−88, respectively. Figure 5(c) shows the RBCs in BF. Echinocytes with rough edges can be clearly observed, but the morphological characteristics of discocytes and spherocytes are not apparent. Compared to the BF images, the different shapes of RBCs was distinguished according to different phase information [15] in QPI images.

3.2. Segmentation and Recognition of RBC Using QPI

To achieve RBC classification, the RBCs were segmented, and each RBC was then extracted from the segmented RBC phase image. The segmentation and extraction results are shown in Figure 6, displaying preprocessing results, RBC segmentation analyzed by the relatively simple Otsu method [35], and RBC extraction in different imaging modes. Images in the first line of Figure 6 are the preprocessed microscopic images of RBCs in the same field of view, including BF, OI, DPC, QPI, and mesh of QPI. Images in the second line of Figure 6 are the binary images of RBCs in five imaging modes. The comparison revealed that QPI highlighted by mesh show better morphological characteristics and clearer cell contour of RBCs than BF, OI, and DPC. Here, mesh displays the pseudocolor and pseudostereo effect of QPI. Images in the last line of Figure 6 show the results of cell edge extraction under five imaging modes, where the RBC edge extraction effects of QPI and mesh are better in RBC phase images with several slightly adherent RBCs [49, 50] than BF, OI, and DPC. These results show that the images of QPI have better segmentation performance than those of DPC via the same segment algorithm. This is convenient for subsequent single RBC extraction.

3.3. Dataset Based on SVM

The three shapes of RBC samples were selected by experienced biologist experts: discocytes (105 samples) with relatively circular depression area, echinocytes (102 samples) with locally uneven and irregular boundary, and spherocytes (100 samples) without relatively circular depression area. The images of these samples were input into SVM being trained to form the classification hyperplane, and then, a large number of segmented single RBCs were tested by the hyperplane for the preparation of RBC datasets. To further determine the quality of the datasets, RBC datasets prepared by SVM needed the manual review by two professionals. Figure 7 shows some examples of the datasets of the three shapes of RBCs predicted by SVM. The process of predicting and reviewing 100 cells through our method took 30 seconds. As a comparison, it took approximately 167 seconds to prepare a dataset of 100 cells by the manual preparation method.

3.4. Boundary Curvature Improving the Accuracy of RBC Classification

The measurement based on the morphological analysis of RBCs may provide a possible alternative analysis method [1, 14] to replace the measurement of hemoglobin concentration on determining blood quality [51]. Morphological analysis of the different shapes of RBCs in BF and QPI is shown in Figure 8. Figures 8(a) and 8(b) show the BF and QPI images of RBCs, respectively. Figures 8(c) and 8(d) are the local images of Figures 8(a) and 8(b), respectively, where cells A, B, and C represent the spherocyte, echinocyte, and discocyte, respectively. Figure 8(e) shows BCs of the three shapes of RBCs in BF and QPI and the corresponding mesh; the figure shows that BC can separate cell B from RBCs with smooth boundary (cells A and C) because N mentioned in (4) of cell B is larger than that of cells A and C whether in BF or QPI. Contrary to the regular morphological characteristics of discocytes and spherocyte, the boundaries of echinocytes are irregular with variations, which may lead to incomplete morphological feature learning and even affect the final classification results. Thus, to reduce the misclassification rate of RBCs, we set the number of negative values, N, of BC as the threshold for the preliminary classification of echinocyte-shaped RBCs from RBCs with smooth boundaries (discocyte and spherocyte) before training and learning. The influence of BC on the results of RBC classification in BF and QPI is shown in Figure 9. Figures 9(a) and 9(b) show the RBC classification without BC and with BC in BF, respectively. Figures 9(c) and 9(d) show the RBC classification without BC and with BC in QPI, respectively. These figures show that BC can avoid situations where echinocyte-shaped RBCs are misclassified as discocytes or spherocytes in BF and QPI, as shown in yellow and green boxes of Figure 9. The average MR calculated by (11) is 7.1% without BC, and the average MR is 5% with BC (the average MR has been reduced by 2.1%). In addition, compared to BF, QPI can reduce the probability that spherocytes are misclassified as discocytes, as shown in white boxes of Figure 9. The basic RBC classification relies on SSAE and Softmax classifiers, and QPI and BC jointly improve the accuracy of RBC classification.

3.5. SSAE

Datasets of the different shapes of RBCs were input into the DNN for training and testing. The quality of the model largely affects the final result, and the accuracy is used to evaluate the quality of the model (10) [47]. The sparsity proportion, sparsity regularization, L2 weight regularization, and iterations in the SAE affect the accuracy of the DNN [52]. To verify the influence of these four parameters on the accuracy of the DNN, we adjusted one of the parameters to optimize the accuracy. Figure 10 shows the relationship between different parameters and accuracy. Initially, we changed the parameter sparsity proportion, while leaving the other three parameters unchanged. We set iterations of the single-layer SAE and Softmax layer as 10, the number of hidden neurons of the SAE as 100, and the two parameters of sparse regularization and L2 weight regularization as 4 and 0.004, respectively. The results revealed that the accuracy was the highest when the sparsity proportion was 0.10, as shown in Figure 10(a). Next, according to the above best parameters, we changed the parameter sparsity regularization while leaving the other three parameters unchanged. We set the number of iterations of the single-layer SAE and Softmax layer as 10, the number of hidden neurons in SAE as 100, and the sparsity proportion and L2 weight regularization to 0.10 and 0.004, respectively. The results revealed that the accuracy was the highest when sparsity regularization was 8, as shown in Figure 10(b). Then, according to the above best parameters, we changed the parameter L2 weight regularization while leaving the other three parameters unchanged. We set iterations of the single-layer SAE and Softmax layers as 10, the number of hidden neurons of the SAE as 100, and sparsity proportion and sparsity regularization as 0.10 and 8, respectively. The accuracy was found to be the highest when L2 weight regularization was 0.004, as shown in Figure 10(c). Finally, according to the above best parameters, we changed the parameter iterations while leaving the other three parameters unchanged. We set the number of hidden neurons of SAE as 100 and the three parameters of sparsity proportion, sparsity regularization, and L2 weight regularization as 0.10, 8, and 0.004, respectively. The DNN with 130 and 170 iterations exhibited the highest accuracy, as shown in Figure 10(d). The red and blue curves in Figure 10 show that the accuracy can be further increased by fine tuning, namely, by initializing the network weight according to the original model parameters, optimizing the network parameters using the back-propagation algorithm [26], and minimizing L of equation (7). In [53], the authors reported that SSAE could further learn the features of the dataset and reduce feature dimension. Therefore, we selected SSAE formed by stacking the traditional two-layer SAEs plus Softmax classifier and further fine-tuned to form the DNN. Since the greater the number of iterations, the longer the training time. After several experiments, we chose the parameters with best performance in all experiments, and the values of each parameter are given in Table 1.

4. Discussion

Based on the above parameter settings, the datasets of the three shapes of RBC were input into SSAE composed of a two-layer SAEs plus Softmax classifier for training by fine tuning to form the DNN. If (equation (9)) was greater than 0.5, the prediction matching of the category result was successful. In contrast, the predictions did not match if the mapping probability was less than 0.5. Repeat the prediction three times for any RBC phase images according to the RBC classification process shown in Figure 1. Table 2 provides the final average classification results of six RBC phase images, where E, S, and D are the echinocyte, spherocyte, and discocyte, respectively.

Table 2 provides that our RBC classification scheme that relies only on the phase information on RBC itself is a good RBC classification method, where MR calculated by (11) is less than 10%, and the cell detection time is approximately 20 cells per second. Similarly, we input the same RBC phase images into the convolutional neural network (CNN) [17] that was trained by the same RBC datasets. The classification results, MR, and cell detection time of the three shapes of RBCs are given in Table 3. Compared to the classification results outcome by our developed model, the MR and the average time of cell detection of the CNN were relatively higher (5% and >3.2 s, respectively), especially for samples with more discocytes. We compared the classification of discocytes (D), and our method had significantly improved the classification accuracy for large numbers of cells, which corresponded the case of various clinical assays.

We also used model accuracy [47] to compare our proposed method with the CNN. For the RBC classification algorithm, our model accuracy was 0.973, which was higher than that of the CNN (0.940). In addition, the F1 measure of our RBC classification method was 3.3% higher than the CNN. CNN-based feature representation involves convolution and subsampling operations to learn a set of locally connected neurons through the local receptive fields for feature extraction [18], while SSAE is a full connection model that learns a single global weight matrix for feature representation [30]. SSAE is trained in a bottom-up unsupervised manner to extract hidden features. Compared to the CNN, SSAE achieves higher accuracy with less processing time for training using the same dataset, as shown in the model accuracy (equation (10)) and the results given in Tables 2 and 3.

Although the data augmentation of RBC phase images was used, it was still difficult to increase the performance of our proposed model as not all echinocytes could be trained and learned due to the diversity and irregularity of echinocyte morphology. Moreover, only discocytes, echinocytes, and spherocytes were considered in this work. In the future, our study will involve more shapes of training samples and explore the RBCs classification methods of other suitable feature values, such as texture inhomogeneity and border roughness [12], to further increase the performance. Since QPI technology provides the phase information that can reveal microstructures compared with intensity imaging for unstained cells, our classification method could be applied to various unstained tissue slices of specific recognition. Thus, our proposed method could be potentially used for other real-time applications.

5. Conclusions

In this study, we proposed a method for automatic classification of RBC morphology based on QPI. First, compared with the traditional bright field image, QPI delivers better image for analysis of unstained RBCs. Second, the watershed algorithm based on the connected domain analysis is used to show that QPI is more conducive to the segmentation and recognition of RBCs. Third, compared with the manual dataset, the dataset preparation method based on SVM can avoid the manual subjective error and thus increase the efficiency. Last, the morphological features extracted by BC and SSAE can be classified by the Softmax classifier, and the misclassification rate of RBCs can be reduced by BC. The experimental results show that our classification method can achieve automatic classification of RBC morphology based on QPI, with high accuracy, minimal calculation, fast recognition speed, and no requirement to set the feature selection parameters artificially. In addition, our classification method has the potential to be applied to analysis of RBC-related diseases and evaluation of the quality of long-term stored RBCs.

Abbreviations

RBCs:Red blood cells
QPI:Quantitative phase imaging
DHM:Digital holography microscopy
DPC:Differential phase contrast
TIE:Transport of intensity equation
OI:Oblique illumination
SSA:Stacked sparse autoencoder
SAE:Sparse autoencoder
BC:Boundary curvature
CCD:Charge-coupled device
WOTF:Weak object transfer function
PBS:Phosphate buffer saline
SVM:Support vector machine
KNN:K-nearest neighbor
BP:Back-propagation
DNN:Deep neural network
MR:Misclassification rate
BF:Bright field
CNN:Convolutional neural network
TN:True negative.

Data Availability

The data used to support the findings of this study are included within the supplementary material.

Conflicts of Interest

The authors declare no that they have no conflicts of interest.

Authors’ Contributions

Mengduo Jiang and Meng Shao contributed equally to this work.

Acknowledgments

This research was supported by the Natural Science Foundation of Anhui Province in China (1908085MA14) and the Project of Research Fund of Anhui Institute of Translational Medicine (2021zhyx-B16).

Supplementary Materials

The supplementary material includes training data and testing data. The training data are 10800 (3600 for each shape RBCs) single 50 × 50 RBC phase images used for training, and the testing data include 3600 (1200 for each shape RBCs) single 50 × 50 RBC phase images. Six 1920 × 1080 RBC phase images were used for RBC morphology classification. All phase images are in .jpg format. (Supplementary Materials)