Abstract

This study was aimed at investigating the ultrasound based on deep learning algorithm to evaluate the rehabilitation effect of transumbilical laparoscopic single-site total hysterectomy on pelvic floor function in patients. The bilinear convolutional neural network (BCNN) structure was constructed in the ultrasound imaging system. The spatial transformer network (STN) was used to preserve image information. Two algorithms, BCNN-R and BCNN-S, were proposed to remove sensitive information after ultrasonic image processing, and then, subtle features of the image were identified and classified. 80 patients undergoing transumbilical laparoscopic single-site total hysterectomy in hospital were randomly divided into a control group and a treatment group, with 40 cases in each group. In the control group, conventional ultrasound was used to assess the image of pelvic floor function in patients undergoing laparoendoscopic single-site surgery (LESS); in the observation group, ultrasound based on deep learning algorithm was used. The postoperative incision pain score, average postoperative anus exhaust time, average hospital stay, and postoperative satisfaction of the two groups were evaluated, respectively. The highest accuracy of constructed network BCNN-S was 88.98%; the highest recall rate of BCNN-R was 88.51%; the highest accuracy rate of BCNN-R was 97.34%. The operation time, intraoperative blood loss, and exhaust time were similar between the two groups, and the difference had no statistical significance (). The numerical rating scale (NRS) scores were compared, the observation group had less pain, the difference between the two groups had statistical significance (), and the postoperative recovery was good. The BCNN based on deep learning can realize the imaging of the uterus by ultrasound and realize the evaluation of pelvic floor function, and the probability of pelvic floor dysfunction is small, which is worthy of clinical promotion.

1. Introduction

Pelvic floor disease is a degenerative lesion including urinary incontinence and pelvic organ prolapse [1]. In the United States, nearly 400,000 patients undergo surgery due to pelvic floor dysfunction each year, and 300,000 cases occur during hospitalization. Each year, 33% of women have different pelvic floor dysfunction (PFD) before the age of 60 and require hysterectomy [2]. Hysterectomy is closely related to PFD, which can improve the risk of postoperative pelvic floor dysfunction and affect the quality of life of patients [3]. The pelvic floor function of normal women mainly relies on intact muscle strength, nerves, and ligaments. The interaction of various organs allows the pelvic floor structure to reach a normal dynamic balance state [4]. The hysterectomy can damage the neural tissue, impair vascular nutrition, and severely destroy important supporting structures such as muscles, ligaments, fascia, and connective tissue [5, 6]. PDF in women is usually judged based on the patient’s clinical symptoms as well as the corresponding examination results, and the examinations include finger pressure test, urodynamic examination, and magnetic resonance imaging, but these modalities have the disadvantage of poor reproducibility and cannot comprehensively evaluate the pelvic floor function of the anatomy for dynamic imaging, safety, and repeated procedures [7, 8]. At present, transperineal pelvic floor ultrasound is increasingly loved by researchers to evaluate the dynamic changes of pelvic floor function. Pelvic floor ultrasound enables real-time observation of pelvic anatomy and function and enables dynamic imaging, which has become a major modality for assessing pelvic floor ultrasound [9, 10].

Transumbilical laparoendoscopic single-site surgery (TU-LESS) is a surgical technique performed using surgical instruments through the pore channel, mainly using the umbilicus to place multiple operating pore channels as well as the operating platform of the laparoscope in the abdomen, which can reduce abdominal infections caused by gastrointestinal tract as well as transapproach surgery, with the characteristics of mild pain, rapid recovery, no earthworm-like scar left in the abdomen, safety, and minimal invasion. In this process, it can also use the natural wrinkle site of the umbilicus to cover the surgical incision, which can meet the requirements of women for abdominal “scarless” cosmetology [11, 12]. Compared with traditional multiport laparoscopy, TU-LESS is easier to perform in total hysterectomy, can shorten gastrointestinal recovery time and postoperative ambulation time, and can remove the uterus through a single port at the umbilicus and through an approach in a manner similar to “apple cutting.” “Chopstick effect” between instruments is one reason for prolonged operation time [13, 14].

With the advantages of high sensitivity, noninvasion, easy operation, and low cost of ultrasound imaging, real-time three-dimensional (four-dimensional) ultrasound imaging has become an effective diagnostic tool in imaging systems in the 21st century [15]. Ultrasound is widely used in obstetrics and gynecology without ionizing radiation, cost-effective, and easy to approach, for example, abdominal ultrasound, cardiovascular ultrasound, electrocardiogram, and prenatal diagnostic ultrasound, but the presence of pseudonoise in ultrasound imaging causes reduced imaging quality and is also a challenge in ultrasound; relying only on the experience of doctors, as well as the proficiency of operators, there are significant differences in results in different institutions as well as different ultrasound systems [16, 17]. This requires the use of advanced intelligent ultrasound image analysis algorithms to intervene imaging quality. Deep learning technology has made it successful for natural image recognition in the construction of large-scale image datasets, and high-resolution ultrasound imaging equipment has accumulated many ultrasound image data, which also makes deep learning possible for the auxiliary diagnosis of images [18, 19]. The bilinear convolutional neural network (BCNN) enables a good classification of images [20]. Yosinski et al. [21] pointed put when learning the features of the underlying CNN that the CNN network can get distinct features with increasing depth, and the closer to the input end, the more specific information the feature displays.

The ultrasonic transverse section of patients with total hysterectomy was identified, and the BCNN was used to identify the ultrasonic transverse section of the uterus.

The innovation is that the intelligent algorithm is used to identify high resolutely the transverse section, and the extracted features are fused, which is helpful to retain the subtle features. It is hoped that it can provide reference for ultrasound evaluation of pelvic floor function in patients with transumbilical laparoscopic single-site total hysterectomy and provide theoretical basis for clinical work.

2. Materials and Methods

2.1. Subjects

Eighty patients admitted to hospital from June 2018 to June 2020 who underwent transumbilical laparoscopic single-site total hysterectomy were selected. The age of the study subjects ranged from 32 to 50 years, with an average of years and BMI of . There were 28 cases of uterine fibroids, 19 cases of endometrial lesions, and 33 cases of adenomyosis. The patients were randomly divided into the control group () and the treatment group () by random number table method. There was no significant difference in basic information between the two groups (). Preoperative gynecological examination of the uteri was less than 4 months of gestation with good range of motion, and the operation was completed by the same physician. This study had been approved by the ethics committee of the hospital, and the patients and their families signed the informed consent form.

Inclusion criteria: patients with uterine fibroids or adenomyosis diagnosed by routine cervical biopsy or cervical liquid-based cytology before surgery; patients with complete clinical data; patients without cervical malignant lesions or uterine prolapse; patients without fertility requirements, nonpregnant patients; patients without serious medical and surgical diseases and a history of repeated surgery.

Exclusion criteria: patients who are allergic to the drugs used; patients who are critically ill and unable to cooperate; patients with chronic diseases such as heart, brain, liver, and kidney; patients with mental diseases; patients with systemic infection.

2.2. Treatment Methods

Ultrasonic diagnostic apparatus was used. The intestinal tract of the patient with the probe frequency of 4-8H was emptied, the patient laid on the examination bed, and the probe coated with coupling agent and covered with disposable film gloves was placed at the perineum to clearly show the uterine condition. In the Valsalva state and at rest, distance from the bladder neck to the posteroinferior border of the pubic symphysis (BNSD), distance from the external cervical orifice to the posteroinferior border of the pubic symphysis (CSD), posterior urethrovesical angle (PUA), and bladder neck mobility and the difference in the distance from the bladder neck to the posteroinferior border of the pubic symphysis (BND) under the maximum Valsalva state and at rest were compared between two groups. For measurements in both states, all data were measured in triplicate by 3 examining physicians and then averaged.

The patient was anesthetized by endotracheal intubation. The bladder lithotomy position was taken, and the uterine lifting device was placed. The patient’s posture was low head and high foot. The longitudinal incision (about 10 mm) was made from the center of the umbilical to the lower edge of the umbilical wheel, and then, the self-made operation platform was implanted through the umbilicus. The external diameter of 10 mm and 5 mm instruments and camera lens were implanted at the finger end of the glove. After successfully entering the abdominal cavity, the pneumoperitoneum was established, the pressure was 12-14 (mmHg), the patient’s body position was adjusted to 30°, the head was low, and the hip was high, so as to use the uterine lifting device. The pelvic cavity was evaluated by laparoscopy. Bipolar electrocoagulation was used to shorten the isthmus of the fallopian tube, the round ligament of the uterus, and the inherent ligament of the ovary, and then, the broad ligaments in the anterior and posterior parts and the peritoneal reflection of the bladder were opened, respectively. Bipolar electrocoagulation of bilateral uterine arteries and veins was performed. The cervical stump 1-10# absorption suture was selected to stop bleeding, and the uterus was removed at vertebral body and parallelly at the isthmus above the uterine vascular suture. After cervical reduction, the tract mucosa and pelvic peritoneum were sutured. Before the end of the operation, the pelvic wound was examined, and repeated washing was performed to stop bleeding. After incision suture, the operation ended.

Hardware platform: Ubuntu 16.04 operating system, deep learning framework is Pytorch-1.2.0. Python 3.6 is the development language, memory 128 GB, central processor is IntelXeon (R) Silver 4110CPU@2.10 GHz×32, and image processor is NVIDIA 1080Ti.

2.3. Observation Indicators

The intraoperative blood loss, postoperative pain score, postoperative fever, postoperative exhaust time, conversion operation, and operation time were recorded.

Pain degree: numerical rating scale (NRS) score was used (Table 1).

2.4. BCNN Structure

The BCNN structure is shown in Figure 1. In this algorithm, after optimizing the minimum loss, the two networks supervised each other, and it did not require a lot of time to adjust the parameters. Finally, the accurate recognition results were obtained. The parallel network of CNN Stream A and CNN Stream B extracted the cross-sectional features with high recognition in the input network image. The features extracted by CNN Stream A and CNN Stream B were fused, which helped to identify subtle features.

The image processing technology of deep learning used superimposing multiple convolutions to obtain image characteristics and then used multilayer perceptron to classify them. In clinical practice, the standard constrained deep network was used to realize the transverse section of ultrasound imaging. The doctor needed to spend a lot of experience in finding the horizontal transverse section during the examination, which was particularly important to have a standard access that can accurately identify the transverse section of the uterus. BCNN-S and BCNN-R were proposed to identify the transverse section of uterine ultrasound. First, image processing was performed, then the high transverse section was identified, and finally the features were fused. After obtaining the subtle features, the identification and classification were performed, and finally, the accurate identification of the horizontal direction of the transverse section of uterus was obtained.

The algorithm was optimized. The initial learning efficiency was set to 1, the weight attenuation was set to , the momentum attenuation was set to 0.9, and the learning rate of each epoch was multiplied by 0.1. The gradient was updated after algorithm optimization, which was more stable and smoother. The default parameters can make the model reach a stable level (Table 2).

2.5. Space Conversion Module

Through the attention mechanism, the space transformer networks (STN) can convert the spatial information in the original image to another space to retain key information. The spatial transformer (ST) was proposed. The spatial domain information in the image is transformed to extract key information. The trained STN can find out the areas that need to be concerned in the image information. ST has the function of scaling and rotation. The local information of the image can be extracted by transformation. ST module is input to the existing network structure. The model input equation is shown as follows.

represents the width, represents the height of the output tensor of the upper layer, and represents the channel. Different convolution kernels and basic three channels of the image produce different channel information. The input image enters the double-stream route, and the transformed image is obtained through matrix change.

represents the transformed image features, and the positioning network learns a set of parameters . This parameter generates a sampling signal through the parameters of the grid generator, which is essentially the transformation of the matrix image.

The sampling matrix generated by STN can extract the key information in the original image. One is the sampling matrix for scaling and rotation transformation, and the other is the unit matrix, which is expressed as follows.

matrix is the corresponding sampling matrix. This module can identify the key information of the module on the upper layer. It is also a matrix that can be differentiated. It uses the kernel function to represent the complex change information.

is the conversion kernel function, represents the information before conversion, and represents the information after conversion.

Classification network has strong feature representation ability and good recognition ability for conventional images. In target recognition, the difference between different targets is very small, but it is not ideal to classify directly in conventional images. In the debugging of fine-grained classification network, the loss measurement function is introduced. There will be three weight shared networks , , and when three samples are input each time. The accumulated three network outputs obtain the loss, and in addition to the use of softmax loss function, the three characteristic outputs constitute the Triplet loss.

represents the loss obtained by the softmax loss function, which represents the overall category information of the image so that the network can be optimized to the real category. belongs to the Triplet error of three sub networks with shared parameters. In order to increase the recognition ability of the same category and different samples of the network, the distance between classes is calculated. The two loss functions restrict each other, and the performance of the model is significantly improved.

When detecting the target, CNN can train through the object frame and component annotation in the fine-grained training image, including object head detection, fine-grained object level detection, and trunk detection. Ideal object detection results can be obtained by using position geometric constraints.

After the residual attention module can strengthen the attention and image features, the features are input into the next module at the same time. The function represents different functions and has different attention domains. The equation is as follows.

indicates that the Sigmoid function directly activates the image feature tensor, indicates that the image feature tensor is globally averaged and pooled to obtain channel domain attention, and indicates the average number of Sigmoid functions activating the image feature tensor to obtain spatial domain attention.

2.6. Performance Evaluation Indicators

Reasonable evaluation of performance indicators can effectively evaluate the performance of the algorithm. The cross section of ultrasonic image is evaluated as a binary classification problem. The prediction category and real category of the model are divided into true negative (TN), false positive (FP), true positive (TP), and false negative (FN).

The accuracy calculation is shown in Equation (9). The higher the accuracy of classification, the better the performance of the algorithm. Precision refers to the proportion of TP in all samples predicted to be positive. Recall indicates the proportion of samples predicted to be positive in positive samples. When the recall rate and accuracy rate are high, the average will be higher. If one of them is low, it will lower the average, and its value will be close to the low number, as shown in Equation (12).

It is required to make evaluation on the overall mean Intersection over Union (mIoU) and mean Dice of graph. mIoU predicts the intersection of the target area and the real target area. The higher the value of mIoU, the higher the correlation. mDICE is used to calculate the similarity between two samples. It is a geometric similarity measurement function. in Equation (14) denotes the true region, and denotes the predicted region.

The experimental environment hardware: platform memory (RAM) is 128 GB, the image processor (GPU) is NVIDIA 1080Ti ×2, the central processing unit (CPU) is Intel Xeon®Silver4110 CPU@2.10 GHz ×32, and the operating system is Ubuntu 16.04. The deep learning framework is Pytorch1.2.2, and Python 3.6 is selected as the development language.

2.7. Statistical Methods

The database of all data was established by Excel, and SPSS 19.0 statistical software was adopted. Measurement data were expressed as (), enumeration data were analyzed by the test, and enumeration data were expressed as percentage (%). was considered to indicate a significant difference.

3. Results

3.1. Training and Test Data Results

In this study, 3619 ultrasound images were obtained from the public dataset HC18 and the self-built dataset JFU19 after preprocessing, which were divided into training set and data set according to a certain proportion, and a certain number of training sets were drawn to evaluate the model, it was shown the model had a good recognition effect, and the three positions of horizontal, vertical, and right-angle lines were trained, respectively, and the results are shown in Figure 2. The test of the training set revealed that the model showed a good performance.

3.2. Parameter Information of BCNN

In this study, BCNN-R and BCNN-S feature images were extracted to analyze the performance of the algorithm. The resulting feature is shown in Figure 3. The feature image has distinct activation regions, which also indicates that there are many features that can be used for network identification and classification. Compared with BCNN-S B, BCNN-R A has more activated regions, which also obtains more identification ability. The ultrasound image in Figure 3(c) has more noise and reverberation than that in Figure 3(d). BCNN-R has a more complex structure and is inevitably subjected to a lot of noise.

3.3. Algorithm Performance Comparison

Comparing the algorithm performance of BCNN-R and BCNN-S with that in References [2224], the highest accuracy of BCNN-S was 88.98%, followed by 85.62% of BCNN-R. BCNN-R had the highest recall rate of 88.51%, followed by BCNN-S of 87.54%; the highest accuracy of BCNN-R was 97.34%, followed by 91.67% of BCNN-S (Figure 4).

3.4. Experimental Results of Semantic Segmentation

In order to explore the change of the algorithm with epoch and the performance of the algorithm in ultrasonic image segmentation, the mIoU and mDICE evaluation indexes were analyzed. The results were as follows. Compared with mIoU and mDICE in Reference [24], mIoU and mDICE in BCNN had higher convergence performance, which can increase the robustness of the network to a certain extent and inhibit overfitting (Figure 5).

3.5. Ultrasound Results

The ultrasound parameters BNSD, CSD, PUA, bladder neck mobility, and BND under the maximum Valsalva state and at rest were compared between the two groups. It had a reduced BNSD under Valsalva state compared with that under at rest (Table 3).

3.6. Comparison of Surgical Conditions

The operation time, NRS score, intraoperative blood loss, and exhaust time of the two groups were compared. The operation time, intraoperative blood loss, and exhaust time of the two groups were similar, and the difference was not statistically significant (). In the NRS score, the pain of the observation group was mild, and the difference was significant () (Table 4).

3.7. Comparison of Hospitalization Time and Satisfaction

The satisfaction score of the observation group was , and that of the control group was . The hospitalization time of the observation group was days, and that of the control group was days. The difference between the groups was statistically significant () (Figure 6).

3.8. Postoperative Comparison

Abdominal infection, hematoma, incision bleeding, and incision infection did not occur in both groups after operation, and the patients recovered well. However, in the control group, one patient had active bleeding in the umbilical region after operation, and the dressing became dry after hemostasis with pressurized sand. The incidence of fever in the observation group was 1 case (2.5%), and that in the control group was 3 cases (7.5%). The fever rate in the control group was significantly higher than that in the observation group () (Figure 7).

4. Discussion

Total hysterectomy is a relatively common gynecological surgery. After hysterectomy, the whole physiological state and overall structure of pelvic floor will change, which may produce PDF. Female pelvic floor is intricate, and it is important to find a noninvasive, reproducible, and simple diagnostic method for assessing pelvic floor function. With the rapid development of surgical abdominal surgery, LESS, with the natural approach, is widely used in clinical practice and is easier to understand and accept for some gynecological patients who meet the surgical criteria [25]. Compared with traditional hysterectomy, laparoendoscopic single-site total hysterectomy reduced hospital stay and postoperative blood loss, the two groups of patients had similar operation time, intraoperative blood loss, and exhaust time, and the difference had no statistical significance (); in the NRS score, the pain of the observation group was mild, and the difference had statistical significance (). Chung [26] compared single-site laparoscopic total hysterectomy with conventional laparoscopic total hysterectomy, there was no difference in pain scores between the two groups, and the operation time was significantly longer in single-site laparoscopic total hysterectomy group. In the observation group, intelligent algorithm was added in the ultrasound imaging, and there was no significant difference in the operation time. One patient had active bleeding at the umbilicus after surgery, and the dressing became dry after hemostasis with pressurized sand.

Ultrasonic technique of the BCNN model based on deep learning is accurate for evaluating pelvic floor function. Yu et al. [27] evaluated the effect of laparoscopic hysterectomy based on artificial intelligence imaging; ultrasound imaging based on ISCB algorithm can high-quality display the pelvic floor structure of patients undergoing total laparoscopic hysterectomy, improve the diagnostic rate of doctors, accelerate postoperative rehabilitation of patients, reduce postoperative pain, and improve patient satisfaction. In this experiment, BCNN-S had the highest accuracy of 88.98%; BCNN-R had the highest recall of 88.51%; and BCNN-R had the highest precision of 97.34%. The proposed algorithm shows high convergence performance in ultrasonic image segmentation. This experiment also shows that the intelligent algorithm of deep learning can effectively improve the efficacy of hysterectomy patients. Compared with mIoU and mDICE in Reference [24], mIoU and mDICE in BCNN had higher convergence performance, which can increase the robustness of the network to a certain extent and inhibit overfitting.

5. Conclusion

Eighty patients undergoing transumbilical single-site laparoscopic total hysterectomy in hospital were randomly divided into the treatment group and the control group to investigate the efficacy and safety of ultrasound based on deep learning for pelvic floor function in patients undergoing LESS. Ultrasound technology can be used to dynamically observe the anatomy of female pelvic floor organs; ultrasound technology of BCNN model based on deep learning has no significant toxic and side effects, with higher accuracy for evaluating the pelvic floor function. The incidence of pelvic floor dysfunction after hysterectomy is small, and attention should be paid to pelvic floor disorders and the image of sexual function during surgery, which is more conducive to the recovery of physical and mental health and normal life of patients. There are some shortcomings, due to the limitation of time, there is a lack of long-term follow-up, so long-term follow-up to patients is needed at a later stage to further verify the long-term efficacy. It is believed that it can provide some ideas and experimental support for the recovery of pelvic floor function in patients undergoing LESS.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare no conflicts of interest.

Authors’ Contributions

Yan Zhu and Jiamiao Zhang contributed equally to this work.