Abstract
Internal leakage diagnosis in a hydraulic cylinder is a key technique for the maintenance of hydraulic systems. However, it is difficult to diagnose the internal leakage under different low loads. To solve this problem, a novel fault diagnosis method based on the optimization deep belief network (DBN) combined with the complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) technique is proposed to treat the collected AE signals. The raw AE signals are decomposed into a set of intrinsic mode functions (IMFs) by using CEEMDAN. Subsequently, according to the decreasing order of the Pearson correlation coefficient values, the first five IMFs are selected for signal reconstruction to suppress the abnormal interference from noise. The reconstructed signals are regarded as the input of the optimization DBN, and the particle swarm optimization simulated annealing (PSOSA) algorithm is adopted to identify the four internal leakage levels. The experimental results show that the proposed method exhibits a higher classification accuracy than other methods under different low loads. This result validates the effectiveness and superiority of the proposed approach to realize internal leakage diagnoses under different low loads.
1. Introduction
Hydraulic systems are widely applied in industry as actuators in hydraulic systems [1–3]. Internal leakage is a major failure mode caused by the wear of the piston seal that closes the gap between the movable piston and cylinder wall [4]. The detection of internal leakage is critical in practical applications because such leakage can affect the dynamic performance of the system. However, internal leakage cannot be detected until the actuator seal is completely damaged, and the actuator fails to respond to a control signal. Moreover, the detection of internal leakage is challenging due to the associated concealment. In addition, online nonintrusive diagnosis of internal leakage cannot be easily realized because of the dynamic working conditions of hydraulic cylinders. Hence, this paper proposes a method to realize automatic internal leakage diagnoses under different loads.
The existing methods for the fault diagnoses of internal leakage in hydraulic cylinders can be categorized into two types, namely, model-based and data-driven methods. Among model-based methods, a linear model-based approach with an adaptive threshold [5] and extended Kalman filter- (EKF-) based fault detection scheme [6] have been developed to detect different types of leakage in hydraulic actuators. However, due to the uncertainty and nonlinearity in hydraulic systems, it is difficult to establish a precise model. Consequently, researchers have adopted data-driven methods to realize fault diagnoses. Data-driven methods usually include two main steps: fault feature extraction and selection using signal processing techniques, and fault classification using classifiers. Many features based on pressure signals have been found to be sensitive to internal leakage, such as the root mean square (RMS) values of level two detailed coefficients obtained using the wavelet transform [7–9], five features extracted through wavelet packet analysis [10], and the instantaneous amplitude of the first IMF obtained through the EMD technique [11]. However, to implement the abovementioned schemes, a baseline value must be predetermined, and this value cannot be automatically adjusted with the change in the working conditions. Hence, Li et al. proposed a featured temporal segmentation method and an AdaBoost-BP detector to automatically evaluate the internal leakage faults in hydraulic actuators [4]. Nevertheless, such traditional data-driven methods exhibit several notable deficiencies: (1) the features input to the classifiers are designed by diagnosticians depending on prior knowledge regarding the signal processing techniques and diagnostic expertise. (2) The adopted classifiers have shallow architectures, which limit the capacity of such classifiers to learn the complex nonlinear relationships in fault diagnosis scenarios [12]. These limitations can be overcome by using deep learning (DL) algorithms. The key property of deep learning models is the multiple layer structure. Using the multiple hidden layers stacked hierarchically, a deep learning model can realize the highly complicated transformation and abstraction of raw signals [13, 14]. Guo et al. adopted a convolutional neural network (CNN) that employed raw strain signals to output the internal leakage of the hydraulic cylinder [15]. However, the strain caused by microflows is extremely small under low pressure, that is, for pressure values lower than 7 MPa, and the abovementioned method cannot effectively diagnose the internal leakage in such cases. In other words, the existing methods cannot effectively diagnose internal leakage under different low loads.
In contrast to the internal leakage detection technologies based on the dynamic pressure or strain, the acoustic emission (AE) technology, as a nondestructive testing (NDT) technique with an extremely high sensitivity, can be used to detect weak internal leakage signals without hindering operations [16–20]. Furthermore, the AE technology exhibits several advantages over other nondestructive testing (NDT) techniques such as the vibration technology [21]. First, the defects are easier to detect because of the enhanced signal to noise ratio. Second, because this approach can detect mechanical abnormalities at high frequencies, it is highly sensitive compared to vibration techniques. In addition, the AE technology can be applied to all types of machinery, regardless of speed [22]. Nevertheless, owing to the use of a highly sensitive transducer, the measured raw signal, which often includes noise and irrelevant and redundant signals, must be subjected to noise removal to ensure reliable fault diagnosis. CEEMDAN-based denoising methods can self-adaptively decompose a signal into several monocomponents according to the local characteristic time scales of the signal, which can ensure a high decomposition performance in nonlinear and nonstationary processes [23]. Moreover, under different working loads of the hydraulic cylinder, the difference in the raw leakage signals of identical patterns increases, which hinders the fault diagnosis process. In this case, Pearson correlation coefficients can be used to account for the main characteristics that differentiate the fault modes while eliminating the difference caused by the different working conditions [24, 25]. Therefore, in this study, a reconstruction denoising strategy combining CEEMDAN with Pearson correlation coefficients is established to obtain the denoised signal. First, the sampled AE signals are decomposed into different IMFs by using CEEMDAN. Subsequently, the Pearson correlation coefficients between the raw signals and each IMF are calculated. The reconstructed data based on the IMFs with large Pearson correlation coefficients are obtained for further analysis.
The deep belief network (DBN) is a promising tool to realize fault characteristic mining and intelligent diagnosis in the case of massive data [19, 26–28]. In contrast to the CNN, the DBN avoids the process of transforming one-dimensional time series data. Moreover, the DBN does not rely on manual feature extraction and selection, and thus, it can automatically learn effective information from the raw signals. To reduce the influence of manual parameter setting on fault diagnosis accuracy, the hybrid particle swarm optimization simulated annealing (PSOSA) algorithm [29, 30] is adopted to identify the optimal structure of the trained DBN [31]. Moreover, experiments are conducted using a gas-hydraulic pressurized cylinder, which is easy to disassemble and can operate under low load conditions.
Overall, this paper proposes a method involving the optimization DBN in combination with the CEEMDAN technique. First, the CEEMDAN combined with the Pearson correlation coefficients is employed to eliminate noise from the raw AE signals for four different internal leakage levels. Subsequently, the denoised signals are directly fed to the optimization DBN by the PSOSA to identify the levels of internal leakage.
2. Theoretical Framework
2.1. CEEMDAN Theory
The process flow of the CEEMDAN algorithm can be summarized as follows [32, 33]: Step 1: Add a white noise series to the investigated signal, that is, , where i = 1, 2, 3, …, I. Decompose each by using the EMD to obtain the corresponding . Next, calculate the first CEEMDAN mode and first residue . Step 2: Decompose each by using the EMD, where i = 1, 2, 3, …, I, represents the k-th mode component of the given signal obtained through the EMD calculation. Define the second CEEMDAN mode as . For k = 2, …, K, calculate the kth residue, ; Step 3: Decompose each by using the EMD, where i = 1, …, I. Calculate the k + 1 CEEMDAN mode as .
2.2. DBN Principle
The DBN consists of one visible layer and three hidden layers, as shown in Figure 1. The learning process of DBN includes two stages: the unsupervised pretraining stage and the supervised fine-tuning stage. In the supervised fine-tuning stage, the labelled target data are input to the model, and the parameters are updated [34, 35]. Suppose that there exist I visible units and J hidden units in an RBM. The states of the visible layer and hidden layer are represented as and h, respectively. The RBM energy function can be defined as follows:where ; θ represents the model parameters; is the visible unit i; is the hidden unit j; is the weight connecting the visible unit i and hidden unit j; is the offset of the visible unit i; and is the offset of the hidden unit j.

The contrastive divergence (CD) algorithm is adopted to update the model parameters as follows:where denote the learning rates of the weights between 0 and 1; a and b represent the biases of the hidden layer and visible layer, respectively; and refer to the expectation derived using the training data distribution and reconstructed data, respectively.
2.3. Hybrid PSO-SA Optimization Method
The following steps are involved in the hybrid PSOSA algorithm [36–38]: Step 1: Randomly initialize the parameters, including the position , velocity , and iterations. Calculate the fitness value of each particle by using the fitness function defined in where refers to the number of misclassified data, and N refers to the total number of data. Next, update the best position of a particle and the best particle position of the swarm according to the history. Initialize temperature . Step 2: Update the positions and velocities of all the particles by using the following function: where is the inertia coefficient, . and are cognitive parameters. r1 and r2 are random values in the interval [0, 1]. Step 3: Evaluate whether the new solution is accepted by using the metropolis acceptance criteria as follows: Otherwise, calculate the acceptance probability . Step 4: Update the best position of a particle and the best particle position of the swarm considering the history. Step 5: If , terminate the cycle, and output the best solution and objective. Otherwise, gradually decrease the temperature , and return to step 3.
3. Experimental Setup
3.1. Test Rig
As shown in Figure 2(a), the test rig consists of a gas-hydraulic pressurized cylinder 1, Jack 2, and pneumatic device 3.

The operating principle of the gas-hydraulic pressurized cylinder is shown in Figure 3. The red, green, blue, purple, and black regions correspond to the high-pressure air, low-pressure air, pressure-free hydraulic oil, preloading hydraulic oil, and high-pressure hydraulic oil, respectively. AE sensor 3 with an electromagnet is fixed on the mounting plane. The piston of rod 2 uses a y-sealing ring. Notches with different sizes are set on the sealing rings for a load of 1 MPa and 2 MPa to simulate four internal leakage levels corresponding to no internal leakage, small internal leakage, medium internal leakage, and large internal leakage. The seal rings with different size notches under 1 MPa are illustrated in Figure 2(b). The leaking oil flows out through the P1 port, and the internal leakage rate of the flow is determined using a measuring cup and stopwatch. The operating principle of the gas-hydraulic pressurized cylinder can be defined as follows:(1)As shown in Figure 3(a), the P2 port connects the high-pressure air, the P3 and P4 ports connect the low-pressure air, the P1 port is the drain port, rods 1 and 2 retract, and the hydraulic oil is located in the oil tank.(2)As shown in Figure 3(b), the P4 port connects the high-pressure air and preloading hydraulic oil. The hydraulic oil enters into nonrod chamber of the cylinder, and rod 2 extends.(3)As shown in Figure 3(c), the P3 port connects the high-pressure air, and P2 port connects the low-pressure air. Rod 1 extends, thereby pressurizing the hydraulic oil. The leaking hydraulic oil flows out through the P1 port.(4)As shown in Figure 3(d), the P2 port connects the high-pressure air, P3 port connects the low-pressure air, rod 1 retracts, and pressure relief occurs. Rod 2 is compressed by Jack 4, and the hydraulic oil flows back to the oil tank.

(a)

(b)

(c)

(d)
3.2. AE Sensor and Data Acquisition Device
As shown in Figure 4(a), AE sensor 1 is the PXR04 device manufactured by Changsha Pengxiang Electronic Technology Co. LTD. This device has a frequency range and resonant frequency of 30–140 kHz and 40 kHz, respectively. Amplifier 2 shown in Figure 4(b) has a fixed gain of 100 dB. Data acquisition card 3 pertains to NI-6340 manufactured by National Instruments; this device has a maximum acquisition frequency and output voltage amplitude of 400 kHz and 10 V, respectively.

4. Internal Leakage Diagnosis Method Based on the Optimization DBN
The proposed diagnosis method based on the optimization DBN can be described as follows: Step 1: Define four internal leakage levels of the hydraulic cylinder under two loads (1 and 2 MPa). Collect raw AE signals and decompose the signals into a set of IMFs through the CEEMDAN. Step 2: Calculate the Pearson correlation coefficient between each IMF and the raw AE signals. Next, select the first five sensitive IMFs to obtain the reconstructed signals by considering the decreasing correlation coefficient values. Step 3: Divide the reconstructed signals into training and testing samples. Train the optimization DBN by using the PSOSA algorithm and training samples. Use this network to classify the internal leakage levels by considering the testing samples. These steps are implemented to apply the proposed method to diagnose four internal leakage levels, and the results are recorded.
4.1. Signal Reconstruction and Data Description
According to the Shannon sampling theorem, the sampling frequency of the data acquisition card is set as 400 kHz. The AE signals in different internal leakage levels under a load of 1 and 2 MPa are shown in Figures 5 and 6, respectively.


The original signals are preprocessed using CEEMDAN. The Pearson correlation coefficient represents the correlation between each IMF and the original signal. The Pearson correlation coefficient between and the original signal data sequence is calculated as follows [24]:where T is the number of points in the original signal data sequence; and denote the mean values of and , respectively.
To eliminate the noise and enhance the signal to noise ratio, the first five IMFs are used for signal reconstruction. The reconstruction process can be expressed aswhere denotes the reconstructed signal; , , , , and are the first five IMFs.
In this case study, datasets A and B are established to verify the performance of the proposed method. Datasets A and B include the data for the four conditions under loads of 1 and 2 MPa, respectively. Both datasets contain 100 training samples (each sample has 2000 data points) and 50 testing samples for each condition. The sample distributions for the four conditions in datasets A and B are presented in Table 1.
4.2. Comparison Approach
In this case study, PSOSA is applied to develop the optimization DBN based on the training samples. The parameter settings for the hybrid PSOSA are presented in Table 2. Table 3 lists the values of the parameters for the optimization DBN derived using the PSOSA. The optimal architecture of the DBN is 2000-860-284-78-4. The input and output layers consist of 2000 and 4 neurons, respectively, which are set based on the dimensions of the input data and fault patterns. The learning rate and momentum obtained using the PSOSA are 0.48 and 0.23, respectively.
To illustrate the effectiveness of the proposed method in actual applications of the fault diagnosis for the internal leakage, comparative experiments are conducted considering the following six methods:(1)DBN method: The original AE signals are used directly as the input of the optimization DBN.(2)EEMD + DBN: The reconstructed AE signals based on the EEMD method are used as the input of the optimization DBN.(3)CEEMDAN + DBN: The reconstructed AE signals based on the CEEMDAN method are used as the input of the optimization DBN.(4)CEEMDAN + BP: The reconstructed AE signals based on the CEEMDAN are used as the input of the BP network. To enable a fair comparison, the BP network and optimization DBN used the same data, architecture, learning rate, momentum, batch size, and epochs.(5)T + BP: the original AE signals are preprocessed to extract the time domain features including the mean value, root mean square, peak, skewness, kurtosis value, crest factor, clearness factor, shape factor, and impulse factor. Next, the 9 selected features are used as the input of the BP network with 1 hidden layer. The architecture of the BPNN is 9-78-4.(6)T + F + BP: The original AE signals are preprocessed to extract the time and frequency domain features. The 9 time domain features are the same as those for method 5. The 4 frequency domain features include the mean frequency, frequency centre, root mean frequency, and standard deviation frequency. Next, the 13 selected features are used as the input of the BP network with 1 hidden layer. The architecture of the BPNN is 13-78-4.
The six methods are run 20 times, and the average classification accuracy is determined, as listed in Table 4.
The results show that the proposed method exhibits a considerably higher classification accuracy than that of the traditional methods based on the BP network. The BP network has the same architecture as the DBN, and the same reconstructed AE signals without any feature extraction are used to train the DBN and BP network in methods 3 and 4. The comparison of the diagnosis accuracy of methods 3 and 4 indicates that the accuracy of the DBN is considerably higher than that of the BP network when managing massive data. This phenomenon occurs because deep learning methods can learn the valuable information from the input data through multiple feature transformations. According to the comparison of the diagnosis accuracy of methods 5 and 6, different features yield different results, which indicates that the realization of the accurate classification of BP network requires manual intervention to select the proper features to implement the fault diagnosis.
In addition, the DBN combined with the CEEMDAN to realize the denoising in method 3 exhibits a higher accuracy than that of the DBN based on the original AE signals (method 1). This finding demonstrates that the proposed method can effectively denoise signals and enhance the robustness of the fault diagnosis results. Furthermore, the CEEMD + DBN outperforms the EEMD + DBN, and in contrast to the EEMD, the CEEMDAN can decrease the reconstruction error and avoid the problem of different numbers of modes for different realizations of the signal plus noise. Furthermore, the proposed method can achieve a higher classification accuracy than that of the other methods in datasets A and B. The results demonstrate that the proposed method can effectively diagnose the fault types even under load fluctuations.
5. Results and Discussion
The Pearson correlation coefficients are used to select the sensitive IMFs for further processing. However, no prior knowledge is available to set the number of sensitive IMFs for further reconstruction. Therefore, an investigation is conducted to determine the number of sensitive IMFs. According to the decreasing correlation coefficient values, the number of sensitive IMFs is increased sequentially from 1 to 12. The procedures are repeated 20 times, and the average accuracy for the testing samples of datasets A and B is separately determined.
The average classification accuracy varies with the different numbers of selected IMFs, as shown in Figure 7. Initially, the accuracy increases with the number of selected IMFs. However, the accuracy decreases as the number of selected IMFs increases to 12. The result indicates that, in the initial stage, as the number increases from 1 to 5, the selected IMFs include considerable fault-related information. As the number increases from 5 to 12, the selected IMFs contain an excessive amount of fault-unrelated information or a high degree of overlap. As the number of selected IMFs increases, the accuracy exhibits the opposite trend. Thus, the first five sensitive IMFs with a large correlation coefficient are considered for subsequent reconstruction in this paper.

To observe the clustering result of the proposed method, the principal component analysis (PCA) is performed on the input and output data of the third hidden layer. A certain confusion occurs among the four conditions in the input data, as shown in Figures 8(a) and 9(a). After the 3 hidden layers of the DBN, as shown in Figures 8(b) and 9(b), most samples are correctly assigned to the corresponding cluster. The results indicate the effectiveness of the proposed method in mining the fault characteristics and classifying internal leakage conditions.

(a)

(b)

(a)

(b)
6. Conclusions
This paper proposes a fusion method that combines the optimization DBN with the CEEMDAN to realize the internal leakage diagnosis of hydraulic cylinders under different low loads. The main contributions of the paper are as follows: (1) first, the AE signals are examined to be used for the hydraulic cylinder internal leakage diagnosis. Based on the AE fault signals, the CEEMDAN-based denoising method and optimal number of representative IMFs for signal reconstruction are investigated. (2) The optimization DBN is obtained using the hybrid PSOSA algorithm instead of subjective human experience. The performance of the proposed method under identical loads is superior to those of the existing state of the art methods. (3) The proposed method can effectively diagnose faults under different low loads, which cannot be easily realized through the existing methods.
Data Availability
The data used to support the findings of this study are available from the corresponding author upon request.
Conflicts of Interest
The authors declare no conflicts of interest.
Acknowledgments
This research was funded by the National Natural Science Foundation of China, grant number (51975425).