Abstract

Feature extraction from vibration signal is still a challenge in the area of fault diagnosis and remaining useful life (RUL) estimation of rotary machine. In this paper, a novel feature called phase space similarity (PSS) is introduced for health condition monitoring of bearings. Firstly, the acquired signal is transformed to the phase space through the phase space reconstruction (PSR). The similar vibration always exists in the phase space due to the comparable evolution of the dynamics that are characteristic of the system state. Secondly, the normalized cross-correlation (NCC) is employed to calculate the PSS between bearing data with different states. Based on the PSS, a fault pattern recognition algorithm, a bearing fault size prediction algorithm, and a RUL estimation algorithm are introduced to analyze the experimental signal. Results have shown the effectiveness of the PSS as it can better grasp the nature and regularity of the signals.

1. Introduction

Currently, fault diagnosis is an extremely significant research field in industry; intelligent health monitoring is gradually supplemented in maintenance of the machine to ensure the stability of systems and decrease the downtime. Rolling bearing is a key component in the rotary machine, and faults occurred frequently, leading to a disastrous accident. Typical damage of bearing is located at outer race, ball, and inner race. One of the challenges for health monitoring is sensitive feature extraction from sensor signals that are often mixed by noise [1, 2]. Representative feature extraction methods are based on time-domain analysis and frequency-domain analysis [35]. Although the above methods cannot extract the nonlinear features from the vibration signals, the time-frequency analysis can overcome this problem, investigating a signal in both time and frequency domains [6]. In addition, empirical mode decomposition and wavelet analysis are able to express the nonstationary signals well and can also extract the sensitive features [7, 8]. Nevertheless, the extracted features are still redundant and with high dimension. Therefore, the advanced dimension reduction techniques are proposed to solve the aforementioned problem such as principal component analysis (PCA) [9] and independent component analysis (ICA) [10], and they have been employed to extract sensitive features from high-dimensional data.

Chaotic time series analysis is an excellent method for nonlinear signals, which can grasp the nature and regularity of the time series [11]. And it has been verified in signal processing. In the literature [12], vibration signal is reconstructed by PSR in which the phase trajectory of different kinds of signals will demonstrate different structures. In the literature [13], the PSR is combined with time-frequency synthesis and utilized in data denoising. In these studies, the PSR is introduced for fault diagnosis and RUL estimation, which can extend time series to a high-dimensional phase space and maintain topologically equivalent. Theoretically speaking, a time series can sufficiently reconstruct an original dynamic system by PSR [14].

On the one hand, the template matching is widely applied to image analysis, which can calculate the dissimilarity or similarity between the template and the image. Then the position with maximum correlation is taken to represent the instance of the template into the image [15]. Classical correlation matching methods are as below: maximum close distance (MCD), minimum absolute difference (MAD), and NCC [16]. Among them, the NCC has been widely used in correlation measure because it proved to be good at template matching [17, 18]. In these studies, NCC is utilized in sensor signal processing. Besides, the NCC is combined with PSR in order to serve for feature extraction. The new feature was called PSS. Furthermore, template matching that is based on the proposed feature is employed in the recognition of bearing fault patterns and estimation of bearing RUL.

On the other hand, on the basis of these features that are extracted from the sensor signals, machine learning is widely used in machine fault diagnosis such as artificial neural networks (ANN) [19] and support vector machine (SVM) [20]. Although these strategies have been successfully utilized in fault diagnosis, the ANN still has a slow convergence and poor generalization ability and the SVM is only being used to handle a binary problem. What is more, the aforementioned methods are inefficient or even infeasible in regression problem. According to the study of the literatures [2123], the support vector regression machine (SVR) can establish a stable nonlinear relationship between inputs and outputs and then make time series prediction. As a result, the SVR is introduced for bearing fault size prediction so as to overcome the above limitation.

In this research, firstly, the autocorrelation function is used to determine the delay time. Meanwhile, Cao’s algorithm is used to determine the embedding dimension [24], and then all of data are transformed to the phase space by PSR. Thereafter, a sliding window is employed to obtain the data segment and NCC is adopted in template matching to realize the recognition of bearing fault patterns. Secondly, a SVR structure is trained by PSS, which aims to quantitatively predict the fault size. Finally, the RUL of the tested bearing is estimated by template matching that is based on PSS. Going further, three applications based on PSS are verified by case studies; the experimental results show that the proposed method can effectively realize the fault diagnosis and RUL estimation for bearing.

The rest of the paper is organized as follows. Section 2 gives a brief introduction of PSR, NCC, and SVR. The proposed fault pattern recognition, fault size prediction, and RUL estimation procedures for bearings are illustrated in Section 3. Experimental validations and the comparison with original signal’s similarity and root mean square (RMS) are presented in Section 4. Finally, the conclusions are drawn in Section 5.

2. Theory Background

2.1. Phase Space Reconstitution (PSR)

The research of PSR on chaotic time series was started by Packard et al. [25]. The underlying dynamics characteristic of a system can be exhibited in the phase space. In this research, the time delay embedding is applied to characterize the nonlinear feature of time series. Suppose that the time series is , and a phase point is constructed as where is the number of points in time series, is the index of the phase point, which ranges from 1 to , the embedding dimension is chosen as , and is the time delay. According to the literature [26], the embedding dimension and the time delay of the ideal time series (infinite length and noiseless) can be chosen as any value. However, the sensor signals are distributed with ambient noise. Therefore, it is difficult to determine the parameters of PSR [27].

The adjacent components in the phase space will became similar when the time delay is selected as a low value. By contrast, the components will be independent in statistics with a high value of time delay. Consequently, the autocorrelation function is utilized in this research for time delay identification; it is defined aswhere . According to Rosenstein’s theories [28], the minimum value of time delay that firstly makes lower than can be considered as the most optimal time delay, where is the natural constant that is approximately equal to 2.718.

After that, Cao’s algorithm is employed to determine the embedding dimension. In accordance with (1), the th phase point is written as ; suppose that the nearest phase point is . In the light of the embedding theorem, if is qualified as embedding dimension, then any two points that are close in the dimensional reconstructed phase space will continue to be close in the dimensional space:where indicates the relative distance increment and indicates the mean statistics of the . will be gradually stabilized with the increase of ; therefore, the minimum that stabilizes is the embedding dimension. However, it is difficult to judge that has been steadied owing to the finite length of obtained samples. Therefore, supplementary criteria are proposed as follows:

Suppose that the time series is random, is equal to 1. In contrast, as to a deterministic sample, will be changed with the increasing of . The appropriate embedding dimension can be determined by observing the curve of both and [29].

2.2. Normalized Cross-Correlation (NCC)

Classical correlation matching methods are as follows: MCD, MAD, and NCC. In this research, the NCC is employed to calculate the similarity because it is invariant to a linear transformation and is less sensitive to noise [16]. Suppose that the sample phase space is and the current phase space is , where is the number of phase points. The PSS is expressed as where is a part of and it is obtained by a sliding window with length and and are the mean vectors of and .

2.3. Support Vector Regression Machine (SVR)

SVR was developed for solving regression problems, which maximizes predictive accuracy and avoids overfitting simultaneously [30]. Suppose that the training sample is , , where is the input features and is the label of experiment condition. It can be represented by a linear function of , where identifies the weight vector and corresponds to the threshold coefficient. The SVR function can be more comfortable by using the -insensitive loss function as follows:

Flatness of linear function can be ensured by minimizing the , leading to the following convex optimization problem: with regularization parameter and the slack variables and . Then we get the following convex quadratic programming problem by introducing a Lagrange function:where and are the Lagrange multipliers. The function can be expressed as follows:

The regression function is

3. Methods and Procedures

In this paper, a novel feature extraction method that is based on PSR and NCC is proposed for fault diagnosis and RUL estimation. The framework of the proposed method is shown in Figure 1, where the signals come from bearing by accelerometer sensor. In the step of data reconstruction, PSR is applied to embed the vibration signals into the high-dimensional phase space. Assumed that the th sample signal is , where is the number of signal points and ranges from 1 to . Then the autocorrelation function is employed to determine the delay time; Cao’s algorithm is employed to determine the embedding dimension in th sample of database. To simplify, the embedding dimension is unified to , where , and the th sample phase space is described in where is the th phase point. Suppose that is the current data; in order to reduce the influence of the phase offset, the sliding window is utilized to obtain the segment from current data, where the window width is and step length is . The segment is written as and the corresponding phase space is given inwhere is th phase point, changes from 1 to , and changes from 1 to . Thereafter, the PSS between and is calculated in accordance with (5); suppose that PSS- is the maximum value between current data and th sample data. Finally, the PSS is employed to health condition monitoring and it can contribute to the recognition of bearing fault patterns, prediction of fault size, and RUL estimation. The detailed description of the three applications is described as follows.

(1) Recognition of Bearing Fault Patterns. Taking all fault patterns signal as database, the maximum PSS from PSS- will be found and the corresponding fault pattern is considered as the current fault pattern. The framework of the proposed method is shown in Figure 2.

(2) Prediction of Fault Size. As described in Figure 3, take normal vibration signals as database and bearing vibration signal with different fault size as the train data. The PSS between train data and normal signal as the training sample is to train the SVR structure, and the PSS between test data and normal signal as the testing sample is used for the trained SVR structure. Finally, the fault size of test data can be predicted by SVR.

(3) RUL Estimation of Rolling Bearing. The framework of the proposed method is shown in Figure 4. Take vibration signals of the entire life cycle as database, which are acquired from bearing run-to-failure and chronological arrangement. Due to the random fluctuation of signals, the accuracy of template matching will be decreased. Therefore, the similarity metric is improved as follows: where is the current operating time. In this similarity metric, the factor of operating time has been considered. If gets the maximum value at , where the current health condition is most similar to the th segment, then is assigned to . The RUL of the current data can be calculated aswhere denotes the failure time or lifetime that has been recorded in tested bearing sample.

4. Experiment Validation

4.1. Validation of Fault Patterns Recognition

The rolling bearing fault data of the Case Western Reserve University [31] are used to verify the proposed method; the vibration data are recorded on the test bench given in Figure 5. The test platform consists of four parts that are a motor, a dynamometer, a torque transducer, and a control electrometer. The resulting vibration was measured by an accelerometer being mounted to the motor housing at the fan end of the motor. Single-point faults of size 0.007, 0.014, and 0.021 inches were set on the fan end bearings by using the electric discharge machining approach. These faults were set on ball, inner race, and outer race. The motor speed was 1797 r/min, the sampling frequency is 12 kHz, and motor load is 0.

All fault locations and sizes are listed in Table 1, which includes 10 patterns; every pattern contains 122426 points, and then divided into 60 samples, each sample has 512 points and is without overlap between adjacent samples; the 60 samples are divided into 30 training samples and 30 testing samples. Therefore, this is a complex classification problem of ten patterns, which requires comprehensive identifying of both the fault location and the fault size.

At the very beginning, the embedding dimension and the time delay should be determined as described in Section 2.1. Take inner-0.014-inch-fault case as an example, the curves of , , and are shown in Figure 6.

It can be seen that tends to a stationary value when is larger than 8 and changes with the increasing of . The same conclusions are drawn for other cases. Hence, the embedding dimension is set as 8. And is lower than when equals 2; hence, the time delay is set as 2.

Thereafter, a sliding window with a window width of 64 and step length of 16 is used to intercept the segments from the current data. Take inner-0.014-inch-fault case as current data, the maximum PSS between current data and every fault pattern is calculated and the results are shown in Figure 7.

It can be found that PSS of pattern 3 is generally higher than others; therefore, the fault patterns of current data are most similar with the health condition of pattern 3. Thus, the current fault pattern is considered as inner-0.014-inch-fault pattern. Afterwards, all of testing samples are processed as above-mentioned methods that are based on PSS. In order to demonstrate the improvements, the proposed method is compared with the recognition method that is based on original signal similarity and SVM such that the classical feature RMS is selected as input. The results are shown in Table 2.

From Table 2, it can be seen that the recognition of fault patterns based on PSS has a higher accuracy. Moreover, the accuracy of the ball-fault (pattern 5 to pattern 7) recognition is lower than other fault patterns because the vibration signal of defective ball is more complicated and with strong randomization.

4.2. Prediction of the Fault Size

To evaluate the effectiveness of the proposed feature for fault size prediction, the SVR is adopted in this section. The PSS based on two-class mode is proposed to predict the fault size. At first, take normal bearing vibration signal as one class and bearing vibration signal with different fault sizes as another class. The results of PSS are shown in Figure 8.

From Figure 8, the classes 1–4 are represented as fault sizes 0, 0.007, 0.014, and 0.021 inches, respectively. The height of the bar is the PSS between corresponding fault size signal and normal signal. It can be seen that the PSS gradually decreases with the increasing of the fault size. After that, consider the PSS of classes 1, 2, and 4 as the training sample to train the SVR structure and the PSS of class 3 as the testing sample. In this study, the kernel function of SVR is set as RBF kernel , where and are represented as feature vectors, can be recognized as the squared Euclidean distance between the two feature vectors, and is a free parameter. Finally, the fault size of the testing sample will be regression-analyzed and predicted by SVR. The results of prediction are presented in Figure 9.

The maximum error is utilized to determine the accuracy of the fault size prediction and is defined as ; the error is 0.043, 0.049, and 0.019 inches, respectively.

The values of RMS of three bearing fault cases are shown in Figure 10. It can be seen that the values of RMS are nonmonotonic with growth of fault size, so it was not suitable in fault size prediction.

4.3. RUL Estimation of Rolling Bearing

To evaluate the effectiveness of the proposed feature for RUL estimation, the entire life cycle data of bearing is originated from the Center for Intelligent Maintenance Systems [32]. The experimental data sets are generated from run-to-failure bearing, which is tested on the designed test platform. Four bearings are installed on a shaft. The rotation speed is kept at 2000 r/min by using an AC motor coupled to the shaft via rub belts. A 6000 lb radial load is applied onto the bearing and shaft by using a spring mechanism. A PCB 353B33 High Sensitivity Quartz ICP Accelerometer was installed on bearing housing. A vibration data sample of 20480 points was collected every 10 minutes by a National Instruments DAQCard-6062E data acquisition card. The sampling rate is set at 20 kHz, the vibration data are recorded on the test bench given in Figure 11, and more details can be found in [33].

Taking outer-race-fault case as an example, the curves of , , and are shown in Figure 12. From Figure 12(a), it can be seen that keeps stable when is larger than 8, and randomly fluctuates with . Figure 12(b) shows changes with the increase of delay time, and the minimum that makes lower than could be considered as the most optimal time delay. Consequently, the time delay is set to 2 and embedding dimension is set to 10. Then, the entire life data is reconstructed by PSR, and the similarity between the two adjacent data is calculated by NCC and shown in Figure 13.

There are three health states: normal operation stage, slight degradation stage, and serious degradation stage. It can be seen that the value of PSS becomes higher in the serious degradation stage. The reason is that the vibration becomes more regular when the failure occurs. PSS can better reveal the potential links between adjacent signals, which have been proved in the front part of this research. At the same time, PSS begins to increase in the region of 500-phase point, which can be considered as the threshold for the beginning of the RUL estimation. After that, the original data are divided into reference data and test data; reference data are viewed as the database; test data are considered as the current data to estimate the RUL. The results are shown in Figures 1416.

It can be seen that the estimated RUL is close to the actual RUL. Furthermore, the comparisons with the original signal similarity and RMS are conducted to prove the superiority of the proposed method in RUL estimation. The results of estimation based on original signal similarity are shown in Figures 1719. Three criteria defined in the following equation are applied to evaluate the accuracy of the RUL estimation: where is the estimated RUL, is the actual RUL, and is the length of the data points. The estimation results of the last 15 h are shown in Table 3.

As for the findings in Table 3, the proposed method has a higher accuracy, which indicates that the PSR can better grasp the nature and regularity of signal. It is effective in the RUL estimation of bearing, especially in the serious degradation stage.

5. Conclusions

In this paper, a novel method based on PSS is proposed to recognize the fault patterns, predict the fault size, and estimate the RUL of rolling bearing. The PSR is employed to transform the vibration signal into high-dimensional phase space. The NCC is employed to calculate the similarity between two health conditions, which is not affected by the scaling and shifting of the signals. The proposed method has two important characteristics. Firstly, the nonlinear vibration signal of bearing is represented by phase space, which can better grasp the nature and regularity of the signals. Secondly, the proposed feature of PSS is utilized in template matching and as a powerful feature for data-based bearing fault size prediction. To verify the effectiveness of the proposed method, the PSS is evaluated by the actual data acquired from bearing and compared with the original signal similarity. The experimental results show that the proposed method has a satisfying performance. According to the analysis, the PSS is also been suited to other machine faults diagnosis and RUL estimation, such as gears and cutting tools.

Moreover, owning to the limited samples, the SVR structure cannot be fully trained to obtain a better prediction result. Part of the validation is not very comprehensive, which is the deficiency of this paper. Therefore, more experiments will be carried out in the future. At the same time, the technical fields in the proposed PSS can be further improved and strengthened.

Competing Interests

The authors declare that they have no competing interests.

Acknowledgments

The authors would like to thank Case Western Reserve University and Center for Intelligent Maintenance Systems (IMS), University of Cincinnati, which provided free download of the rolling element bearing fault data sets. This work is supported by the National Natural Science Foundation of China (51505001), Education Department Key Projects of Anhui Province (KJ2013A010), and Natural Science Foundation of Anhui Province (1508085SQE212 and 1408085ME81). The valuable and constructive comments from the reviewers are sincerely appreciated for their help to further improve the paper.