Abstract
Rolling bearings, as important parts on supporting rotating shafts, frequently suffer from fatigue failures. If these rolling bearing failures are not found in time, it will have a huge impact on the whole mechanical system’s operating safety and operating life. To improve the diagnosis of different faults as well as different degrees of faults, a fault diagnosis method based on the multifractal detrended fluctuation analysis (MFDFA) method-singularity power spectrum (SPS) with extreme learning machine (ELM) is proposed. First, MFDFA and SPS analyses are performed on vibration acceleration signals with different faults and different degrees of damage under the same operating conditions, the spectral parameters of stability and quantitative description of differentiation are selected for feature extraction, and then the selected six feature parameters are put into the extreme learning machine for fault classification. The effectiveness of the MFDFA-SPS feature extraction method is demonstrated by analyzing and testing the measured bearing signals. The fault diagnosis accuracy of the bearing fault signals can reach 99.2% based on the MFDFA-SPS with ELM method by using the Case Western Reserve database. The improvements are 6.79% and 18.42% compared to the fault diagnosis methods based on MFDFA with ELM and SPS with ELM. Compared with the methods based on MFDFA-SPS with LSSVM classifier and SVM classifier, the accuracy improvements are 3.54% and 4.25%, respectively. The results show that the method proposed in this paper can achieve the diagnosis of bearing faults and the method based on MFDFA-SPS with ELM is more efficient than the methods based on MFDFA-SPS with LSSVM and SVM classifiers, which is suitable for practical engineering problem-solving.
1. Introduction
Rolling bearing is one of the important components of the mechanical transmission system, due to corrosion and fatigue factors, and easy to cause bearing abrasion, pitting, and cracking failures [1]. Different faults cause different vibration noise and increased rotational resistance, making the whole transmission system life decay or even fail [2]. Therefore, it is important to ensure the normal operation of the bearings in the gearbox or drive shaft and the timely diagnosis and analysis of the structural components after a failure occurs for the normal operation of the whole unit.
Since the concept of “fractal” was first introduced by Mandelbrot in the 1980s [3], this theory of studying nonlinear systems has been widely applied in many fields and has achieved fruitful results. The object of study of fractal theory is not smooth and irregular geometry in nature, and the bearing fault diagnosis method based on fractal theory is one of the more suitable and effective methods for bearing fault identification. Commonly used fractal theory basis includes iterative function system (IFS), fractal Brownian random field, single fractal, extended fractal, multifractal, and multifractal correlation [4, 5]. The difference in internal signal characteristics between faulty vibration signals and normal operation vibration signals is usually used in research for fault diagnosis by extracting fractal geometric change features.
For decades, scholars worldwide have proposed many methods for fault diagnosis of bearings, and the current mainstream diagnosis methods are the temperature analysis method [6] and vibration analysis method [7]. The enriched fault features such as cyclic power spectra [8] and vibration cyclostationarity [9] parameters can be extracted when using bearing vibration signals for fault monitoring, which enables excellent diagnostic performance. The vibration analysis method mainly performs digital signal acquisition of the faulty bearing vibration signal, and its acquisition method is low cost and contains rich fault information in the vibration signal, which is convenient for real-time diagnosis and analysis.
For the analysis of nonlinear signal characteristics from the perspective of multifractals, Shao et al. [10] used the fractal box dimension method to quantitatively characterize the vibration signals of faulty bearings and found that the fractal dimensions of the faults differed from each other in their manifestations. Liu et al. [11] used detrended fluctuation analysis (DFA) to extract features of wind turbine bearing faults and proved that the fault characteristic parameters can be obtained by multifractal detrended fluctuation analysis (MFDFA). Lin and Chen [12] used the shape and location feature parameters of the multifractal spectrum obtained by the detrended fluctuation analysis and combined with the martingale repulsion to achieve the identification of the bearing damage degree. Chen et al. [13] proposed a fault prediction method based on fractal dimension using an extreme learning machine (ELM). Xiong et al. [14] used multifractal detrended fluctuation analysis methods with PSO optimized LSSVM classification method to achieve fault diagnosis of bearings. Numerous worldwide research papers have demonstrated that fractal theory can be applied to faulty bearing vibration signal processing, and the difference lies in the different effects of using different feature parameters’ extraction criteria and classifiers. At present, the classifier based on combining multifractal theory and deep learning is the current niche use diagnosis method, and the diagnosis effect is fast and has a good diagnosis rate with further development trends. ELM and deep learning are complementary, and some applications combine the two to get promising results, such as using CNNs for feature extraction [15] and ELMs for classifiers.
In summary, to further realize the applicability of fractal theory to bearing fault analysis and diagnosis, this paper proposes a rolling bearing composite fault method based on MFDFA-singularity power spectrum (SPS) and ELM. The method is a quantitative description of nonlinear signals using multifractals for feature extraction. ELM is applied for further exploration of multiple classifications, and the validity of the method is verified by the actual measurement of many different types of faulty bearing signals.
This paper is organized as follows: Section 2 describes the specific model of the MFDFA algorithm; Section 3 describes the specific steps of the SPS algorithm; Section 4 presents a brief description of the ELM tool and the way to extract features using the algorithms in Sections 2 and 3; Section 5 describes the source of data, the specific feature parameter distribution, and the results of the experiments; the main findings and conclusions of the article are summarized in Section 6.
2. The Multifractal Detrended Fluctuation Analysis
The MFDFA algorithm implementation steps are described as follows [12]: Step 1: for a times series of length , , construct a new sequence : Step 2: partition the sequence into mutually disjoint equal-length subintervals. There are data points in each interval, . Since the sequence length is not necessarily an integer multiple of , then will have segments remaining, and to make the sequence of the last small segment processed, the partitioning process can be repeated from the tail of the sequence , thus obtaining equal-length subintervals. represents the order of polynomial fit, generally taken as . Here, MFDFA can be denoted as MFDFA-, and when takes the value of 1, the method is denoted as MFDFA-1. The th-order polynomial in the th subinterval is fitted by the least-squares method, and the residual corresponding to the label in this interval is calculated as follows: Step 3: calculate the mean square error separately: Step 4: calculate the global root mean square value of order , when , with When , the calculation formula is Step 5: change the value of scale to obtain the corresponding ; if the signal sequence has a power-law relationship scale-invariant signal, then where is the -order Hurst exponent. Find the scaling range with a linear relationship between and . Find the slope of its linear regression line, denoted as , by calculating the first-order polynomial fit coefficient between the scale and the logarithmic value of the function in the scaling range. If the relationship between and is -shaped curve, the value of cannot be estimated. Step 6: the correlation between and the multifractal mass exponent can be expressed as where denotes the topological dimension of the multifractal signal, and for a one-dimensional time series of signals, . Step 7: the singularity intensity function and the singularity dimension can be obtained by the Legendre transformation:
In the process of changing value from negative to positive, if remains almost constant, the relationship between -order mass exponent and is linear; then, the signal has single fractal characteristics and the width of the singularity spectrum is small and tends to 0. On the contrary, if changes to a large extent, the relationship between -order mass exponent and is not linear; then, the signal has multifractal characteristics and the width of the singularity spectrum is large. In this paper, we use MFDFA-1 method with one-direction partition process, just do the partitioning process from head to tail of the sequence , thus obtaining equal-length subintervals.
3. The Multifractal Singularity Power Spectrum Algorithm (SPS)
3.1. SPS Algorithm
Suppose a set of signals or time series is a continuous signal.
Defining as a fractal subband signal or a fractal subset of signal , the time-indexed set corresponding to constitutes the set ; then, is denoted aswhere is the singularity exponent of at the time . According to the theory of multifractals, is a thick subset supported by a tight set . Meanwhile, form the segments of , where are mutually disjoint, and we obtain .
All measurable subsets constitute the set system , and is a topological measurable set. There exists a measurable function , and constitutes a measurable space.
Based on the singularity decomposition, we can obtain the singular subset of under the fractal truncation signal. The fractal energy measure of can be expressed aswhere is the singularity measure of each fractal subset and the singularity power spectral density of . It reflects the distribution of the fractal power of with the singularity exponent. The fractal power analysis of the singularity fractal subset gives the singularity power spectrum of as
The fractal power measurement of the signal sequence and the singularity power spectrum distribution satisfy the following relationship:
Therefore, the SPS distribution function of the signal can be defined as a limit form:where is the local orientation angle of the truncated signal (equivalent to the tangent direction of a conventional curve) and denotes the differential of the Hausdorff measure of . Based on the modulus squared of the signal, the local azimuth of the fractal time element, and the Hausdorff measure, equation (15) represents the power distribution of the fractal signal concerning the singularity exponent. Thus, the fractal energy measurement and the singularity power spectrum satisfy the relationship
For fractal signals with a period or finite time support , . Singularity energy spectral density and singularity power spectral density functions can be defined as
It can be obtained that the singularity power spectrum reflects the fractal energy per unit singularity exponent interval, and it reflects how the fractal energy is distributed with the variation of the singularity exponent.
3.2. Specific Steps of the SPS Algorithm
The multifractal singularity power spectrum (SPS) algorithm is a quantitative analysis method to study the correlation of a sequence [16]. The steps of the SPS algorithm are as follows: Step 1: calculate the instantaneous singularity exponent: for a given time series , estimate the instantaneous singularity exponent of the series. Step 2: construct the singularity subset signal: according to the instantaneous singularity exponent , set the singularity variable change interval , where , and divide the interval into discrete singularity exponent sets according to the singularity exponent interval equivalence, where . is the number of discretized singularity indices. Step 3: according to the interval segment of the singularity exponent after discretization, the singularity decomposition of this time series is performed; the data points whose singularity indices fall in the interval , are specified as subsets belonging to the same singularity exponent , and the data points whose singularity indices fall in the interval , are specified as subsets belonging to the same singularity exponent ; thus, it is possible to obtain the fractal subsets with the same singularity exponents. We denote the subset of the same singularity exponent by and the total sequence by , , where denotes the set of all moments in the time series and has the instantaneous singularity exponent value . The number of points in the subset is related to the specific calculation, and the subset corresponding to each singularity exponent is denoted as . Step 4: calculate the average power of the singularity subset with the same singularity exponent : based on doing the singularity subset decomposition on , for the singularity subset , for a given singularity exponent , calculate the power of the subset (time series) with the same singularity exponent; for the convenience of presentation, denote this time series with its corresponding singularity subset as and its discrete form as , where is, respectively, the number of discrete point sets contained in the time series , as the average power . Step 5: calculate the singular power spectrum: iterate over all the singularity indices to obtain the singularity power spectrum .
4. Extreme Learning Machine
The ELM algorithm [17] has the advantages of strong generalization ability and fast learning speed, which overcomes the disadvantages of a traditional neural network learning algorithm (such as BP neural network algorithm) that requires continuous iteration, many training times, low learning efficiency, and slow convergence speed, and is a new type of single implicit layer feedforward neural network. The ELM consists of three parts: input layer, hidden layer, and output layer, and the specific structure is shown in Figure 1.

is the value of the connection between the input layer and the hidden layer, which is denoted as
is the value of the connection between the hidden layer and the output layer, which is denoted aswhere is the number of neurons in the input layer, is the number of neurons in the hidden layer, and is the number of neurons in the output layer.
The threshold of the implicit layer neuron is expressed as
Let be the hidden layer neuron excitation function, and the output can be expressed as the following equation:
Equation (22) can also be expressed as , where is the transpose matrix of and is the hidden output array, and the expression is
The system of equations’ least squares is
The solution is , where is the Moore–Penrose generalized inverse of the output matrix of the hidden layer.
4.1. Extraction of Fractal Characteristic Features in Bearing Signals
Statistical physics can be used to obtain a partition function for the set of probabilities of regular and irregular fractals, which uses the power of the probability as a weight to distinguish subsets with different sizes of probabilities. In some cases, the collocation function has scale invariance of the form over a certain range (and over an infinite range in the case of regular fractals). Figure 2 shows the sketch of the multifractal spectrum used to extract features, Figure 3 shows the sketch of the Hurst exponents, and Figure 4 shows the sketch of the singularity power spectrum.



The generalized fractal dimension can be obtained from , and the multifractal spectrum is obtained by the Legendre transform. and of the multifractal spectrum reflect the properties of the smallest subset of probabilities. and reflect the properties of the largest subset of probabilities. reflects the properties of the most contingent subset. The width of the multifractal spectrum reflects the size of the probability distribution range.
The more inhomogeneous the probability distribution is, the wider the corresponding curve is and therefore the larger the value becomes. In the practical problem of bearing fault signals such as having irregular distribution in space, the quantitative characterization of the inhomogeneous distribution of physical quantities can be achieved with the specific parameters of the multiple fractal spectrum, so it is suitable for the feature extraction of different kinds of bearing faults and different degrees of damage to achieve the diagnosis and identification of faults.
The value of the weighting factor is also worth discussing when calculating the multifractal spectrum of the measured signal. When calculating the multifractal spectrum according to the multifractal detrended fluctuation analysis, different values of are used to divide the signal into regions with different levels for the study. Theoretically, the larger the range of is , the more complete the representation of the multifractal spectrum, but in practice, as increases, the computational effort increases exponentially, and the increase of to a certain level will inevitably cause computer overflow errors. However, if the range of is too small, the change of is still large for every increasement of . The obtained is only a part of the multifractal spectrum, which cannot fully reflect the probability distribution of the object of study. In general, when increases to a certain degree, regardless of the degree of a signal probability distribution, and basically do not change with the increase of , and the range of can be cut off. In the calculation of the stochastic fractal, a criterion for determining can be chosen based on this property, and a coefficient for the variation of with the value of can be defined as follows:
When is infinite, , . In the actual calculation, a value slightly larger than 0 is chosen as a criterion, for example, .
In the fractal analysis of the actual faulty bearing signal, the data are divided into segments of equal length, and each segment contains 2048 points. The efficiency can be ensured when , and without losing the multiple fractal spectrum features, the spectral parameters are characterized comprehensively.
Mandelbrot in his book “Fractals: Form, Chance, and Dimension” [15] gives a portrayal of the Hurst phenomenon, that is, river flow in which a larger-than-average flood may follow a larger flood and then, suddenly, a below-average flood, followed by more below-average floods; the whole process like there is a cycle, but not for the periodicity, and the standard statistical analysis also reveals significant correlations between observations. To separate the fractal time series from other time series, the Hurst exponent was therefore constructed to estimate the fractal dimension of the time series and the degree of series persistence or inverse persistence. When calculating the Hurst indices for measured bearing failure signals, the Hurst indices for different failures and signals with different fault levels also reflect different distributions, and the Hurst indices corresponding to different values are taken for feature extraction when the value of is given. The features taken in this paper are included in Table 1.
5. Applications
5.1. Dataset Introduction
The bearing signals used in the experiments were obtained from the Bearing Data Center at the Electrical Laboratory of Case Western Reserve University, USA [18, 19]. The experimental platform consists of a 1.5 kW (2 hp) electric motor (electric motor in the left side of Figure 5), a torque sensor/translator (torque transducer and encoder in the middle of Figure 5), and a power test meter (the dynamometer in Figure 5). The rotational speed is 1772 r/min, the sampling frequency is 12 kHz, and the sampling time is 10 s. Single point faults were introduced to the test bearings using electrodischarge machining with fault diameters of 7 mils, 14 mils, 21 mils, 28 mils, and 40 mils (1 mil = 0.001 inches). SKF bearings were used for the 7, 14, and 21 mils diameter faults, and NTN equivalent bearings were used for the 28 mils and 40 mils faults. All experimental data use drive-end acceleration data, which are called vibration acceleration signals. The validity of the proposed method in this paper is verified by selecting the data of the normal state shown in Figure 6(a) and 6 fault states. The fault state data are inner-race 7 mils (slight inner-race fault) shown in Figure 6(b), inner-race 28 mils (serious inner-race fault) shown in Figure 6(c), outer-race 7 mils (slight outer-race fault) shown in Figure 6(d), outer-race 28 mils (serious outer-race fault) shown in Figure 6(e), rolling elements 7 mils (slight ball fault) shown in Figure 6(f), and rolling elements 28 mils fault (serious ball fault) shown in Figure 6(g). Because of the inconsistent length of the bearing signals, each of the data shown in Figure 6 is only for a sampling time of 5 seconds. Each type of data is divided into 50 equal segments with 2048 points per segment. We have 7 kinds of signals in different states. The validation acceleration data of each state are divided into 50 sets, and we can get 350 sets. During the analysis of the MFDFA-1 algorithm for all the selected bearing signals, the subinterval segments are taken as 12 values equally spaced and rounded between 10 and 128, and the values of are taken as 31 values equally spaced between according to the rule of equation (24). The data storage format, the feature extraction algorithms utilized, and the classifier application process all run on the MATLAB2020a platform. In the singularity power spectrum algorithm analysis process, the selection of subintervals is to distinguish the singularity exponent into 50 equally spaced segments. The time-domain waveform of the vibration signal of each state bearing is shown in Figure 6.


(a)

(b)

(c)

(d)

(e)

(f)

(g)
5.2. Steps of Bearing Fault Diagnosis
To ensure the reliability of the experimental results, the bearing signals in the database are tested with the same type of bearings and the same operating conditions except for the degree of damage. The flowchart of the proposed algorithm is shown in Figure 7. Step 1: the MFDFA and SPS algorithms are used to process the normal bearing signals and the bearing fault signals with different damage faults and different damage degrees to obtain the corresponding multiple fractal spectrum and SPS spectrum Step 2: feature extraction using the six specific features mentioned in the above section Step 3: the ELM classifier is used to classify the bearings in seven states, and multiple classification tools are taken to compare the classification effects

5.3. Experimental Results
First, the MFDFA and SPS algorithms were used to analyze the bearing signals in seven states, with ten samples for each state, and the corresponding multifractal spectrum and SPS spectrum were obtained, as shown in Figures 8–10.



Figure 8 shows multifractal spectra of seven signals based on MFDFA-1, where MFDFA-1 means the order of the polynomial of MFDFA is 1. It can be seen that the multifractal spectral lines in different states are hump-shaped and the spectrum widths and the location of the spectrum vertices of the corresponding multifractal spectra are different from each other. The location of the top of the multifractal spectrum of the normal bearing signal is in the rightmost part of Figure 8, and the length of the spectrum width is the shortest. The locations of the top of the spectra of the rest of faulty bearing signals are shifted to the left side of the spectrum of the normal signal. The spectrum widths of the rest of the faulty bearing signals are wider than the one of the normal signals, which indicates that the internal probability density of the faulty bearing signal has changed to different degrees.
The multifractal spectrum width of the normal bearing vibration acceleration signal is narrower than that of the faulty bearing signal, which indicates that the distribution of the probability of the amplitude of the normal operation signal is more even. An increase in the multifractal spectrum width usually implies an increase in the irregular properties within the signal. The purple solid line corresponds to a larger value of for slight outer-race fault signal, indicating that the signal has a larger number of subsets with small probability, and the value is the smallest, which corresponds to the subsets with maximum probability. Compared with the serious outer-race fault (olive green dotted line), the two multifractal spectra are in the shape of right hook, which indicates that the signal of the outer-race fault signal is predominated by the subset with small probability. The positions of the two signals are different, indicating that the most probable subsets are different. The spectrum widths of the two signals are similar to each other, which means that the probability density distributions of the corresponding signals are similar to each other, and the difference lies in the range of probability distributions of the two kinds of signals.
The values of corresponding to slight ball fault (blue-green dotted line) and serious ball fault (red solid line) are at a large level. The spectrum widths of the spectra of the two kinds of signals are smaller than those of the other faults, indicating that the probability distribution of the ball fault signal is more uniform than the probability distributions of the other faults. The two spectrum lines are in the shape of left hook, which means that the large probability subsets dominate the signal.
The spectrum lines of slight inner-race fault (red solid line) and serious inner-race fault (blue solid line) are both left hook-shaped, and the positions of two curves are also close to each other. The values of and of serious inner-race fault signal are smaller than those of slight inner-race fault signal. This means that the number of both the small and large probability subsets of the serious inner-race fault signal is smaller. The value of for serious inner-race fault signal is larger than that of slight inner-race fault signal, so the probability density distribution of serious inner-race fault signal is more uneven than that of slight inner-race fault signal.
By comparing the multifractal spectra of signals with the same fault and different degrees of damage, it can be found that the multifractal spectra of signals with different degrees of damage for the same fault have similar shapes (left hook or right hook), but the difference lies in the difference in the uniformity of the probability density distribution (the size of value).
Figure 9 shows the -order Hurst exponent calculated by MFDFA-1. It can be seen in Figure 9 that when is changing from −5 to 5 by increasement of , the curve of for the bearing vibration acceleration signal in the normal state is different from the line of for the overall faulty bearing vibration acceleration signal distribution, indicating that once the bearing fails, the internal tortuous state of the signal will change. The -order Hurst exponent of serious bearing fault signal is generally larger than that of slight bearing faults. for the normal state is calculated in the range of ; it is uniformly distributed. The distinction is evident when comparing with the -order Hurst exponent values of faulty signals. When , the values for the signals of serious inner-race fault, serious outer-race fault, and serious rolling element fault are greater than the values of the corresponding slight fault signals. When , as increases, there appear noticeable differences of the values for signals of slight inner-race fault, slight outer-race fault, and slight ball fault. Therefore, the two points at the endpoints with the points in the middle can be chosen as the feature values to distinguish the degree of fault for a given value of . In the range , the values for the signals of the serious outer-race fault and the serious inner-race fault are more distinguishable from the rest of the fault signals, and the curves of the values for the signals of the two ball fault signals are difficult to be distinguished from the curves of values for the slight outer-race fault and the slight inner-race fault in this interval. However, with the increase of value, the values for the signal of slight ball fault do not drop as rapidly as the values for the rest of the fault signals. The curve of slight ball fault signal is distinguishable from that of the serious ball fault in the interval . Similarly, as the value of increases, the difference between curves of the slight outer-race fault signal and serious outer-race fault signal is almost the same except in the range . Conversely, the curves of signals of the two inner-race faults and the serious outer-race faults in the range are difficult to be distinguished from each other. The difference between the curves of the signals of the two kinds of inner-race faults is greater in the range than that in the interval . When , reflects the degree of fluctuation of the major trend within the signal, and when , reflects the degree of fluctuation of the minor trend within the signal. As a result, it can be obtained that the degree of fluctuation of minor trends within the signal of serious degree of bearing fault is usually greater than that of the signal of slight degree of bearing fault when .
Figure 10 shows the singularity power spectra of seven signals based on the SPS method. In Figure 10, the characteristics of the SPS spectra of the faulty bearing signals in different states are not the same and the power spectra at specific different singularity values changed to a larger extent due to the shift in the distribution range of the singularity values. The normal bearing signal has a large singularity exponent distribution and a uniform power distribution. The distribution range of the singularity exponents of the faulty bearing signal changes, and the power distribution becomes uneven compared to the normal state. According to the occurrence of such a phenomenon, extracting the features with a large degree of variation can effectively distinguish the bearing fault and the degree of failure. The specific distribution of the 5 feature parameters of the 70 samples (all training sets consisted of 10 sets of each state, so we get 70 sets for training) calculated by MFDFA-1 and calculated by using the SPS method are shown in Figures 10–15. The horizontal coordination denotes the number index of the sample signal. The vertical coordination denotes the feature parameter chosen for classification. The values of the six specific characteristic parameters are shown in Table 2.





In Figure 11, the normal state signal has the largest value of and the value of the signal for the occurrence of faults is smaller than the value of the normal state. Similarly, the values of the same faults at different degrees present a larger value for the signal with serious degree faults than for the signal with slight degree faults. This indicates that the two parameters and can be used as characteristic parameters to distinguish the normal state from fault states and also provides some possibilities to distinguish the degree of the fault as well.
In Figure 12, the distribution of values for normal signals is at a low level, accompanied by an increase in values in the case of faults, while the values for faults of the same type but of different degrees are larger for serious faults than for slight faults.
In Figure 13, when , the values of the -order Hurst exponent for different degrees of the same fault start to be distinguishable, especially for serious and slight faults in the inner-race as well as for serious and slight faults in the outer-race. Therefore, the -order Hurst exponent value at is sensitive to the degree of inner- and outer-race faults and is suitable for extraction as a feature parameter.
In Figure 14, when , corresponding to serious rolling element faults can be distinguished as a whole from those corresponding to slight rolling element faults. It also shows good differentiation between serious outer-race faults and slight outer-race faults.
In Figure 15, when , for the signal of rolling element fault is relatively larger than those for the inner-race fault and outer-race fault values, but it is less sensitive to the degree of damage of the fault.
The singularity power spectra of the seven signals also exhibit distinguishability between normal and fault states. In Figure 10, the distribution of singularity exponents for the normal state lies in the rightmost of the figure. The distribution of the singularity power of the signal in the normal state is flat compared to the singularity power of the signal in the faulty state. Among them, in the distribution of , the serious inner-race fault signal lies in the leftmost of the figure, and the serious outer-race fault signal has the largest power extrema in its distribution range of . The values of the singularity power of slight inner-race fault signal are overall smaller than those of severe degree inner-race fault signal. However, the singularity power spectra cannot be separated from each other evidently.
In Figure 16, the distribution of for the seven states of the bearing signal is depicted. The distribution of for serious outer-race fault signal is the widest; comparing to slight outer-race fault signal, there are crossovers existed in some samples, but the majority of samples have greater values than slight degrees of outer-race faults. distinguishes the degree of two inner-race fault signals, and the distributions of both feature lines are distinguished evidently in Figure 16. Since the singularity power spectrum distribution of ball fault signal is stable, there is only a minor difference between the feature lines of serious ball fault signal and slight ball fault signal, but is more stable for the samples.

Some valuable patterns can be drawn from these plots; the magnitude of the spectrum vertex position usually increases as the degree of damage increases for the same fault type. The multifractal spectrum width is more likely to increase for severely damaged faults than for slightly damaged faults. Except for rolling element faults, the SPS summation feature values for severely damaged inner and outer-race faults are usually larger than those for slightly damaged inner- and outer-race faults. The above analysis proves that the six characteristics selected in this paper can effectively quantify the types of bearing faults and the degrees of damage.
According to the representation law of these six features, the six parameters are composed into a vector to represent the different bearing state characteristics.
For classification of types using ELM, the number of hidden neurons is 50 and the transfer function is sigmoidal function. The training set consisting of 70 samples is fed into the ELM classifier, and the remaining samples after segmenting the data are used as the test set. The ELM classification results are shown in Figure 17. Similarly, the training set is trained using SVM and LSSVM tools, respectively, and the same is verified for the test set. The classification results obtained are shown in Figures 18 and 19. In the multiclassification problem of bearing faults using SVM, Support Vector Machine for Regression (SVR) is chosen. The kernel function is chosen as radial basis function with a gamma parameter coefficient of 0.05. For classification of fault signals using LSSVM, the optimization function is chosen as simplex, and the training and test sets are classified using a cross-validation method.



As can be seen in Figures 20 and 21, we have 7 kinds of signals in different states; the validation acceleration data of each state are divided into 50 sets, so we can get 350 sets. All training sets consisted of 10 sets of each state, so we get 70 sets for training, and the number of total test sets is 350. The average runtime and correct recognition rates of five methods are shown in Table 3. The diagnostic results of the five classification tools illustrate that this feature extraction method can distinguish different kinds of bearing fault signals with different faults. Among the ELM classifier results, there are no errors in the test set except for only one data belonging to the serious rolling element fault in the test set is classified into the slight rolling element fault with a correct rate of 99.25%. In the results obtained by the MFDFA-SPS-based detection method with SVM classification, the highest probability of errors was found for ball faults, with an overall correct rate of 95%. In the LSSVM classification results, the correct rate also reached 95.71% due to the excessive number of parameters that needed to be preset and did not have optimization.


Therefore, based on the overall classification correctness, it can be obtained that the MFDFA-SPS-based feature extraction method can be applied to the fault diagnosis of bearings, and the classification method can be further optimized to achieve better classification results.
6. Conclusions
Based on the above study, the following conclusions can be drawn:(1)By applying the bearing faults from the Case Western Reserve database, the bearing fault diagnosis method combining MFDFA with SPS and ELM can be applied to diagnose different faults as well as faults with different damage levels(2)The MFDFA-SPS fault feature extraction method based on ELM does not require preprocessing of fault data and is suitable for real-time monitoring of the operation status of bearing faults
If only SPS is used to extract the specific feature parameters of the faulty bearing because the distribution range of the singularity values of different degrees of the faulty signal is different (except for some serious faulty signals), it will cause the stacking of different signal spectrum lines in different intervals of distribution, and the intuitive spectrum line discrimination is not strong only from the graph, so we need to combine other feature extraction methods and choose a more effective classifier for classification to achieve a better classification effect on the rolling bearing signal.
The proposed method has 3 advantages. First, the proposed method has no preprocessing procedure for the sampled data, which substantially reduces the complexity of the overall diagnostic process. Second, the feature extraction time using the MFDFA-SPS algorithm is short, which improves the efficiency of online fault diagnosis. The various bearing signal data used in this paper have only 2048 points, and the need for fault diagnosis can be achieved in a short time. Finally, the effect of classification can reach more than 99%, which ensures the reliability of the diagnosis method. Therefore, the proposed method is suitable for application to practical engineering problems.
With the approach proposed in the above section, we can still achieve good diagnostic results based on the Case Western Reserve bearing database using a smaller number of training set. The future direction of research is to test the sensitivity of this method to the feature extraction and fault diagnosis effect of different bearing vibration signals, as well as to realize the composite fault diagnosis of different faults superimposed.
Data Availability
The acceleration data of the bearings used to support the findings of this study have been deposited on the Case Western Reserve University Bearing Data Center website (https://engineering.case.edu/bearingdatacenter/).
Conflicts of Interest
The authors declare that they have no conflicts of interest.
Acknowledgments
This work was sponsored by the National Natural Science Foundation of China, under Grant 61901195, and the School Research Start-Up Foundation for on-the-job doctoral training, under Grant 1032931804.