Abstract

The accuracy of underwater sound source localization depends on the preprocessing effect of signal denoising and delay estimation performance. In order to meet the needs of high-precision underwater positioning, this article studies a time delay estimation method based on nonlocal mean-enforced mean empirical mode decomposition (NLM-EEMD) autocorrelation denoising preprocessing and fuzzy C-means clustering (FCM)-generalized quadratic for underwater sound source localization technology. The theories such as NLM, EEMD, FCM, and generalized quadratic correlation delay estimation are studied in detail, and the speech signal of the test library in the TIMIT standard library is selected for simulation analysis, which verifies the correctness of the delay estimation method. Experiments were carried out in a larger outdoor pool. The analysis results show that the NLM-EEMD adaptive denoising method effectively reduces noise interference and improves the quality of EEMD decomposition and the FCM-generalized quadratic correlation method increases the correlation between two signals. The time delay estimation accuracy is improved, and the positioning accuracy is 48.66% higher than that of the direct generalized cross-correlation method.

1. Introduction

Underwater sound source localization has always been the focus and hot issue of hydroacoustic research and is widely used in military and civil fields such as ocean exploration and development, coastal defense, etc. Sound source localization techniques based on acoustic sensor arrays can be divided into three categories: localization methods based on maximum controllable response power beamforming, localization methods based on high-resolution spectral estimation, and localization methods based on TDOA. Carter first proposed the sound source location method of maximum controllable response power beamforming in 1977 [1] and later generations continue to optimize and improve it. Dibiase Joseph Hector proposed the SPR-PHAT method [2]; Wan and Wu proposed the SRP method [3] based on the principal eigenvector; and Do and Silverman and Lima et al. proposed an improved search method from coarse to fine, which greatly reduced the amount of computation [4, 5]. However, this location method requires a priori knowledge of the sound source and environmental noise, it is sensitive to the initial search value, and it not only reduces the amount of calculation but also reduces the accuracy of estimation [6]. The spatial resolution of the positioning method based on high-resolution spectral estimation is not limited by the signal sampling frequency, and arbitrary positioning accuracy can be achieved under certain conditions. However, this method relies on searching the entire space to determine the location of the sound source, which is more expensive and complex than other algorithms [7, 8]. This method is generally not used in single sound source positioning. Compared with the first two methods, the positioning technology based on TDOA is the simplest one, with high positioning accuracy, less computation, and good real-time performance. It can achieve good results in the application of single sound source positioning [9].

Time delay estimation accuracy is the decisive factor to determine TDOA positioning accuracy. Preprocessing underwater acoustic signals such as denoising or optimizing time delay estimation algorithms are effective methods to improve time delay estimation accuracy and target positioning accuracy. Knapp et al. and Mosayyebpour et al. estimate time delay based on generalized cross-correlation, cross power spectrum phase, quadratic correlation, adaptive least mean square (LMS), acoustic transfer function, subspace decomposition, and cepstrum filtering methods [1016]. Jiafan Yin and Lu Yang et al. used the NLM method to denoise the human ECG signal and eddy current sensor signal, respectively [17, 18]. Lu, Liu, and Zhang et al. used the EEMD method to denoise the microseismic signal, pulse signal, and blasting vibration signal of underground mines, respectively [1921]. Li, Yu, and Su et al. combined the FCM method to process pipeline leakage signal, EEG signal, and micro-Doppler radar intermediate frequency signal, respectively [2224].

Based on the above research, this article proposes an underwater sound source location method based on NLM-EEMD and FCM-generalized quadratic correlation. First, NLM-EEMD autocorrelation denoising is performed on the collected signal, which makes up for the unsatisfactory denoising effect of EEMD on low SNR signals. Then FCM fuzzy clustering is performed to adaptively extract the effective segment of the signal, making the correlation peak after generalized quadratic correlation time delay estimation more prominent; thus, the accuracy of sound source location is improved [22].

2. NLM-EEMD Signal Denoising Algorithm

2.1. NLM Algorithm

Let the signal model be expressed as , where is the sound sensor that actually collects signals, is valid signal, and is noise signal.

For one-dimensional signals , denoising processing [18, 2530] based on NLM is to eliminate the interference of noise , it is the process of recovering valid signal , and is the weighted average of finding all similar blocks in the entire search area [31, 32], the weighted mean can be obtained bywhere represents the sum of the similarity degrees of all similar blocks, that is, the normalization factor, and the calculation equation is as follows:where is the weight, and the calculation equation is as follows:where represents the sum of the squares of the Euclidean distance of the neighborhood blocks of the two sample points and , controls the decay speed of , the value is , represents the neighborhood block centered on the sample point, represents the neighborhood block centered on the sample point, represents the bandwidth parameter, and its value is expressed as according to the SURE criterion, where is the signal variance.

The weight depends on the similarity between the sample points and and satisfies the following conditions:

The parameter relationship diagram of the NLM algorithm is shown in Figure 1. N represents half of the search area , and the parameters , , and jointly determine the final result of the filtering.

2.2. EEMD Algorithm

Empirical mode decomposition (EMD) is essentially a method of decomposing the time domain signal according to the frequency domain scale, producing a series of intrinsic mode function (IMF) components with different characteristics and a residual component; it is different from variational modal decomposition (VMD) [33, 34]. But EMD decomposition may cause modal aliasing problems and endpoint effects, so the EEMD decomposition method has been developed. The steps of the EEMD algorithm [21, 22] are as follows [3539]:(1)Gaussian white noise is added to the target signal:(2)The noise-added signal is decomposed by EMD to obtain multiple groups of IMF components , where , is the decomposition scale.(3)The average value of each group of IMF components is calculated as the final value of EEMD decomposition:

The screening process of EEMD is adaptive. In the decomposition process, the Gaussian white noise signal is added, and the Gaussian white noise can be used to cancel each other when multiple trials are superimposed, and the abnormal events in the original signal are “smoothed.” Therefore, it is a noise-assisted data analysis method, which solves the modal aliasing problem that may occur in EMD and makes the denoising effect better and the final sound source localization accuracy higher. The EEMD decomposition flow chart is shown in Figure 2.

2.3. Combination of NLM and EEMD to Reduce Noise

The signal is first preprocessed by NLM denoising, which can improve the quality of EMD decomposition. To avoid modal aliasing problems and end effects, the signal is decomposed using the EEMD method. EEMD decomposes the signal into a set of IMF components with frequencies from high to low, removes some IMF components that do not meet the threshold condition, and adaptively reconstructs other remaining effective IMF components to obtain the final denoised signal.

Adaptive signal reconstruction [40, 41] is to select two evaluation indicators—correlation coefficient and root mean square error—that determine the effective IMF component by calculation and reconstructing the signal. The steps are as follows:(1)The correlation coefficient between each IMF component and the original signal is calculated separately, and the calculation equation is as follows. In the equation, represents the original signal, represents the sampling point of the signal, and represents the IMF component decomposed ; the larger the , the higher the correlation.(2)The root mean square error of each IMF component and the original signal is calculated separately, and the calculation equation is as follows. The smaller the RMSE, the closer it is to the original signal.(3)Calculating the adaptive threshold and , the equation is as follows. In the equation, represents the number of IMF components, and the IMF components whose threshold satisfies the condition of Equation (10) are selected to reconstruct the signal.

The flow chart of EEMD signal reconstruction is shown in Figure 3.

2.4. Simulation

The voice signal of the test library numbered FDAC1-SX304.WAV in the TIMIT standard library is selected, Gaussian white noise is added to make SNR = 0, the signal sampling frequency is set as 16000 Hz, and time-frequency domain analysis and NLM-EEMD noise reduction processing on the signal are performed. Time domain diagram, spectrogram, time spectrum, and Hilbert spectrum of the original signal are drawn in turn, as shown in Figure 4.

Adding Gaussian white noise to the original signal to make SNR = 0, and the original signal with noise is shown in Figure 5. The noisy original signal is first preprocessed by NLM, and the result is shown in Figure 6; then EEMD is decomposed, and the decomposed IMF components and the remaining components and the corresponding spectrum are shown in Figure 7; the IMF components are distinguished according to frequency, it is divided into high-frequency component, low-frequency component, and trend quantity. The time domain diagram of each component is shown in Figure 8.

The correlation coefficient and root mean square error of each IMF component and residual component and the original signal are calculated, and then the adaptive threshold of 0.2692 and of 0.0074 are obtained. The IMF components with correlation coefficient greater than 0.2692 and root mean square error less than 0.0074 are selected for reconstruction, and the adaptive reconstruction process is shown in Figures 9 and 10. According to the result of adaptive reconstruction, the components IMF1, IMF2, and IMF3 are selected for reconstruction. Figure 11 shows the comparison diagram of the signal before decomposition and the reconstructed signal and the reconstruction error diagram.

Figure 12 shows the comparison of original signal with the original signal with noise and the signal processed by NLM-EEMD; it can be seen that the denoising effect is very obvious, and the original signal characteristics are basically restored.

3. FCM-Generalized Quadratic Correlation Adaptive Delay Estimation Algorithm

3.1. FCM Algorithm

FCM algorithm (Fuzzy C-means algorithm) [22] is a clustering algorithm based on flexible fuzzy partition. This method uses the concept of determining the geometric closeness of data points in Euclidean space and obtains the membership degree of each sample point to all class centers by optimizing the objective function, so as to automatically classify the samples.

Supposing the signal point sample is , represents the number of cluster centers, represents the corresponding categories, represents fuzzy classification matrix, and represents the cluster center vector, we use the weighted sum of squared distances from the signal point to each cluster center as the objective function:where m is the weighting parameter and and are the membership degree and Euclidean distance from the signal point sample to the cluster center , respectively.

The essence of the FCM algorithm is to repeatedly modify and through iteration. When the algorithm converges, theoretically, various cluster centers and the membership degrees of each sample to each pattern class are obtained, thus completing the fuzzy clustering division.

3.2. Generalized Quadratic Correlation Delay Estimation Algorithm

It is assumed that the acoustic signals collected by the two acoustic sensors are shown in Equation (11). In the equation, represents the original sound source signal, and represent the attenuation factors of the two signals, and are noise, and represents the time delay difference between the sound source reaching the signals collected by the two sound sensors.

The simplest time delay estimation is to directly perform cross-correlation processing on the two signals and , as shown in Equation (12).

Substituting (11) into Equation (13) to get:

Because , , and are not related to each other, (13) can be further simplified:where represents the autocorrelation function of the sound source , and is the cross-correlation function of the two noises and . If and are uncorrelated noises, (14) can be further simplified as: .

According to the properties of the correlation function, when takes the maximum value, is the time delay difference between the sound source signal reaching the two acoustic sensors.

The generalized cross-correlation algorithm [4244] transforms the cross-correlation function into the frequency domain and then weights the cross power spectrum to achieve the effect of suppressing noise, and then inversely transforms it to the time domain to obtain the generalized cross-correlation function. As shown in (15), refers to the cross power spectrum of the two collected signals, is the weighting function.

Common weighting functions and their characteristics [4549] are shown in Table 1.

The principle block diagram of the generalized cross-correlation delay estimation algorithm is shown in Figure 13.

The quadratic correlation algorithm [5052] is also a common time delay estimation method. It performs a correlation operation on the autocorrelation function and the cross-correlation function of the two time domain signals to improve the time delay estimation accuracy.

Ignoring the cross-correlation function of signal and noise, (16) can be simplified aswhere represents pure signal for secondary correlation and represents noise for secondary autocorrelation. Compared with the primary correlation, the secondary correlation reduces the influence of noise on the signal and can estimate the delay value more accurately in the low signal-to-noise ratio environment.

The block diagram of the quadratic correlation delay estimation is shown in Figure 14.

In this article, the advantages of generalized cross-correlation and quadratic correlation are combined for time delay estimation, that is, generalized quadratic correlation time delay estimation [50]. The principle block diagram is shown in Figure 15. The weighting function of generalized cross-correlation chooses SCOT weighting function.

3.3. Combination of Delay Estimation of FCM and Generalized Quadratic Correlation

This article uses the FCM algorithm to adaptively extract the effective segment signal and then performs generalized quadratic correlation processing, which can sharpen the correlation peak and improve the accuracy of delay estimation. The steps are as follows [22]:(1)In the FCM algorithm, the weighting parameter is set, and the number of cluster centers is . The acoustic signal used for underwater positioning studied in this article is the shock signal, so it is divided into three categories: the front part of the shock signal, the shock signal generation segment, and the post-impact signal segment.(2)Using FCM to process the acquired signal, the effective segment signal point samples are extracted and reconstructed, and the shock generation segment signal is obtained.(3)For the acoustic signals received by other acoustic sensors, the same processing methods of steps (1) to (2) are adopted. In this way, each acoustic sensor corresponds to a signal of an impact generation segment, and the minimum value of the start point and the maximum value of the end point of these signals are taken as the final start point and end point, respectively, that is, the segment to is intercepted from the received signal of the acoustic sensor. The signal is processed as the final pending delay estimation.(4)The generalized quadratic correlation delay estimation is performed to obtain the delay difference between the signals.

4. Experiment and Performance Analysis

4.1. The Principle and Process of Underwater Sound Source Localization

In this article, the ternary acoustic sensor array is used to collect the underwater sound source signal, and the time delay between the signal processing and the signal is estimated, and finally the underwater sound source position is calculated by using the time delay difference. The schematic diagram of the experimental site is shown in Figure 16.

The azimuth distance equation used by the ternary acoustic sensor array for sound source localization is as follows:where is the azimuth of the explosion point to be estimated; is the distance of the explosion point to be estimated; is the speed of sound; is the distance between the array elements; and , , and represent the time delay difference of the three-channel signals, respectively.

4.2. Experimental System Construction

The experimental device is built as shown in Figure 16, and the sound source localization experiment is conducted in a large outdoor pool. The underwater sound source signal is generated by throwing heavy objects at different angles and distances relative to the ternary array. The distance between the elements of the ternary acoustic sensor array is 10 m and arranged in a straight line. The sensitivity of the acoustic sensor is less than −200 dB when detecting the sound source in the frequency range of 10 Hz–50000 Hz, the sampling frequency of the acquisition instrument is 128k Hz, and each acquisition time is set to 10 s (Figures 17 and 18).

4.3. Experimental Signal Processing

The collected acoustic signals at different locations are first analyzed in the time-frequency domain, and then NLM preprocessing, EEMD decomposition, and adaptive reconstruction are performed to obtain denoised signals. Finally, combined with the FCM algorithm, based on the generalized quadratic correlation delay estimation, the algorithm estimates the time delay of the three-channel signal in pairs and then solves the sound source position.

Taking the acoustic signal collected in an experiment as an example, the above analysis and processing are carried out. The time domain diagram and spectrogram and time spectrum of the signals collected by the three hydrophones are shown in Figures 19 and 20, respectively.

The comparison between the original signal of the three-channel signal and the NLM preprocessing results is shown in Figure 21, the NLM-EEMD denoising results are shown in Figure 22, and the signal adaptive reconstruction process is shown in Figure 23. The comparison between the reconstructed signal and the NLM processed signal is shown in Figure 24, and the reconstruction error is shown in Figure 25. It can be seen that the NLM preprocessing has an ideal de-drying effect. In the process of EEMD signal decomposition and reconstruction, the three signals are decomposed into 10 IMF components, adaptive threshold is 0.2291, is 0.0019, is 0.2954, is 0.0072, is 0.2872, is 0.0026, the first channel signal selects the 2nd, 3rd, 8th, 9th, and 10th components for reconstruction, the second channel signal selects the 3rd, 4th, and 8th components for reconstruction, the third channel signal selects the 2nd, 3rd, 4th, 8th, 9th, and 10th for reconstruction, and the original signal is basically restored, and the reconstruction error meets the requirements.

Figure 26 shows the clustering diagram and the change diagram of the clustering objective function after the three-channel signal is processed by FCM. Finally, the generalized quadratic correlation delay estimation is performed as shown in Figure 27.

4.4. Algorithm Verification and Analysis

In order to further verify the superiority of the algorithm in this article for underwater sound source localization, three different methods of direct cross-correlation method, generalized cross-correlation method, and generalized quadratic correlation method based on NLM-EEMD-FCM were used for the acoustic signals of the experiments in the previous section. The time delay estimation algorithm is used to calculate the sound source position. The delay estimation diagrams of direct cross-correlation and generalized cross-correlation are shown in Figures 28 and 29, respectively.

The positioning results of the three delay estimation methods are shown in Table 2.

It can be seen from Table 2 that the error is extremely large when using direct cross-correlation, which is basically invalid; for this group of collected signals, after the signal is processed by NLM-EEMD-FCM, the azimuth error is slightly improved, but compared with the distance error reduced by nearly 1.5 meters, it is negligible, so in general, the generalized quadratic correlation delay estimation method based on NLM-EEMD-FCM sharpens the peak value, improves the delay estimation accuracy, and reduces the underwater sound source localization error.

5. Conclusions

In order to improve the accuracy of underwater sound source location, an underwater sound source location method based on nonlocal mean overall average empirical mode decomposition (NLM-EEMD) autocorrelation denoising preprocessing and fuzzy C-means clustering (FCM)-generalized quadratic correlation time delay estimation is studied in this article. The experiment was carried out in a large outdoor pool. Through the analysis of the collected signal data, it was found that the NLM-EEMD adaptive denoising method has good effect, which improves the decomposition quality of EEMD. The FCM-generalized quadratic correlation method increases the correlation between two signals, sharpens the correlation peak, and improves the accuracy of time delay estimation. Taking the signal data analyzed above as an example, when the sound source distance is about 85 meters, the sound source location error based on NLM-EEMD and FCM-generalized quadratic correlation method is only , which is less than that of direct generalized cross-correlation method. This article provides a new idea to improve the accuracy of time delay estimation and also provides a certain reference for the research of high-precision positioning of underwater sound sources.

Data Availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this article.

Acknowledgments

This study was supported by the Naval University of Engineering, PLA Scientific Research Development Fund Self-Establishment Program (grant no. 425317S091).