Abstract
Modern cognitive electronic reconnaissance methods for radar systems must contend with the complex electromagnetic environments arising from the deployment of multiple signal sources and radar countermeasures, which greatly limit access to the degree of prior information required to enable effective target recognition. The present work addresses this issue by proposing a multiperspective collaborative clustering method for sorting radiation sources based on the multiperspective information of radar signals. In contrast to conventional collaborative training approaches, which are suitable only for semisupervised learning, the proposed multiperspective collaborative clustering method performs unsupervised clustering, cluster label transfer, and dimensionality reduction by linear discriminant analysis iteratively based on the differences between the clustering results obtained from two signal perspectives radiation signal sorting can be conducted in a noncooperative context. The results of comparative experiments demonstrate that the proposed multiperspective sorting method can make full use of the difference information between basic signal characteristics and intrapulse features and thereby improve the accuracy of clustering-based radiation source sorting. Accordingly, the sorting ability of the proposed method is superior to those of other state-of-the-art clustering methods and that of the single-perspective clustering-based sorting method.
1. Introduction
The rapid development of information technology in recent years has made electronic warfare a key factor in victory at war [1]. In this regard, electronic countermeasures, such as radar countermeasures, have become a principal component of electronic warfare that has been proven to be particularly important in air-to-air combat, where these countermeasures directly affect the completion of combat missions and protect the lives and safety of aircraft pilots. The accurate sorting and recognition of radar signals is a prerequisite for managing the complexity of the electromagnetic environment in the battlefield arising from the use of radar countermeasures. Moreover, these activities provide a reference for subsequent threat assessment and the rational use of jamming resources.
The increasingly complex electromagnetic environment of the battlefield and the continuous development of radar technology have promoted a transition from single-parameter radar signal sorting methods to multiparameter sorting. Moreover, the development of multiparameter sorting methods has transitioned from template matching methods to comprehensive sorting based on clustering in recent years. In 1989, Mardia [2] proposed radar signal sorting based on a sequence search and proposed the cumulative difference (CDIF) histogram algorithm. In 1993, Nelson [3] first proposed the pulse repetition interval (PRI) transform algorithm, which improved the autocorrelation function by increasing the rotation factor and improved the PRI extraction method based on the time of arrival (TOA) sequence of complex autocorrelation integral peaks. In 1992, Chandra and Bajpai [4] proposed a two-stage clustering and sorting algorithm based on pulse description words (PDWs). Expectation-based maximization clustering algorithm [5], K-modes [6], and Density-Based Spatial Clutering of Application with Noise (DBSCAN) [7] are widely used in radar signal sorting.
The basic characteristics of radar signals include the parameters carrier frequency, pulse width, pulse amplitude, pulse angle of arrival, and pulse TOA. These five basic parameters can be directly measured by radar signal receivers. However, the wide application of low intercept technology and parameter agility technology in modern phased array radar systems has increased the pulse density significantly, and the problems associated with pulse loss and pulse overlap have become increasingly serious [8–11]. As a result, conventional radar signal characterization methods based on pulse description parameters suffer from coarse information granularity and insufficient discrimination. These issues make the effective separation of phased array radar signals difficult to achieve using conventional signal sorting methods based on pulse descriptors. This has prompted an urgent need for the development of new signal features. For example, Klein et al. [12, 13] used the cutting and segmentation method to calculate the statistical feature vector of the instantaneous sequence. They selected statistical features, including variance, skewness, and kurtosis, and named this statistical feature vector RF-DNA (replicative form deoxyribo cucleic acid). Subsequently, Dudczyk and Kawalec [14–16] proposed the use of fractal dimensions to extract intrapulse features based on multifractal theory.
The recent development of artificial intelligence technology has promoted the application of machine learning algorithms to the analysis of radar signals. Haykin [17, 18] first proposed the concept of cognitive radar, which was progressively transformed from the artificial cognition process of machines to an autonomous cognition process requiring no human intervention. In some cases, the process for recognizing radar emitter signals has been conducted using unsupervised learning algorithms such as the k-means clustering [19], the C-means clustering [20], the ambiguity function [21], and the density clustering algorithm [22]. However, while these methods are relatively simple and computationally efficient, the final recognition result is not ideal because these algorithms cannot adequately label the signal categories. Other approaches have obtained better signal recognition results by applying machine learning algorithms such as extreme learning machines [23], support vector machines [24], neural networks [25–28], and Bayesian programming [29]. However, the learning process of these algorithms requires a large volume of prior information, which is greatly restricted in the complex electromagnetic environment prevalent on modern battlefields [30]. Moreover, the noncooperative identification process inevitably introduces inaccuracies in the signal recognition results.
This issue has been addressed by the development of the collaborative training method. Here, Blum and Mitchell [31] proposed a standard collaborative training method for conducting machine learning from two sufficiently independent perspectives. This method has been subsequently developed by relaxing some of the assumptions required in the standard method, which has enabled the collaborative training method to be conducted under conditions where the perspectives are not independent of conditions [32, 33]. This has led to the development of multiperspective clustering, which integrates the complementarity, difference, and consistency information of multiple signal features derived for a clustering target. The use of multiperspective clustering methods has been demonstrated to improve the signal sorting ability of modern cognitive electronic reconnaissance systems for radar signals with characteristics obtained from two perspectives, including basic signal characteristics and intrapulse features. However, the presently available collaborative training methods are only applicable for learning multiperspective data with partial labels. Therefore, these methods are only suitable for semisupervised learning, which greatly restricts the level of automation required for conducting cognitive electronic reconnaissance in complex electromagnetic environments. Moreover, multiperspective data suffers from high dimensionality and is therefore subject to common problems, such as the appearance of equidistant data, making the clustering of data problematic [34, 35]. Therefore, dimensionality reduction is necessary. Nonetheless, the nonlinear characteristics of the distributions in the intrapulse feature data of radar signals make the application of the standard principal component dimensionality reduction method problematic.
The present work addresses this issue by proposing a radar signal sorting and recognition method based on a multidimensional feature expression system and a multiperspective clustering method. Inspired by the multiperspective discriminative clustering algorithm, the proposed method also takes into account the characteristics of radar signals and the noncooperative nature of electronic reconnaissance. In addition, a collaborative training framework is adopted, which combines multiple perspectives, including the basic characteristics of radar signals and intrapulse feature data. The convergence of the clustering process is achieved by iteratively conducting unsupervised clustering and cluster label transfer. In addition, we expand the radar signal characteristic system and increase the dimension of radar signal characteristic expression by extracting the instantaneous and statistical characteristics of radar signals. Finally, we apply the kernel principal component analysis (KPCA) method to reduce the dimensionality of the characteristic spectrum of radar signals. Application of the multiperspective clustering method to actual radar signals in a realistically complex electromagnetic environment is demonstrated to improve the ability of cognitive electronic reconnaissance systems to sort radar signals.
2. Composition of Radar Characteristic Spectrum
2.1. Instantaneous Characteristics
Due to the differences in signal modulation methods, unintentional modulations of different radiation sources, and different propagation paths of signals to reconnaissance receivers, the instantaneous characteristics of signals from different radiation sources are significantly different. Therefore, the signals can be distinguished by extracting the instantaneous features of the signals to realize signal sorting.
For a detection signal , and the analytical form of the signal after applying the Hilbert transform is where and are the respective instantaneous in-phase and quadrature components of the signal. Therefore, the instantaneous amplitude , instantaneous phase , and instantaneous frequency of the signal are given as follows.
In practical applications, the impact of environmental noise and receiver equipment tolerances must be reduced. Therefore, a(n), φ(n), and f(n) must be given with respect to their mean values (i.e., centralized). The centralization processing applied for a(n) and f(n) is given as follows.
Here, and are the mean values of the instantaneous amplitude series and instantaneous frequency series, respectively. For , the nonlinear phase distortion caused by the frequency estimation error in the receiver down-conversion process must be removed before conducting centralization. This nonlinear phase distortion can be expressed as where is the sampling interval. Therefore, the centralized instantaneous phase can be obtained on the basis of formula (4) as follows: where is the mean value of . After the centralization process, the instantaneous features are normalized as follows.
However, the instantaneous feature dimensions are relatively high after centralization and normalization, and this will produce feature redundancy if used directly for signal sorting. Therefore, we apply statistical features, including variance, skewness, and kurtosis, as proposed in a previous study [29]. To this end, we assume that A(n), P(n), and F(n) are divided into segments of length . For the -th sequence segment , we calculate its variance , skewness , and kurtosis according to the mean value in a sequence segment as follows.
Therefore, the statistical characteristics of the -th instantaneous feature are
2.2. Bispectral Characteristics
The high-order spectrum of a signal is a k −1 dimensional Fourier transform of the k-th order cumulant of the signal [13, 36]. The high-order spectrum contains the phase and amplitude information of the signal and has the characteristics of time-shift invariance, scale variability, and phase retention. A bispectrum represents the third-order spectrum in the high-order spectrum of a signal, which can be obtained by applying a two-dimensional Fourier transform to the third-order cumulant of the signal. The bispectrum of a signal is particularly useful because it contains phase information that is not available in second-order statistics, such as the power spectrum. Moreover, the use of a bispectrum is more convenient than other high-order spectra because the bispectrum operation can better avoid the common problems associated with high dimensionality. [37, 38]
For a radiation source signal , where is time, its bispectrum is given as where is the third-order cumulant of , w1 and w2 are the signal frequency, respectively, and and are time, respectively. Here, is defined as follows: where is the mathematical expectation and x(t) is the signal transpose.
Two nonparametric methods can be applied for signal bispectrum estimation, which include the indirect method and direct method. Here, we apply the indirect method, and the statistical features and entropy features of the integral bispectrum are calculated by cutting segmentation. The statistical characteristics of the integral bispectrum include variance, skewness, and kurtosis, while the entropy features of the integral bispectrum include permutation entropy and fuzzy entropy.
As discussed above, we assume that the integral bispectrum of the signal is divided into segments. We then calculate the variance, skewness, kurtosis, permutation entropy (), and fuzzy entropy (FuEni) of the -th sequence segment of the integral bispectrum to obtain the following bispectral eigenvector: where is the probability of each sequence and is the signal sequence length.
The feature vector composed of the feature vectors of all segments is
Finally, the signal bispectrum statistical feature set is obtained according to the order of axial integral bispectrum (AIB), circular integral bispectrum (CIB), radial integral bispectrum (RIB), and contour integral bispectrum (SIB), as follows:
The proposed bispectral feature extraction process is presented in Figure 1.

According to Figure 1, the specific steps for extracting the bispectral feature vector of the signal are given as follows: (1)Bispectrum estimation of the radar signal: apply the indirect method to estimate the bispectral plane of the signal(2)Extraction of integral bispectrum: extract the integral bispectral sequence according to the integral path(3)Cutting segmentation and feature extraction of bispectral sequence: divide the integral bispectrum of the signal into segments and calculate the statistical characteristics and entropy characteristics of each bispectrum sequence according to formulas (11)–(17)(4)Signal bispectrum feature set: combine the statistical features and entropy features of the signals into a signal bispectral feature set according to the order AIB, CIB, RIB, and SIB
3. Dimensionality Reduction of Intrapulse Features Based on KPCA
For a single radar pulse signal, the instantaneous feature parameters are 60 dimensions, and the integral bispectral feature parameters are 80 dimensions. Therefore, the intrapulse feature parameters attain a relatively high data dimensionality of 140 dimensions. This can be addressed by noting that the different characteristics in the radar spectrum are not strictly independent and by taking account of the nonlinear distribution of the characteristics in a radar signal pulse. The present work conducts this process by applying the KPCA method.
The basic process of KPCA dimensionality reduction firstly maps the original data to a high-dimensional space through a nonlinear mapping function. Then, linear principal component analysis (PCA) is applied to the high-dimensional space to achieve dimensionality reduction. The specific process is given as follows.
For a dataset , where is the number of features and is the number of data samples, applying a nonlinear mapping transformation maps the data from the original feature space to the high-dimensional feature space F:
The transformed feature vector data in can be expressed as a linear combination of and because the data samples in the high-dimensional feature space are centralized. Therefore, the covariance matrix in the transformed high-dimensional feature space is given as follows.
The fact that the feature vector in can be expressed as a linear combination of enables the application of the inner product rule to obtain the following equation:
Among these terms, is the eigenvector of and is given as , and is the eigenvalue of . Therefore, (16) can be rewritten as
Defining an matrix enables (17) to be simplified as and because is symmetric, (18) can be simplified as
Therefore, and can be solved according to (19).
For the eigenvector , we note that . This, in addition to the given definition of , yields the following.
The dimensionality of the data in is then reduced by calculating the projection of the sample on the feature vector in . This is obtained for sample and its transformation in as follows:
Accordingly, the principal components of the sample can be obtained.
The kernel function employed in KPCA can be any symmetric function satisfying Mercer’s theorem. A number of kernel functions are commonly employed, such as polynomial kernel functions, Gaussian kernel functions, and neural network kernel functions, which are, respectively, given in the following equations.
The present work adopts the Gaussian kernel function in (23) as the mapping function.
4. Multiperspective Collaborative Clustering and Sorting Algorithm
The basic process of the proposed multiperspective collaborative clustering and sorting algorithm is presented in Figure 2, and Algorithm 1 presents its detailed processing steps.
|

The proposed multiperspective collaborative clustering and sorting method mainly offers the following important advantages: (1)The preprocessing stage focuses on the basic characteristics of the signal and the intrapulse feature data, and PCA and KPCA are, respectively, applied for data dimensionality reduction. It should be noted that the basic signal characteristics data employed in the simulation experiments in this article are of generally low dimensionality (i.e., 3 dimensions). Therefore, the dimensionality of this data remains unchanged during the actual processing, and only the dimensionality of the intrapulse feature data is reduced using the KPCA method. This processing enhances the discrimination between the characteristics of different radiation sources because, on the one hand, it can achieve data dimensionality reduction while maintaining the nonlinear characteristics of the data as much as possible. On the other hand, it can increase the impact of core features while reducing the impact of redundant features. In addition, data dimensionality reduction can also reduce the calculational burden in the subsequent clustering process(2)The multiperspective collaborative clustering and sorting method learns data from different perspectives alternately and uses the difference between the learning results of the data from different perspectives to modify the clustering sorting model. Accordingly, the iterative process maximizes the consistency of a given radiation source signal under different perspectives(3)The nonuniform DBSCAN clustering method requires no initially established number of clusters in the dataset in advance, which conforms to the noncooperative characteristics of electronic reconnaissance. At the same time, the clustering effect is enhanced because it can also cluster datasets with uneven pulse densities(4)The Jaccard coefficient is an index based on pairwise measurement and is introduced to determine the similarity between the data distributions associated with the cluster labels derived from the two perspectives. Then, the determined level of similarity is employed as the algorithm termination condition. For the distribution of the two clustering results, the Jaccard coefficient is defined as the ratio of true positive (TP) point pairs to the sum of TP and the false negative (FN) and false positive (FP) point pairs without considering true negative (TN) point pairs. Accordingly, the Jaccard coefficient is expressed as
According to the definition, the similarity between two distributions increases as the value of the Jaccard coefficient increases up to a maximum value of 1, and a better clustering effect is therefore obtained by the algorithm. The Jaccard coefficient threshold representing the clustering termination condition is set herein as 0.99.
5. Simulation Experiments
5.1. Dataset Generation and Experimental Settings
A complex electromagnetic environment simulation system was used to generate basic signal characteristics and intrapulse feature data based on real radiation source signals, and these analog signal characteristics were extracted by a signal detection system. The specific process employed for experimental data generation is presented in Figure 3.

According to the actual conditions encountered by reconnaissance receivers in the field, the number of radiation sources employed in the scene was 19, including 6 types of radiation sources: airborne fire control radar, shipborne fire control radar, surface-to-air missile target indicating radar, ground-to-air missile target indicating radar, air missile guidance radar, early warning aircraft radar, and large ground early warning radar. The different radiation sources were switched between the different signal types according to changes in the combat process. Accordingly, the number of radiation sources in the 6 categories at any given time was no less than 15. The reconnaissance receiving equipment captured 19,331,293 pulse data from the 19 radar radiation sources over an 8 min period of system simulation. The basic characteristic data captured from the radar signals was the carrier frequency (RF), pulse width (PW), TOA, and angle of arrival (AOA), and these characteristics of the received pulse data are plotted in Figures 4–6.



To simplify the discussion presented here, the large amount of collected data was reduced by selecting only that data collected within a 2 min period with the most abundant number and types of pulsed radiation sources for conducting clustering and sorting. The basic signal characteristic data consisted of only the RF, PW, and pulse AOA data of 20 signal segments in the X-band with an azimuth angle of 80°–160°, and the number of pulses in each signal segment was 30,000. In addition, the intrapulse feature data included the instantaneous feature and bispectral feature data extracted from the leading edge of an automatic dependent surveillance–broadcast (ADS-B) signal pulse of a civil aviation aircraft. The extraction time of the pulse front was 0.2 μs, and the instantaneous and bispectral feature data are, respectively, presented in Figures 7 and 8. After feature splicing, the dimensionality of the intrapulse features at the leading edge of each pulse signal was 140 dimensions. The instantaneous features of the signal and the number of segmented cuts of the integral bispectrum were 3 segments, and the variance, kurtosis, skewness, permutation entropy, and fuzzy entropy were extracted for each feature sequence and for the entire feature sequence. The intrapulse feature data was reduced to 10 dimensions using KPCA dimensionality reduction. Accordingly, the sklearn function setting was set to ’ and . Because the basic signal characteristic data included only 3 dimensions, this data was not reduced in dimensionality. Therefore, the PCA parameter setting included .


5.2. Dimensionality Reduction Analysis of Intrapulse Features
Figure 9 presents the KPCA dimensionality reduction distribution of 7 radiation source signals to 50 dimensions, where the number of samples for each radiation source was 1000. It can be seen that the features of the different radiation sources within the pulse are reduced after dimensionality reduction, and the main differences between the different radiation sources are concentrated in the most principal components. The eigenvalues of each principal component are analyzed in Figure 10. It can be seen that the feature value decreases sharply when the number of principal components exceeded 5, and the cumulative contribution to the variance of the first 5 principal components was 66.4%. Therefore, the intrapulse features were reduced to 5 dimensions by the KPCA method in subsequent experiments. The dimensionality reduction ensures that the feature distributions of the different radiation sources are different, which enables the signals of different radiation sources to be effectively distinguished for realizing signal sorting.


5.3. Comparison of Clustering and Sorting Algorithms
The clustering performance of the multiperspective collaborative clustering sorting method was compared with those of the data field grid clustering algorithm and the DBSCAN algorithm. Figure 11 presents segment 10 of the original data, which included only 4 radiation sources, and Figures 12–14, respectively, present the corresponding clustering results of the data field grid (Figure 12), DBSCAN (Figure 13), and the proposed clustering and sorting algorithms (Figure 14). The data application of DBSCAN algorithm and data field grid clustering algorithm is the basic characteristics of the signal. Here, the basic signal characteristic data (i.e., AOA, RF, and PW) have been normalized to simplify the plots along values ranging from 0 to 1.0.




From Figures 12–14, it can be seen that the data field grid and DBSCAN clustering and sorting algorithms present obvious clustering errors. This is particularly the case for the data field grid algorithm, which presents a relatively large number of clusters owing to the box-like distribution of the collected data that can support multiple potential centers. Here, the 30,000 data points from the 4 radiation sources are classified into 6 categories. The use of a globally fixed neighborhood radius by the DBSCAN clustering algorithm introduces clustering errors for those data points with uneven density at high PW. In contrast, we note that the clustering effect of the proposed multiperspective collaborative clustering and sorting algorithm is best. Here, the basic characteristic data are correctly clustered, except for four discrete data points at high PW.
The performances of the clustering and sorting algorithms were further evaluated comprehensively by means of indices composed of 4 external standards and 2 internal standards. For all evaluation standards considered, the performance of an algorithm increases with increasing index value. The evaluation index system employed is illustrated in Figure 15, and the results of analysis are listed in Table 1. It can be seen from the table that the proposed multiperspective collaborative clustering and sorting algorithm provides superior performance for all evaluation standards considered.

The effectiveness of the multiperspective collaborative clustering approach proposed in this paper was further verified by comparing the performance of the nonuniform DBSCAN clustering algorithm employed in the proposed approach under both single-perspective and multiple-perspective frameworks. The performance results are listed in Table 2. As can be seen from the table, the clustering results obtained by the nonuniform DBSCAN clustering algorithm under the multiperspective framework are superior to that obtained under the single-perspective framework.
5.4. Clustering and Sorting Process Analysis of the Proposed Algorithm
The multiperspective collaborative clustering and sorting process was further analyzed by evaluating the changes in the clustering results of the radiation source data of segment 6 from its original distribution, which included 6 radiation sources, to the initial clustering results and those obtained over three successive iterations, whereas discussed above, data is learned from the two different perspectives alternately, and the difference between the learning results of the data from the two perspectives is employed to modify the clustering sorting model. The various distributions of the segment 6 data are presented in Figures 16–20 based on perspective 1. Here, the actual distribution of segment 6 data is presented in Figure 16, while Figure 17 presents the initial clustering results, and Figures 18–20 illustrate the evolution in the clustering results after three collaborative clustering iterations.





The collaborative optimization process shown in Figures 17–20 demonstrates that the clustering accuracy is gradually improved with successive iterations. This is because the data derived from the two different perspectives describe the target attributes differently. As a result, the cluster label transfer and LDA dimensionality reduction processes, respectively, correct the clustering model and generate new subspaces, under which, the clustering results of the unsupervised clustering algorithm are improved.
6. Conclusion
The present study addressed the disadvantages of currently available cognitive electronic reconnaissance methods under the complex electromagnetic environments on modern battle fields by proposing a multiperspective collaborative clustering method for sorting radiation sources based on the multiperspective information of radar signals. The two perspectives employed include the basic signal characteristics and intrapulse features of radar signals. The proposed method performs unsupervised clustering, cluster label transfer, and PCA dimensionality reduction iteratively based on the differences between the clustering results obtained from the two signal perspectives. Therefore, radiation signal sorting can be conducted in a noncooperative context without human supervision. The common problems associated with the high dimensionality and correlation characteristics of the intrapulse features of radar signals are addressed by applying nonlinear dimensionality reduction using KPCA. The results of comparative experiments demonstrate that the proposed multiperspective sorting method can make full use of the difference information between the two signal perspectives and thereby improve the accuracy of clustering-based radiation source sorting. Application of the multiperspective clustering method to actual radar signals in a realistically complex electromagnetic environment demonstrated its capability of improving the radar signal sorting results of cognitive electronic reconnaissance systems compared with other state-of-the-art clustering methods. Moreover, the multiperspective collaborative clustering sorting method is demonstrated to provide a better clustering and sorting ability than the single-perspective clustering method. However, in the simulation process, it is found that the data processing time is long, and reducing the complexity of the algorithm in the later stage is one of the problems that needs to be solved.
Data Availability
The data that support the findings of this study are available upon request from the corresponding author Y.Z. The data are not publicly available due to restrictions, e.g., their containing information that could compromise the privacy of research participants.
Conflicts of Interest
The authors declare no conflict of interest.
Acknowledgments
The author would like to thank the Shaanxi Provincial Science and Nature Foundation (approval number 2021JM-225) for providing funds for the experiment.