CSL-SFNet for Cooperative Spectrum Sensing in Cognitive Satellite Network with GEO and LEO Satellites

Yang, Kai; Hu, Shengbo; Zhang, Xin; Yan, Tingting; Zhu, Manqin

doi:https://doi.org/10.1049/2024/5897908

IET Signal Processing

On this page

Abstract Introduction System Model Analysis Conclusion Data Availability Conflicts of Interest Authors’ Contributions Acknowledgments References Copyright Related Articles

Research Article | Open Access

Volume 2024 | Article ID 5897908 | https://doi.org/10.1049/2024/5897908

CSL-SFNet for Cooperative Spectrum Sensing in Cognitive Satellite Network with GEO and LEO Satellites

Kai Yang,^1,2Shengbo Hu ,^2,3Xin Zhang ,¹Tingting Yan,²and Manqin Zhu²

Academic Editor: Wanli Wen

Received24 May 2023

Revised25 Mar 2024

Accepted15 Apr 2024

Published29 Apr 2024

Abstract

In a cognitive satellite network (CSN) with GEO and LEO satellites, there is a large propagation losses between the sensing satellite and the ground station. The results of spectrum sensing from a single satellite may be inaccurate, which will create serious interference in the primary satellite system. Cooperative spectrum sensing (CSS) has become the key technology for solving the above problems in recent years. However, most of the current CSS techniques are model-driven. They are difficult to model and implement in CSNs since their detection performance is strongly dependent on an assumed statistical model. Thus, we propose a novel CSS scheme, which uses convolutional neural networks (CNNs), self-attention (SA) modules, long short-term memory networks (LSTMs), and soft fusion networks, called CSL-SFNet. This scheme combines the advantages of CNNs, SA modules, and LSTMs to extract the features of the input signals from the spatial and temporal domains. Additionally, the CSL-SFNet makes use of a novel soft fusion technique that improves detection performance while also considerably reducing communication overhead. The simulation results demonstrate that the proposed algorithm can achieve a detection probability of 90% when the signal-to-noise ratio is −20 dB; it has a shorter running time and always outperforms the other CSS algorithms.

1. Introduction

Currently, air-space-terrestrial-sea integrated networks have become a hot research topic for B5G and 6G [1]. To achieve global coverage with low time delays, the Cubesat-based LEO mega satellite constellation, i.e., a satellite system consisting of multiple orbital planes and hundreds of small satellites, has become an important candidate technology.

However, with LEO mega satellite constellations becoming increasingly operational, the available spectrum resources are more crowded. To improve spectrum utilization, cognitive satellite networks (CSNs) with GEO and LEO satellites [2] have become important candidate technologies. In the CSN, LEO satellites are permitted to access the authorized spectrum of the GEO satellites through spectrum sensing technology, which can effectively increase the utilization efficiency of the authorized spectrum. According to Wang et al. [3], the GEO must be safeguarded, and the LEO’s interference with the GEO must be kept to a minimum. Nevertheless, due to the large propagation delay and the fadings between the sensing satellites and the ground station [4], the spectrum sensing results of one sensing satellite may be unreliable, causing other LEO satellites to incorrectly access the authorized bands of the GEO satellites and interfere with them. Fortunately, the probability of these false sensations can be decreased by combining the sensing results from multiple space-separate LEO satellites that sense the same spectrum band, i.e., cooperative spectrum sensing (CSS) [5]. Fundamentally, CSS takes advantage of spatial separation diversity, which greatly reduces the likelihood of multiple channels deep fading at the same time [6]. Therefore, in recent years, the CSS has received an increasing amount of attention in satellite communication scenarios [7]. In CSS, the fusion center (FC) is used to make the final decision on spectrum availability. The decision-making process primarily involves either soft combining or hard combining [8]. In soft combining, each sensing satellite directly sends the local sensing data to the FC, which provides the best detection performance but at the cost of reporting link overhead [9]. The most important motivation for cognitive networks is to improve spectrum efficiency, so it is disadvantageous for cognitive networks to produce a large communication overhead in the reporting link [10]. Hard combining drastically reduces communication overhead by allowing each sensing satellite to make a local decision first and then send the decision information to the FC for the final decision. Nonetheless, this method cannot make full use of the features of the local sensing data, which will lead to a loss of features and a reduction of the detection performance [5].

Traditionally, most research on CSS has focused on model-driven methods. Their detection performance relies heavily on predefined statistical models, which makes them more difficult to model and deploy in real environments [11]. Popular detection algorithms include energy detection (ED), matched filter detection, cyclic smooth detection, and eigenvalue detection. Among them, matched filter and cyclic smooth detection both require a priori knowledge of the primary user (PU) signals, which is difficult to obtain in practice [12]. Although eigenvalue detection does not require corresponding a priori knowledge, it is difficult to deploy due to its high computational complexity [13]. ED is simple and easy to deploy. However, its detection ability rapidly degrades or even fails to work when the signal-to-noise ratio (SNR) drops below its lower detection limit [14]. Besides, based on the energy observations of all secondary users (SUs), Taherpour et al. [15] proposed an asymptotically optimum detection algorithm. The simulation results show that the algorithm outperforms the OR detector in Rayleigh fading and shadowing environments, and it also performs almost as well as the optimum detector. However, the algorithm also requires the estimations of unknown parameters in the detector structure.

Recently, with the rapid development of data-driven signal processing techniques, machine learning (ML) and deep learning (DL) techniques have attracted extensive attention from industry and academia in the context of future wireless communication [12, 16]. Applying ML/DL to CSS can make the spectrum sensing process of each sensing satellite more adaptable to changes in the channel environment since it does not require any prior knowledge of the new environment. Additionally, the main advantage of CSN is its cognitive capability, i.e., the self-learning capability in the radio environment, which is analogous to the ML/DL model. Therefore, the ML/DL model is widely used in cognitive networks [17]. Applying ML/DL to CSS can make the spectrum sensing process of each sensing satellite more adaptable to changes in the channel environment since it does not require any prior knowledge of the new environment. Ahmadfard et al. [18] proposed a probabilistic spectrum sensing data falsification (SSDF) attack against a soft-judgment spectrum sensing model. In this attack, the attacker is able to perform parameter configuration adaptively. Simulation results demonstrate the effectiveness of the proposed attack strategy. For the SSDF attacks scenario, Parhizgar et al. [19] proposed a scheme based on clustering the SUs to counter SSDF attacks. By using ML algorithms to classify each of the clusters as reliable or unreliable, the simulation results showed that support vector machine (SVM) and artificial neural network (ANN) outperformed other ML classification algorithms and showed good detection performance. Ghaznavi and Jamshidi [20] proposed a method based on clustering the SUs and estimating some unknown parameters of their received power to solve the problem of SSDF attack, which is based on the clustering algorithm in classical ML, has low computational complexity and shows good detection performance in detecting malicious users. Similarly, Ghaznavi and Jamshidi [21] proposed a reliable method based on clustering the cooperating sensors, which significantly improves the performance of cognitive radio networks with attackers. Paul et al. [22] proposed a CSS model based on deep Q-learning, which outperforms the widely popular SVM-based classification methods and traditional CSS methods under SSDF attacks. Yang and Tong [23] proposed a spectrum sensing algorithm using an SVM-optimized RBF neural network, and the results show that the detection performance of spectrum sensing can be further improved by an ML-optimized RBF neural network algorithm, which opens up a new direction for the application of ML and neural network. In [24], a federated learning (FL) algorithm is proposed to distribute the data collection and model training process over many devices. The results show that the detection accuracy of the FL algorithm is similar to that of detection using convolutional neural networks (CNNs), achieving the goal of simplifying the spectrum sensing process in the network. In [25], an ML-based CSS method was proposed. Although it achieved good sensing performance, it also greatly affected its robustness when the noise power was too large. In [26], ANN is used for spectrum sensing. However, ANN was prone to overfitting, which will directly affect the sensing results of the test data. In addition, Shachi et al. [8] proposed a spatiotemporal system model of a CSS scenario for which a CNN is trained to classify the PU states to achieve spectrum sensing. Compared with traditional methods, this scheme achieves a higher detection probability. Nevertheless, the algorithm sends the local training samples directly to FC, resulting in a significant communication overhead and a decline in the efficiency of training and testing.

However, as far as we know, there are very few studies on applying ML/DL to the spectrum sensing problem in CSNs. Ding et al. [27] proposed a satellite-based spectrum prediction system that constructs a joint long short-term memory network (LSTM)-autoregressive moving average (ARMA)-assisted spectrum prediction scheme by combining LSTMs and ARMA models. This scheme effectively reduces prediction errors and enables advanced prediction of future spectrum occupancy. In [16], a spectrum sensing method utilizing temporal convolutional networks (TCN) is introduced for CSNs. This method employs TCN to extract the time domain characteristics of the received signal in order to ascertain whether the PU exists. For a CSN with low SNR, Ding et al. [28] proposed a spectrum sensing scheme that combines CNNs and LSTMs, demonstrating good sensing capability. Nevertheless, these studies have only considered the spectrum sensing of individual sensing nodes and have not taken into account the fading effects on satellite-to-ground links. In this paper, we explore a novel DL-based CSS scheme in a CSN consisting of GEO and LEO satellites. In Figure 1, the GEO relay satellite for full-climate and full-time tracking, telemetry, and command systems is used as the PU, and LEO satellites are used as SUs to share the spectrum with the PU. Between the satellite-ground link of this CSN, similar to [29], the fading model we consider mainly includes free-space transmission loss, cloud attenuation, and atmospheric absorption. The main research contributions of this paper are summarized as follows:(1)First, in the proposed CSN, we analyze the effects of the antenna direction of the GEO earth station on the SNR over Ka-band satellite-ground links.(2)Second, in the proposed CSN, we propose a novel CSS model, which uses CNNs, self-attention (SA) modules, LSTMs, and soft fusion networks, called CSL-SFNet. This model uses a compromise between soft combining and hard combining in the FC, i.e., each SU satellite sends only local sensing soft features (SSFs) to the FC. This approach fully utilizes the feature information contained in the PU signals while simultaneously lowering the communication overhead. The CSL-SFNet consists of CSLNet deployed on LEO satellites, and Soft-FusionNet deployed on GEO satellites. The CSLNet consists of the CNN, the SA module, and the LSTM, which can extract the complex features of the PU signals from the spatial and temporal domains. In addition, Soft-FusionNet is mainly composed of three dense layers, which are used to aggregate SSFs from all the SUs and make the final decision. The simulation results demonstrate that the proposed algorithm can obtain a detection probability () of 90% when the SNR is −20 dB with less running time and always outperforms the other CSS algorithms.(3)Finally, we must maintain a constant false alarm probability () in order to comply with the IEEE 802.22 standard’s transmission requirement [30]. Therefore, to maximize at a given , inspired by the Neyman–Pearson criterion, this paper designs a threshold-based detection mechanism at the output of Soft-FusionNet, which can conveniently control .

The remainder of this paper is organized as follows: In Section 2, the CSN system model is described. In Section 3, the proposed CSS network model framework is introduced. To evaluate the proposed algorithms, Section 4 presents the simulation results. Finally, Section 5 makes a few conclusions.

2. System Model and Problem Formulation

2.1. System Model

In the CSN shown in Figure 1, the GEO relay satellite working in the Ka-band is the PU, which is considered an FC because of its wide coverage and relatively static characteristics with the ground station, and the Soft-FusionNet is deployed on the PU. In addition, LEO satellites with an orbital altitude of 550 km are SUs, and CSLNet is deployed on each SU. The PU satellite beam coverage includes both multiple GEO earth stations (GESs) and LEO earth stations. The antenna beam of the SU satellite overlaps with that of the PU satellite. The PU satellite will suffer serious interference if the SU satellites access the same frequency range. Additionally, in this CSN, the PU satellite sends data to the GES in the downlink. In the uplink, the GES sends commands to the PU satellite. We assume that the spectrum data from the PU satellite or GES can be received by the SU satellites within the beam coverage of the PU satellite. For convenience, we focus on the uplink scenario where the GES communicates with a single PU satellite. During sensing, the SU satellites at different locations receive data from the GESs, and CSLNet extracts SSFs from the data first. Second, each SU satellite sends the local SSFs to the FC through the reporting link (i.e., the channel used when the SU communicates with the FC). Then, the FC combines all the SSFs and sends them to Soft-FusionNet to make the final decision. Finally, the FC broadcasts the sensing results to all the SUs. Thus, the CSS problem can be described as a binary hypothesis testing problem, and the signal samples received by the SU satellites can be written as follows [29]:wherewith and denoting the hypotheses that a PU is present and absent, respectively. denotes the signal samples received by the SU, and is the signal sample sent by the GES. represents the noise vector, which is assumed to follow a circularly symmetric complex Gaussian distribution with zero mean and variance . and denote the received power of the SU and the transmitted power of the GES, respectively. stands for the elevation angle between the SU satellite and GES. is the distance of the GES to the SU, which is mainly affected by . and represent the velocity of the light and the working carrier frequency, respectively. and represent the propagation factors of the atmospheric absorption and the cloud attenuation in the uplink, respectively [29]. Furthermore, is the max gain of the receive antenna of the SU satellites, which is constant because the receive antenna continuously tracks the GES. is the off-axis angle of the GES in the direction of the SU satellites. stands for the gain of the GES transmit antenna in the direction of the SU, which is mainly affected by the . The antenna gain is expressed by the following [3]:wherewith and are the first- and third-order Bessel functions, respectively. is the angle that corresponds to the 3 dB beamwidth. is the maximum gain of the GES transmit antenna when , and its expression is as follows:where stands for the antenna efficiency and is the antenna area. In addition, it can be proven by geometric relations that is also a function of . The dotted line in Figure 2 presents the variation in the antenna gain with . The setting of simulation parameters is shown in Table 1.

Then, the SNR of the received signals can be expressed as follows [29]:where stands for Boltzmann’s constant. and denote the bandwidth and noise temperature of the receiver at the SU satellites, respectively. Figure 1 shows that follows the law of first decreasing and then increasing when the sensing LEO satellite travels along its orbit at high speed. can be expressed as follows:where denotes the radius of the Earth and is the altitude of the LEO. Apparently, the SNR is mainly influenced by and when is constant. The solid line in Figure 2 shows that the SNR of the received signals is greatly affected by the position of the SU satellites, and it can be seen that the fluctuation of SNR reaches 60 dB. Therefore, the sensing results of the SU satellites far away from the GES are likely to be unreliable. To improve the overall sensing performance, a robust CSS scheme using CSL-SFNet is designed in this paper and will be described in detail later.

3. CSL-SFNet for CSS

3.1. CSL-SFNet

Figure 3 shows the network architecture of the proposed CSL-SFNet for CSS, which consists of two parts. The first part, called CSLNet, is deployed on each SU satellite to obtain the SSFs from the input data. We first use a CNN to extract features from the input signals. Since the gradient of the traditional stacked CNN model will disappear as the network depth increases, we introduce the residual module, which can successfully address the problem of gradient disappearance by fuzing the input with the output features of the convolution layer. Additionally, LSTM is used to extract the deeper time-dependent features from the high-dimensional abstract features output by the residual module. The main goal of the SA module [31] is to focus on specific regions with strong dependencies in the output features of the residual module, which can significantly increase how efficiently crucial feature information is extracted. The second part of the architecture, called Soft-FusionNet, is deployed on a PU satellite to make the final decisions; it consists of three dense layers with 128, 32, and 2 neurons. The specific parameter settings of the proposed CSS architecture are shown in Table 2.

3.2. Data Preprocessing

For the proposed CSLNet, which is only suitable for noncomplex numbers, we partition the input samples into real and imaginary parts. Thus, the input samples can be written as follows:where and represent the real part and the imaginary part, respectively. and denote the length and number of samples, respectively. The received samples need to be labeled in the offline training stage, and the labeled samples can be represented as follows:where stands for the th sample of the training set . represents the label of . and denote the hypotheses of and in Equation (1), respectively. Since CSS is a binary hypothesis test problem, training CSLNet can be considered a binary classification problem. Therefore, we can encode the label as a one-hot vector,to indicate the state of the PU.

3.3. Offline Training

For the proposed CSS model, the offline training is divided into two stages. The first stage is that each SU trains the CSLNet through the labeled training samples offline. For convenience, we define a clear physical meaning for the network output that is normalized by the function, i.e.,withwhere stands for the trained weights of CSLNet, and denotes the entire CSLNet deployed on the th SU satellite. represents the class score of output by CSLNet. Besides, a generic objective function is defined for each SU in the training stage by using the maximum-likelihood criterion, i.e.,According to the objective function (Equation (13)), we can define a cross-entropy cost function for CSLNet training, i.e.,

CSLNet training’s purpose is to determine the optimum weight parameter as follows:

We obtain the optimal model parameter by minimizing Equation (14). Meanwhile, the backpropagation algorithm and the Adam-based optimizer are used to train the CSLNet. Then, we call the output of the well-trained CSLNet the SSF and express it as follows:

In the second stage, FC first combines all the SSFs sent by the SU satellites into one SSF vector as follows:

Second, is used to train Soft-FusionNet to obtain the optimal model parameter . Since the training process of Soft-FusionNet is similar to that of CSLNet, we will not describe it in detail. Finally, the outputs of the well-trained Soft-FusionNet are and , which are the class probabilities of and , respectively.

Finally, Figure 4 presents the loss value changes during the offline training process of the single CSLNet and Soft-FusionNet. We can clearly see that the loss values of both networks decrease as the number of epochs increases without any apparent overfitting. Moreover, we can also observe that the loss value of Soft-FusionNet is significantly lower than that of CSLNet. This indicates that the SSFs extracted by multiple CSLNets can effectively reduce the decision error in FC through soft fusion. This result demonstrates the effectiveness of the proposed soft fusion scheme in CSS.

3.4. Online Detection

In the online detection part, the SU satellites receive new unlabeled signal samples from GES and send them into the proposed CSS architecture to obtain the corresponding class probabilities and . We can obtain the final decision result by comparing the values of and . However, to fulfill the IEEE 802.22 standard’s transmission requirement, we must maintain a constant . Inspired by the Neyman–Pearson criterion, we design a threshold-based decision scheme at the output of Soft-FusionNet that can control the desired by updating the threshold. First, we randomly choose samples under the label from the training samples and constitute the selected samples as a new sample set . Then, the sample set is put into the proposed CSS scheme to obtain the corresponding results. Then, we sort the results in descending order as follows:

Second, the detection threshold can be expressed as follows:where denotes the rounding down function. Finally, for the new online samples , the decision result can be obtained by the following:

Above, the proposed CSS algorithm is summarized in Algorithm 1.

1: Set ; initialize maximum training epochs ; initialize and with random weights;
2: Collect the training set and feed it into CSLNet;
3: fordo
4: According to Equation (15), SUs train the CSLNet by the backpropagation algorithm to obtain the optimal model parameters ;
5: end for
6: SUs send the output SSFs of well-trained CSLNet to FC;
7: FC combines SSFs as ;
8: fordo
9: FC trains the Soft-FusionNet by the backpropagation algorithm to get the optimal model parameters ;
10: end for
11: FC calculates the threshold based on Equations (18) and (19);
12: SUs receive the test sample online and input it into the well-trained CSLNet to obtain the new SSFs, which are transmitted to FC;
13: FC outputs the corresponding category probability vector and and decides the final PU state based on Equation (20);
14: FC broadcasts the final decision result to each SU.

4. Simulation and Analysis

4.1. Simulation Environments

In the proposed CSN scenario, we use the GNURadio [32] to generate the sample dataset of PU signals. The PU signal is a 16QAM modulated signal commonly used in satellite communication scenarios [14]. The SNR of the received signals is greatly influenced by . Thus, to verify the SNR robustness of the proposed algorithm, we generate weak signal samples in the SNR range of −20 to 0 dB as the dataset according to Equation (6). The relevant parameter settings for the generated data samples are shown in Table 1. In addition, to further analyze the effects of the direction of the antenna of GES on the SNR over the Ka-band satellite-ground links, we incorporate noise uncertainty (NU), which has a significant impact on the performance of the actual detection. For the NU scenario, the estimated noise power is , where denotes the actual noise power. stands for the NU factor [11], which obeys a uniform distribution with an interval of . In our scenario, we let , which indicates that 1 dB of NU is introduced. Finally, all the algorithms are implemented based on Python 3.7, and the simulation platform is a PC equipped with an NVIDIA GeForce RTX2080Ti GPU and an Intel (R) Core i9-9900K CPU.

4.2. Simulation Results

In this part, the performance of all the CSS algorithms will be evaluated by two key performance indicators, i.e., and , which can be represented as and , respectively, where and are the test statistics and detection threshold of the corresponding algorithm, respectively.

First, we will verify the impact of the number of SU satellites on the detection performance of the proposed algorithm. We set different SU numbers and selected test samples with SNR = −16 dB to evaluate the detection performance of the proposed scheme. In Figure 5, we clearly discover that when the SU number increases, the detection performance of the proposed approach is also enhanced. This is because more hidden features of the PU signals can be learned by the proposed algorithm as the number of SUs increases. In addition, considering the operating point with , the of the proposed algorithm reaches 99% when the number of SUs is 7, but only 25% in the case of a single SU. Meanwhile, we also find that the of the proposed scheme is close to convergence when the number of SUs is 7.

Second, to further evaluate the detection performance of the proposed algorithm, we compare it with other spectrum sensing algorithms, including ED, CNN, LSTM, and multilayer perceptron (MLP). The CNN model consists of two convolution layers containing 128, 64 filters with a filter size of 3 and two dense layers with 128, 2 neurons. The LSTM model consists of one LSTM layer containing 64 neurons and two dense layers with 64, 2 neurons. And the MLP consists of five dense layers with 256, 500, 250, 120, and 2 neurons, respectively. All the algorithms are evaluated using data samples with an SU number of 9 and a sample length of 512 to ensure a fair comparison, and the optimal hyperparameters of each model are determined using a large number of simulations. According to Equation (20), different detection thresholds can be obtained by changing the value of . We choose the test sample data with SNR = −16 dB under various thresholds and feed it into each detection algorithm to obtain the receiver operating characteristics (ROC) curves. In Figure 6, we find that the proposed algorithm outperforms other schemes at any level. Apparently, the data-driven DL algorithms are better than the model-driven ED. This is because the neural network model has adaptive learning ability and can automatically extract hidden features from the PU signals, but ED can only make decisions based on the energy level of the PU signals. When the SNR level is low, it is difficult for the ED to accurately sense the state of the PU. In addition, NU is also common in CSNs, so it is necessary to study the robustness of the SNR of the proposed algorithm in the case of NU. The dotted line in Figure 6 presents the sensing performance of each detection algorithm when the NU is 1 dB. We find that the proposed algorithm is less affected by NU, while the detection performance of other schemes is more affected by NU. Additionally, to verify the effectiveness of the proposed scheme at different SNR levels, we set to 1% and show the SNR curves of each algorithm in Figure 7. As with the ROC curve, the proposed algorithm still outperforms other sensing algorithms, achieving a of over 90% even at SNR = −20 dB, while the ED can hardly work at this SNR level. Then, to further verify the sensing capability of the proposed algorithm in different SNR ranges, we generated untrained test samples with an SNR range of −28 to −20 dB for testing all sensing algorithms. As can be seen in Figure 8, the proposed algorithm still shows the best detection performance at these untrained SNR levels, while CNN, MLP, and ED can barely work.

Additionally, to demonstrate the effectiveness of each component in the proposed CLS-SFNet, we further conducted ablation experiments. From Figure 9, we can see that the impact of the LSTM layer on sensing performance is relatively significant. This is because the model’s input IQ data is a type of time series, and the LSTM can further learn the temporal correlations within the input features, thereby significantly improving the perception performance. However, the impact of the SA module on sensing performance is relatively smaller, but SA can help the model weight the data at different positions in the feature sequence when processing it, allowing the model to focus on the most important parts of the sequence, thereby improving the efficiency of model training. Overall, both the LSTM layer and the SA are essential components of the proposed model and are crucial for the enhancement of the overall model performance.

Finally, we further evaluate the generalization ability of the proposed approach using the RadioML2016.10a [32] dataset, which contains eight types of modulation signals and is widely used for modulation recognition scenarios. It consists of the I/Q signal vector of size , which takes into account typical radio communication impairments such as time delays, frequency offsets, and sample rate drifts. In Figure 10, the dotted line represents the detection ability of the proposed algorithm for untrained modulation schemes. We find that the of the proposed approach for all the modulated signals reaches 100% when SNR = −12 dB. The results show that the proposed algorithm still has good detection abilities for untrained modulated signals.

4.3. Complexity Analysis

The proposed CSL-SFNet consists of a CNN layer, LSTM layer, and dense layer. For the CNN layer, the computational complexity of one data sample is [33], where is the number of convolution layers; is the number of convolution kernels in the th layer; is the number of input channels in the th layer; is the spatial size of convolution kernels; and is the spatial size of the output features of the th layer. According to Table 2, the number of convolution kernels of the CNN layers used is only 64 and 32. For simplicity, we let and represent the other CNN layers with the same number of convolution kernels. In addition, is used to represent the convolution kernel size of all the other convolution layers. In our spectrum sensing, the size of one sample is , so the total computational complexity of the convolution part is . According to Sak et al. [34], for the LSTM layer, its computational complexity is related to the number of hidden units and the input size. In our scenario, the computational complexity of the LSTM layer is , where stands for the number of hidden units. Additionally, for the dense layer, the computational complexity is , where and represent the input size and the number of neurons in the dense layer, respectively. Finally, the total computational complexity of the proposed CSS architecture is as follows:

Outside of the computational complexity, the communication overhead between the SU satellites and the FC is also an important problem. In the proposed scheme, the SU satellite sends the SSFs to the FC through the reporting link instead of directly sending the received signal data to the FC. The communication overhead required by the SU to send the SSFs of one data sample is only 96 bytes, while the communication overhead required to send one data sample to the FC is 8,000 bytes. In the simulation, the total number of training samples is 42,000, and the number of SUs is 9. The proposed algorithm converges after training for 20 epochs, and the total communication overhead is only 692.14 Mbytes. However, the communication overhead required is 59,062.5 Mbytes if all the SUs send data samples directly to the FC. Evidently, the proposed algorithm will show impressive advantages in future scenarios with larger data.

Finally, we analyze the test samples using different algorithms and calculate the running time accordingly. Analyzing Figures 6 and 11, we find that when SNR = −20 dB, the CSS algorithm based on LSTM and the proposed algorithm can achieve more than 90% of . However, the running time of the algorithm based on LSTM is almost five times that of the proposed algorithm. Because LSTMs have a more complex internal structure, leading to slower processing speeds for longer input data. Although the proposed model includes LSTM layers, their inputs are feature data that has been dimensionality reduced by multiple convolutional modules, resulting in relatively lower complexity. This enables the proposed model to have shorter running times. In addition, the longer running time is very disadvantageous for the spectrum sensing of the high-speed moving LEO satellites. Thus, although the running time of the proposed approach is slightly longer than that of a traditional ED, the proposed scheme is a promising CSS scheme in terms of its excellent detection performance.

5. Conclusion

In this paper, we propose a novel CSS algorithm using CSL-SFNet in a CSN with LEO and GEO satellites. The algorithm employs a soft fusion technique in FC, which can fully utilize the features of the primary signals and significantly lower the communication overhead of the reporting link. The simulation results show that the proposed algorithm is insensitive to NU and is robust to untrained modulation types. Furthermore, even in an environment with SNR = −20 dB, the of the proposed algorithm can still reach more than 90%, and it always outperforms the other CSS algorithms with less communication overhead and a shorter running time.

Data Availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Authors’ Contributions

Shengbo Hu and Kai Yang were responsible for the methodology. Xin Zhang and Kai Yang were responsible for the software. Tingting Yan and Manqin Zhu were responsible for the validation. Kai Yang and Tingting Yan were responsible for the formal analysis. Xin Zhang and Kai Yang were responsible for the investigation. Shengbo Hu and Kai Yang were responsible for the data curation. Xin Zhang was responsible for the visualization. Shengbo Hu and Xin Zhang were responsible for the Supervision. Shengbo Hu was responsible for the project administration and funding acquisition.

Acknowledgments

This research was supported by the Guizhou Province Education Department Projects of China, grant number KY [2017]031, KY [2020]007, and the National Natural Science Foundation of China, grant number 61561009.

References

X. You, C.-X. Wang, J. Huang et al., “Towards 6G wireless communication networks: vision, enabling technologies, and new paradigm shifts,” Science China Information Sciences, vol. 64, no. 1, pp. 1–74, 2021.
View at: Publisher Site | Google Scholar
R. Liu, S. Zhu, and C. Li, “Review of cognitive satellite communication technology,” in 2020 IEEE 9th Joint International Information Technology and Artificial Intelligence Conference (ITAIC), pp. 1378–1385, IEEE, Chongqing, China, 2020.
View at: Publisher Site | Google Scholar
C. Wang, D. Bian, S. Shi, J. Xu, and G. Zhang, “A novel cognitive satellite network with GEO and LEO broadband systems in the downlink case,” IEEE Access, vol. 6, pp. 25987–26000, 2018.
View at: Publisher Site | Google Scholar
K. Shi, X. Zhang, S. Zhang, and H. Li, “Time-expanded graph based energy-efficient delay-bounded multicast over satellite networks,” IEEE Transactions on Vehicular Technology, vol. 69, no. 9, pp. 10380–10384, 2020.
View at: Publisher Site | Google Scholar
I. F. Akyildiz, B. F. Lo, and R. Balakrishnan, “Cooperative spectrum sensing in cognitive radio networks: a survey,” Physical Communication, vol. 4, no. 1, pp. 40–62, 2011.
View at: Publisher Site | Google Scholar
K. Wu, M. Tang, C. Tellambura, and D. Ma, “Cooperative spectrum sensing as image segmentation: a new data fusion scheme,” IEEE Communications Magazine, vol. 56, no. 4, pp. 142–148, 2018.
View at: Publisher Site | Google Scholar
M. Jia, X. Liu, Z. Yin, Q. Guo, and X. Gu, “Joint cooperative spectrum sensing and spectrum opportunity for satellite cluster communication networks,” Ad Hoc Networks, vol. 58, pp. 231–238, 2017.
View at: Publisher Site | Google Scholar
P. Shachi, K. Sudhindra, and M. Suma, “Convolutional neural network for cooperative spectrum sensing with spatio-temporal dataset,” in 2020 International Conference on Artificial Intelligence and Signal Processing (AISP), pp. 1–5, IEEE, Amaravati, India, 2020.
View at: Publisher Site | Google Scholar
P. M. Pradhan and G. Panda, “Information combining schemes for cooperative spectrum sensing: a survey and comparative performance analysis,” Wireless Personal Communications, vol. 94, no. 3, pp. 685–711, 2017.
View at: Publisher Site | Google Scholar
A. Jamshidi, “Performance analysis of low average reporting bits cognitive radio schemes in bandwidth constraint control channels,” IET Communications, vol. 3, no. 9, pp. 1544–1556, 2009.
View at: Publisher Site | Google Scholar
C. Liu, J. Wang, X. Liu, and Y.-C. Liang, IEEE Journal on Selected Areas in Communications, vol. 37, no. 10, pp. 2306–2321, 2019.
View at: Publisher Site
B. Soni, D. K. Patel, and M. López-Benítez, “Long short-term memory based spectrum sensing scheme for cognitive radio using primary activity statistics,” IEEE Access, vol. 8, pp. 97437–97451, 2020.
View at: Publisher Site | Google Scholar
E. Axell, G. Leus, E. G. Larsson, and H. V. Poor, “Spectrum sensing for cognitive radio: state-of-the-art and recent advances,” IEEE Signal Processing Magazine, vol. 29, no. 3, pp. 101–116, 2012.
View at: Publisher Site | Google Scholar
T. Ni, X. Ding, Y. Wang, J. Shen, L. Jiang, and G. Zhang, “Spectrum sensing via temporal convolutional network,” China Communications, vol. 18, no. 9, pp. 37–47, 2021.
View at: Publisher Site | Google Scholar
A. Taherpour, Y. Norouzi, M. Nasiri-Kenari, A. Jamshidi, and Z. Zeinalpour-Yazdi, “Asymptotically optimum detection of primary user in cognitive radio networks,” IET Communications, vol. 1, no. 6, pp. 1138–1145, 2007.
View at: Publisher Site | Google Scholar
C. Jiang, H. Zhang, Y. Ren, Z. Han, K.-C. Chen, and L. Hanzo, “Machine learning paradigms for next-generation wireless networks,” IEEE Wireless Communications, vol. 24, no. 2, pp. 98–105, 2017.
View at: Publisher Site | Google Scholar
C. Clancy, J. Hecker, E. Stuntebeck, and T. O’Shea, “Applications of machine learning to cognitive radio networks,” IEEE Wireless Communications, vol. 14, no. 4, pp. 47–52, 2007.
View at: Publisher Site | Google Scholar
A. Ahmadfard, A. Jamshidi, and A. Keshavarz-Haddad, “Probabilistic spectrum sensing data falsification attack in cognitive radio networks,” Signal Processing, vol. 137, pp. 1–9, 2017.
View at: Publisher Site | Google Scholar
N. Parhizgar, A. Jamshidi, and P. Setoodeh, “Defense against spectrum sensing data falsification attack in cognitive radio networks using machine learning,” in 2022 30th International Conference on Electrical Engineering (ICEE), pp. 974–979, IEEE, Tehran, Iran, 2022.
View at: Publisher Site | Google Scholar
M. Ghaznavi and A. Jamshidi, “A low complexity cluster based data fusion to defense against SSDF attack in cognitive radio networks,” Computer Communications, vol. 138, pp. 106–114, 2019.
View at: Publisher Site | Google Scholar
M. Ghaznavi and A. Jamshidi, “A reliable spectrum sensing method in the presence of malicious sensors in distributed cognitive radio network,” IEEE Sensors Journal, vol. 15, no. 3, pp. 1810–1816, 2014.
View at: Publisher Site | Google Scholar
A. Paul, A. K. Mishra, S. Shreevastava, and A. K. Tiwari, “Deep reinforcement learning based reliable spectrum sensing under SSDF attacks in cognitive radio networks,” Journal of Network and Computer Applications, vol. 205, Article ID 103454, 2022.
View at: Publisher Site | Google Scholar
S. Yang and C. Tong, “Cognitive spectrum sensing algorithm based on an RBF neural network and machine learning,” Neural Computing and Applications, vol. 35, no. 36, pp. 25045–25055, 2023.
View at: Publisher Site | Google Scholar
M. Wasilewska, H. Bogucka, and A. Kliks, “Federated learning for 5G radio spectrum sensing,” Sensors, vol. 22, no. 1, Article ID 198, 2022.
View at: Publisher Site | Google Scholar
K. M. Thilina, K. W. Choi, N. Saquib, and E. Hossain, “Machine learning techniques for cooperative spectrum sensing in cognitive radio networks,” IEEE Journal on Selected Areas in Communications, vol. 31, no. 11, pp. 2209–2221, 2013.
View at: Publisher Site | Google Scholar
Y.-J. Tang, Q.-Y. Zhang, and W. Lin, “Artificial neural network based spectrum sensing method for cognitive radio,” in 2010 6th International Conference on Wireless Communications Networking and Mobile Computing (WiCOM), pp. 1–4, IEEE, Chengdu, China, 2010.
View at: Publisher Site | Google Scholar
X. Ding, Q. Lv, Y. Zou, and G. Zhang, “Spectrum prediction for satellite based spectrum-sensing systems using deep learning,” in GLOBECOM 2022-2022 IEEE Global Communications Conference, pp. 3472–3477, IEEE, Rio de Janeiro, Brazil, 2022.
View at: Publisher Site | Google Scholar
X. Ding, T. Ni, Y. Zou, and G. Zhang, “Deep learning for satellites based spectrum sensing systems: a low computational complexity perspective,” IEEE Transactions on Vehicular Technology, vol. 72, no. 1, pp. 1366–1371, 2023.
View at: Publisher Site | Google Scholar
C. Zhang, C. Jiang, J. Jin, S. Wu, L. Kuang, and S. Guo, “Spectrum sensing and recognition in satellite systems,” IEEE Transactions on Vehicular Technology, vol. 68, no. 3, pp. 2502–2516, 2019.
View at: Publisher Site | Google Scholar
Z. Su, K. C. Teh, S. G. Razul, and A. C. Kot, “Deep non-cooperative spectrum sensing over rayleigh fading channel,” IEEE Transactions on Vehicular Technology, vol. 71, no. 4, pp. 4460–4464, 2022.
View at: Publisher Site | Google Scholar
H. Zhao, J. Jia, and V. Koltun, “Exploring self-attention for image recognition,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10076–10085, IEEE, 2020.
View at: Google Scholar
T. J. O’shea and N. West, “Radio machine learning dataset generation with GNU radio,” Proceedings of the GNU Radio Conference, vol. 1, no. 1, pp. 1–6, 2016.
View at: Google Scholar
K. He and J. Sun, “Convolutional neural networks at constrained time cost,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5353–5360, IEEE, 2015.
View at: Google Scholar
H. Sak, A. Senior, and F. Beaufays, “Long short-term memory based recurrent neural network architectures for large vocabulary speech recognition,” Computer Science, pp. 338–342, 2014.
View at: Google Scholar

Copyright

Copyright © 2024 Kai Yang et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

61

Downloads

51

Citations