Abstract
Recently, the design of sparse linear array for direction of arrival (DOA) estimation of non-Gaussian signals has attracted considerable interest due to the fact that the fourth-order difference coarray offered by non-Gaussian significantly increases the aperture of a virtual linear array, which improves the performance of DOA estimation. In this paper, a super four-level nested array (S-FL-NA) configuration based on fourth-order cumulants (FOC) is proposed. The S-FL-NA consists of uniform linear arrays which have different interelement spacing. The proposed array configuration is designed based on interelement spacing, which, for a given number of sensors, is uniquely determined by a closed-form expression. We also derive the closed-form expression for the degrees of freedom (DOFs) of the proposed array. The optimal distribution of the number of sensors in each uniform linear array of the proposed array is given for an arbitrary number of sensors. Compared with the existing sparse arrays, the proposed array can provide a higher number of degrees of freedom and a larger physical array aperture. In addition, to improve the calculation speed of the fourth-order cumulant matrix, we simplify the FOC matrix by removing some redundancy. Numerical simulations are conducted to verify the superiority of the S-FL-NA over other sparse arrays.
1. Introduction
Direction-of-arrival (DOA) estimation is one of the important research fields in array signal processing [1–6]. For uniform linear array (ULA), the traditional estimation method such as multiple signal classification (MUSIC) [7] and estimation of signal parameters via rotational invariance techniques (ESPRIT) [8] can only resolve sources under the condition of physical sensors. However, the underdetermined conditions in which the number of target sources is larger than that of array sensors are very common in the real world.
To solve this problem, many sparse array structures have been proposed to increase the number of degrees of freedom (DOFs), especially for underdetermined conditions. The minimum redundant array (MRA) [9] is a well-known sparse array that maximizes the number of consecutive lags based on difference coarray. However, due to the fact that the closed-form expression of the configuration of MRA does not exist, it is difficult for array design and precise DOF analysis. The nested arrays (NA) [10] and coprime arrays (CPA) [11] are the most notable sparse arrays presented recently. They both consist of two uniform linear arrays, which make it possible to express the structure of NA and CPA analytically. In [12], coprime arrays with compressed interelement spacing (CACIS) and coprime arrays with displaced subarrays (CADiS) are proposed, which increases the DOF and improves estimation accuracy greatly. Based on these arrays, many underdetermined DOA estimation approaches have been proposed, such as spatial smoothing subspace MUSIC (SS-MUSIC) [13] and compressed sensing algorithm [14–16].
The designs above and estimation methods of the sparse arrays are mainly based on second-order statistics. For non-Gaussian signals, the fourth-order cumulants can also be used to achieve DOA estimation [17, 18] because the fourth-order cumulants of non-Gaussian signals are not zero, while those of Gaussian signals are zero. In [19], DOA estimation based on the fourth-order cumulant (FOC) has been exploited for underdetermined conditions. The method of virtual array concept based on FOC is proposed in [20]. Compared with second-order statistics, utilizing the fourth-order cumulant can improve the degrees of freedom and eliminate the Gaussian noise. Therefore, it is attractive to exploit how to construct a sparse array based on the fourth-order cumulant. The -th-order difference coarray concept and -level nested arrays are proposed in [21] based on the -th-order cumulants [22, 23]. A four-level NA (FL-NA) concept to the case of FOC is investigated as a special case of -level nested arrays when is equal to 2. By adding to the NA and CPA a third uniform linear array, E-NA and E-CPA are proposed based on FOC in [24, 25], respectively. In [26], an expanding and shift scheme is discussed by considering fourth-order difference coarrays as the sum set of two second-order difference lags, leading to the expanding and shift nested array (EAS-NA-NA) with more DOFs. A two-level NA based on FOC is developed in [27], without a gap to break the consecutiveness of coarray. An enhanced four-level nested array (E-FL-NA) is proposed [28] by adding two extra subarrays to the NA, achieving a higher number of the consecutive coarray lags than the aforementioned sparse arrays. Although these array structures increase DOFs greatly, the inner structure of their virtual arrays has not been fully analyzed in the researches given above. Therefore, they are not optimum, and further improvement is possible.
In this paper, a novel four-level nested array structure is proposed based on FOC, which is composed of four sparse ULAs. The proposed sparse array structure effectively utilizes the interelement spacings of the third and fourth sparse ULAs, which provides the largest aperture virtual uniform linear array compared to the existing array structures with a given number of sensors. The closed-form expressions for the physical sensor location and the virtual aperture are derived, and the optimal distribution of sensors to each of the sparse ULAs is also given. Then, to increase the calculation speed of the FOC matrix, we simplify the FOC matrix by removing some redundancy of that.
Throughout this paper, we make use of the following notations. Matrices and vectors are represented by capital letters and lower letters in boldface, respectively. Given a matrix , we use , , and to denote the transpose, the Hermitian transpose, and the conjugate of , respectively. We use and to denote the Kronecker product and the Khatri–Rao product, respectively. The statistical expectation is denoted by , and is the vectorization operation.
The rest of this paper is organized as follows. In Section 2, the signal model is presented. Section 3 describes the novel sparse array structure based on fourth-order difference coarrays and compares the DOFs of the proposed array with those of FL-NA, E-CPA, E-FL-NA, and EAS-NA-NA. Section 4 describes the simplified FOC matrix. Section 5 presents numerical examples, and Section 6 summarizes this paper.
2. Cumulant-Based Fourth-Order Difference Coarray
2.1. Signal Model
Consider a sparse linear array with physical sensors; the unit spacing is equal to half wavelength , denoted by . The set of sensor positions can be expressed aswhere is an integer. Suppose that there are far-field uncorrelated narrow-band sources denoted by impinging on the array from directions , and the steering vector at direction is given bywhere . The received data of the -th sensor can be written aswhere is independent and identically distributed (i.i.d.) additive Gaussian noise with power and it is independent of the sources. is the number of snapshots. We further assume that all signals are non-Gaussian with zero mean, and then the fourth-order cumulants of them are not zero.
2.2. Cumulant-Based Fourth-Order Difference Coarray
There are several definitions of FOC in the literature. In this paper, we utilize the following definition for the calculation of FOC:where , and denote the received signals at the -th, -th, -th, and -th sensors, respectively, and is the cumulant operator. Substituting (3) into (4) giveswhere . Consider an matrix denoted by ; if the element in row and column of the matrix is equal to , the matrix can be calculated by the following formula:where
is called the equivalent array manifold of FOC. In order to increase the array degrees of freedom, can be vectorized as follows:where and . The -th column of can be expressed as
According to (8) and (9), it can be observed that the vector can be equivalent to the data received by a virtual array with elements located at the location set.
Thereby, the fourth-order difference coarray can be obtained by calculating the second-order sum coarray of the virtual array generated at the second-order difference coarray stage. By permutation invariance, it can also be seen as the difference coarray of the virtual array based on second-order sum coarray. Set is centrally symmetric about the origin. Therefore, by analyzing its nonnegative part only, we can obtain all the properties of the set. We denote the nonnegative part of as .
According to [10], a nested array consists of two uniform linear arrays, where the first subarray has sensors with an interspacing and the second subarray has sensors with the interspacing . A topological structure of NA is shown in Figure 1, and the sensor positions of a nested array can be written as

Therefore, the second-order difference coarray of a NA can be expressed asand the sum coarray of that is given bywhere
Note that contains the consecutive range from to , and represents the consecutive range of from 2 to , while denotes the sparse range with interelement spacing from to .
3. The Proposed Four-Level Nested Array Structure
In this section, to maximize the number of consecutive lags in the virtual array based on FOC, we propose a novel sparse array composed of four uniform linear arrays. Then, the optimizing distribution of the number of sensors of the four ULAs is derived and the comparison of DOFs for the proposed array with other sparse arrays is made, which shows that the proposed array can provide a higher number of DOFs and a larger physical array aperture.
3.1. The Super Four-Level Nested Array
Proposition 1. The super four-level nested array (S-FL-NA) is composed of four sparse uniform linear arrays, which is shown in Figure 2. The locations of the S-FL-NA are given bywhere . The set of the consecutive lags of the fourth-order difference coarray for the S-FL-NA is given bywhere . For simplicity, we use to represent the number of consecutive lags.

Proof. The S-FL-NA can be seen as an array composed of two subarrays denoted by and , which can be expressed asObviously, is the same nested array as (11). is an expanded nested array, where the first subarray has sensors starting from the position with the interelement spacing and the second subarray has sensors with the interelement spacing .
Now, we consider analyzing the fourth-order difference coarray of the S-FL-NA using the two sets, given bywhere represents the second-order sum coarray of the virtual array generated by the second-order difference coarray stage and denotes the second-order difference coarray of the virtual array generated at the second-order sum coarray stage, when and . The relationship between the two sets and can be expressed asFirst, we investigate the inner structure of . According to (13), the second-order difference coarray of can be expressed asSubstituting (11) and (21) into (18) yieldsTherefore, we getwhereObviously, each is a continuous set of integers.
Second, we investigate the inner structure of . For , it is generated by a nested array whose interelement spacing is expanded and sensor locations are translated to the right. Hence, the second-order sum coarray of is given bywhereNote that the set is an ULA with interelement spacing . Substituting (14) and (25) into (19) givesNext, we focus on and . The difference set between and is given bywhereObviously, each is a set with continuous integers.
By analyzing and , the following results can be given:indicating that is a consecutive integer set.
To maximize the consecutive lags in set , we consider constructing a new consecutive set . Hence, the following relationship should be satisfied:Then we get the following equation:By solving (32), the value of can be expressed asAs a result, the set of consecutive integers generated by the combination of and can be expressed asThe max value of is equal to the max value of , i.e.,For is centrally symmetric about the origin and , the proof is finished.
The following remarks are the comparison between S-FL-NA and FL-NA [21], EAS-NA-NA [26], and E-FL-NA [28].
Remark 1. According to (11), the S-FL-NA can be seen as a combination of two nested arrays. The configuration of NA1 is . NA2 is an array that is configured as and then shifts to the right, so the first element of NA2 coincides with the last one of NA1. Therefore, the number of sensors of S-FL-NA is , which is the same as the other three arrays.
Remark 2. FL-NA, EAS-NA-NA, E-FL-NA, and S-FL-NA all consist of four uniform linear arrays and have similar subarray 1 and subarray 2. However, the third subarrays of them have different interelements spacing. The interelements spacing of the third subarray of FL-NA is , and those of EAS-NA-NA and E-FL-NA are , while that of S-FL-NA is .
Remark 3. EAS-NA-NA and S-FL-NA can be seen as a sparse array composed of a nested array with sensor positions in plus the second array with sensor positions in . However, the design of EAS-NA-NA is to maximize the number of consecutive lags in , while S-FL-NA is to maximize the number of consecutive lags in . If not considering , the S-FL-NA will be reduced to EAS-NA-NA by making be adjacent to .
To take full advantage of the array proposed above, we take the virtual uniform linear array part in , and then the observation vector of the virtual ULA can be expressed as follows:We divided the virtual ULA into overlapping uniform subarrays of size , where . The output of the -th subarray is given bywhereBy averaging the covariance matrices of the smooth subarrays, we can obtainThe eigenvalue decomposition of is carried out as follows:Since noise subspace and the signal subspace are orthogonal and the space formed by the steering vector and the signal subspace belong to the same space, the noise subspace and the steering vector are also orthogonal. Hence, the spatial spectrum of DOA is given byThen the DOAs can be obtained by the peak search method which divides the angle 0– into small parts and searches the maximum points of to form the DOAs.
3.2. Optimal Sensor Allocation Problem
In this section, we address the problem of the optimal sensor distribution to maximize the degrees of freedom for a fixed number of physical sensors. According to the above proposition, the number of the consecutive virtual lags of S-FL-NA is . Hence, the degrees of freedom optimization problem can be formulated as
In other words, what we are interested in is how to configure to make virtual ULA with the largest possible number of lags under a fixed number of sensors. We define integers and to be the quotient and remainder of modulo 4, i.e., . In terms of and , the solution to (22) can then be completely characterized as given by the following proposition.
Proposition 2. One solution to the optimization problem in (42) can be expressed as
The length of the corresponding consecutive lags can be written aswhere .
Proof. See Appendix A.
Example 1. For the illustrated purpose, Figure 3 shows an example of the proposed array. We assume that the number of physical sensors is 6. By using formula (42), we can get , and . According to (15) and (16), the location of physical sensors is and the number of consecutive lags is 117. Therefore, we can obtain and . For , its second-order difference coarray is and second-order sum coarray is . For , its second-order difference coarray and sum coarray are and , respectively. Therefore, , where , , , and ; and , where , , , and . By combination of , we can obtain having 59 consecutive lags.

3.3. Comparison for Configurations Based on the Fourth-Order Difference Coarray
The sparse array based on fourth-order difference coarray proposed in this paper effectively increases the consecutive virtual aperture. The virtual aperture of the proposed array is larger than those of EAS-NA-NA and E-FL-NA, and the number of consecutive lags is more at the same time. For comparison between S-FL-NA and the E-FL-NA, we have the following corollary.
Corollary 1. Given the same number of physical sensors, the potential DOFs provided by our proposed array can be more than E-FL-NA.
Proof. Given sensors, the number of consecutive coarray lags provided by E-FL-NA is , which is proved in [28]. For S-FL-NA, this number is . In the same configuration , the difference between and is given byTo ensure the existence of each level, .
The number of consecutive lags at the fourth-order difference coarray stage for different arrays is listed in Table 1. It can be observed that S-FL-NA has the largest number of DOFs compared with FL-NA, E-CPA, EAS-NA-NA, and E-FL-NA. Note that the parameters and of E-CPA are coprime because E-CPA is an extension of coprime array.
4. Simplified FOC Matrix for Low Complexity
4.1. Simplified FOC Matrix
According to (6), we know that is an matrix, which has great computational complexity. In the section, we propose a simplified fourth-order cumulant matrix denoted by to reduce the computational complexity.
The FOC matrix after vectorization is equivalent to the observation data of a virtual ULA with the manifold which is an vector. However, in the above estimation algorithm based on S-FL-NA, we only make use of the virtual ULA part with lags, implying that the virtual steering vector and the FOC matrix include redundancy. Therefore, we think that it is only necessary to calculate the cumulant matrix containing virtual ULA parts to perform the subsequent vectorization, without calculating the entire FOC matrix. Accordingly, a simplified FOC matrix based on S-FL-NA is proposed.
First, we rewrite the FOC matrix as
Obviously, consists of blocks, each of which is an FOC submatrix with size . Specifically, the -th block is given by
By analyzing the structure of for S-FL-NA, we find that the virtual ULA part is contained in the following matrix blocks:where . Then we reconstruct a simplified FOC matrix , which is given bywherewhere , , and . Then we can replace with for the subsequent algorithm steps.
4.2. Complexity Analysis
The complexity of the previously mentioned algorithm mainly comes from four aspects: FOC matrix calculation, virtual covariance matrix reconstruction, eigenvalue decomposition, and one-dimensional spectral peak search. The complexity of calculating is , the reconstruction complexity of the virtual covariance matrix is , the complexity of eigenvalue decomposition is , and the complexity of one-dimensional peak search is . Therefore, the complexity of the unsimplified fourth-order cumulant algorithm is . The complexity of the simplified FOC matrix calculation is . The other three are the same as the unsimplified ones. So its total algorithm complexity is . Table 2 shows the comparison of the complexity of the two methods.
Figure 4(a) shows that the complexity of calculating is higher than that of calculating , and when the number of array elements is small, the difference between the two is not big. As the number of array elements increases, the advantage of calculating becomes larger.

(a)

(b)
Figure 4(b) exhibits the changing curve of the ratio of and as the number of array elements increases. It can be observed that although the complexity of has been reduced to a certain extent, it will not be lower than 75 percent of . The 75 percent ratio can be achieved when is an integer multiple of 4.
5. Simulation Results
In this section, simulation results are present to demonstrate the performance of the proposed array structure. First, the number of consecutive lags and physical apertures of five kinds of sparse array are compared, and then MUSIC spectra are simulated to show the number of distinguishable sources. Finally, the root mean square error (RMSE) results of the proposed array under different SNR and different number of snapshots are simulated, respectively, and compared with different arrays. The RMSE is defined aswhere is the number of Monte Carlo simulations, is the number of source targets, and is the -th source DOA in the -th Monte Carlo simulation. The simulation conditions are listed in Table 3.
5.1. The Number of Consecutive Lags and Physical Aperture
In the first set of simulations, comparisons about the number of consecutive lags and physical aperture are carried out to show the superiority of the proposed array. We chose FL-NA [21], E-CPA [25], EAS-NA-NA [26], and E-FL-NA [28] as the comparison arrays here and set the number of sensors varying from 8 to 20. The number of consecutive lags versus the number of sensors is illustrated in Figure 5(a). It can be observed that the number of consecutive lags of S-FL-NA is significantly higher than those of the other four sparse arrays. Figure 5(b) shows the physical aperture with respect to the number of sensors. It is evident that the S-FL-NA has a larger physical aperture compared to the other sparse arrays. The above results indicate that the proposed array holds the promising potential to achieve better DOA estimation performance compared to the other arrays.

(a)

(b)
5.2. MUSIC Spectra
For the second set of simulations, we consider 35 independent sources distributed between and , impinging on the sparse linear arrays consisting of 6 sensors, valued by . The parameters set are 10 dB SNR and 100000 snapshots to make sure it is sufficient to calculate the FOC matrix. Figure 6 shows the MUSIC spectra for four kinds of 6-element sparse arrays. We can observe that, by using FL-NA, EAS-NA-NA, and E-FL-NA, we are not able to resolve all the 35 sources. On the other hand, all the sources are successfully resolved if the proposed array is used, illustrating the improved resolution capability and a higher number of DOFs offered by the proposed array.

(a)

(b)

(c)

(d)
5.3. The RMSE of DOA Estimation
The next simulations consider the RMSE performance versus the input SNR, as well as the number of snapshots. We assume that signals from directions listed in Table 3 impinging on the sparse arrays. As such, physical apertures of all the arrays hold: . Likewise, the number of consecutive lags for all the arrays satisfies . The fixed parameter setting is 0 dB SNR and 7500 snapshots. The RMSEs of DOA estimates with respect to the SNR and the number of snapshots are shown in Figure 7. It can be observed that, along with the increase of SNR and snapshots, all the RMSEs decrease and S-FL-NA yields the lowest RMSE among all the arrays. This can be attributed to a larger physical aperture and a higher consecutive number of lags of S-FL-NA compared with FL-NA, E-CPA, EAS-NA-NA, and E-FL-NA.

(a)

(b)
The Cramer–Rao lower bound (CRLB) is the lower bound of the unbiased estimation variance, which is used to measure the deviation of the DOA estimation. Based on [29, 30], we simulate the CRLB of the five kinds of sparse array. Figure 8 shows the results of the CRLB of the DOA estimation of the different sparse arrays. We can observe that the CRLB of S-FL-NA are lower than those of the other arrays, and the accuracy of DOA estimation is improved obviously.

(a)

(b)

(a)

(b)
5.4. The RMSE Based on Two Kinds of FOC Matrices
In order to verify the effectiveness of the method based on the simplified FOC matrix proposed in Section 4, we simulate its RMSE and compare it with the method using the prototype FOC matrix. The results are shown in Figure 8. It is evident that, under different signal-to-noise ratios and snapshots, the performance of the method based on is close to that based on , showcasing that the proposed method improves the calculation speed while ensuring accuracy.
6. Conclusion
In this paper, a four-level nested array composed of four ULAs has been proposed to maximize the number of consecutive lags based on the fourth-order difference coarray. The closed-form expressions for the physical sensor location and the virtual aperture are specifically derived. Given a fixed number of physical sensors, the optimal distribution of each subarray is also provided. In the future, it will be interesting to design a sparse array based on FOC to reduce the effect of mutual coupling. In the simulation part, numerical simulation results validate that the sparse array proposed in this paper can detect more sources and improve the DOA estimation accuracy because of the higher number of consecutive lags.
Appendix
Proof of Proposition 2. Let us start with a simple optimization problem:When and are nonnegative integers, we apply the arithmetic mean-geometric mean (AM-GM) inequality to the optimization problem. The solution of (A.1) is . If we modify the above optimization problem (A.1) to bethe solution to the optimization problem in (A.2) is also . Therefore, by setting and , formula (42) can be simplified asAccording to the proof of Theorem 3 in [21], we consider any pair integers of , denoted by and . If , we can create a new pair of integers and , where and , such thatUnder the condition that , and instead of and can strictly increase the objective function, as long as . Therefore, in the optimal solution , the difference between any two integers cannot be more than one. So, the solution of (42) can be given byBy substituting (A.5) into (16), we obtain the number of the corresponding consecutive lags expressed by (44).
Data Availability
The authors claim that the data used in this article are provided by their simulations and the data used to support the findings of this study are available from the corresponding author upon request.
Conflicts of Interest
The authors confirm that the funding did not lead to any conflicts of interest regarding the publication of this manuscript.
Acknowledgments
This work was supported by the National Natural Science Foundation of China (Grant no. 61401513).