Abstract

Acoustic array is a ubiquitous tool for locating and quantifying sound sources. However, its effectiveness depends greatly on the array configuration. This paper presents an array configuration method to enhance array performance, especially on the spatial resolution and the Doppler effect correction. The problem of array configuration is formulated into a position matrix determined by introducing partition spatial filtering. Irregular coaxial ring grid spacings and partition filtering conditions are suggested to control array spatial resolution. Geometrical parameters and performance indicators are constructed to quantify the relationships between the array configuration and performance. Based on these quantitative relations, the spatial variation of the array beam pattern and the Doppler effect has got adaptive adjustment. In particular, an adaptive partition algorithm is proposed to reduce computation time. The performance of the method is examined numerically and experimentally, which is compared with the other methods. The results provide the method to guide the design of a 64-microphone optimized array with high performance (1.8° spatial angle resolution and 40% Doppler frequency correction over the bandwidth from 800 Hz to 3000 Hz) and fast computing speed (18 s array generated time for 2000 arrays). Furthermore, an unusual feature of the method is that it can be utilized in the case when the source moves at a nonconstant velocity.

1. Introduction

Acoustic array techniques are applied for moving sources, such as cars [1], trains [2], airplanes [35], and wind turbines [68]. However, array identifying accuracy and resolution are limited by the array configuration due to complex motion sound characteristics. Moving sound sources need some special acoustic characteristics such as broadband, Doppler effect, and directivity, which undoubtedly impose high requirements on array configuration design. For instance, in this case, the array microphones must be mounted closer to avoid spatial aliasing. Also, the total array aperture must be large enough to allow moving sources to be solved accurately. Unfortunately, although the traditional regular array [9] had much more advantages on array performance, it is helpless to meet these requirements due to need for a large number of microphones. Fortunately, irregular array [10, 11] is a good method by which the larger array aperture can be obtained conveniently with few microphones. Irregular array consists of two major categories including geometrical and performance optimized arrays. Ignoring the fact that further studies are required to clarify kinds of optimized arrays, a detailed explanation of the two-dimensional geometrical array is provided in this paper. Geometrical arrays are subdivided into sparse array [12, 13], random array [14], and random sparse array [15]. Sparse arrays are generated from the regular array by extracting different spacing microphones. It provides the relatively larger apertures with the same number of microphones. However, a sparse array is also redundant arrays, which is just like a regular array. Severe aliasing problem always occurs because of the repeated sampling spacing. Although this phenomenon can be improved by limiting the minimum intrasensor spacing less than half wavelength, spatial aliasing still results in the resolution degradation and the appearance of ghost sources in acoustic map [16]. In fact, in order to significantly reduce spatial aliasing, microphone arrays must guarantee nonredundancy in spatial sampling, i.e., nonredundant with almost unique intramicrophone spacings.

Array microphones mounted randomly are one strategy to ensure nonredundant array generation. The position of microphones in a random fashion prevents the periodicity inherent in nonrandom arrangements [17]. However, this randomly layout mode easily leads to the microphones clumping together in a small region, thus yielding a smaller effective aperture size. This problem has been addressed by employing segmenting scheme [18]. Furthermore, earlier studies have shown that the geometrical rotation symmetry helps to avoid array redundancy, for example, wheel array with the odd number of spokes [19] and pizza array with odd slices [20]. Previous research has found that uniform density of the microphone distribution was a significant geometrical factor. Because different points on the source plane will have different exposure from the array, it affects array effective measurement especially irregular array. To ensure the effectiveness, Hook and coworkers [18] have handled this problem of a regular grid array by employing uniform segment scheme. The work described in the present article was performed to evaluate the uniformity for an irregular array.

Despite there is considerable research studies, a good irregular array design is still difficult to conduct due to unexplained relationship among array geometry, performance, and incomplete performance indicators. Some research studies [21] attempted to make it clear what geometric properties are crucial for the superior beamforming performance of irregular arrays. Others [22] wanted to explain that how array layout affect the performance by random, genetic, and iterative optimization. However, because optimization procedures are an exhaustive search, it often resorts to a tedious trial and error cycle to design irregular arrays for a given frequency range. Moreover, since the trade-off of array geometry and performance always heavily relies on situations of application, there are no consistent standards and principles to follow. Since sidelobe of irregular array beampattern is more complex than that of regular configurations on the aspect of amplitude and shape, performance indicators are more intricate problem. So far, a lot of research studies have been conducted to enhance array performance by employing maximum sidelobe level (MSL) [22], synthesizing mainlobe width (MW) and MSL [23], adopting product of frequency-averaged MLW and MSL [24, 25], using statistical characteristics of MW and MSL [18], and utilizing the relative variances of MW and MSL [2628]. Although those methods have the ability to cover large apertures with quite few microphones, low MSL in a wide range of frequencies, and good geometry symmetry, there are some challenges in designing the high performance irregular array for the localization of moving sources. Firstly, the effective geometrical parameters and performance indicators are needed, which can lead to a quantifying relationship between array geometry and performance. Secondly, a more effective cost function can manage the spatial variation of array beampattern. Finally, a simple and fast optimization algorithm is needed for improving computation speed.

In this paper, a partition spatial filtering method is put forward to deal with the problem of irregular array configuration. A geometrical parameter (evenness index) and performance indicator (sidelobe suppression ratio) are established by introducing irregular grids spacing and partition filtering conditions. Our intention is to describe clearly the relationship between irregular array geometry and performance. In particular, an adaptive partition algorithm is designed to enhance optimized speed. Compared with the popular irregular arrays such as logical spiral wheel array and pizza array, partition optimized array has shown superior array performance especially resolution and corrected ability of Doppler effect. The method is more suitable for designing irregular array to localize moving wideband sources.

2. Irregular Grid Array for Moving Sources

A standard moving source array measurement is described in Figure 1, in which the height from the array to the ground is h and the focused distance is Z0. The source locations are assumed on a reconstruction plane B which is attaching to and moving with the side plane of the moving sources. Since the source position can be regarded as a function of time, propagation distances from assumed source locations to each array microphone in the array plane A are calculated. Based on propagation distances, microphone signals are de-Dopplerized corrected. The processed signals are focused on a sequence of locations on the reconstruction plane B to yield the spatial distribution of source strength and then to realize source identification.

During the measurement of moving sources, an array must satisfy basic requirements that there are at least two microphone spacings within one spatial cycle of the sound field in the x direction. Therefore, to avoid spatial aliasing, the minimum sensor spacing is allowed in the x direction, which can be calculated aswhere c is the sound speed and fmax is the highest frequency that contributes to the sound field incident on the array. The present analysis is built on the existence of plane phase planes. An assumed spherical phase surfaces would result in a decrease in the allowable microphone spacing in the x direction.

2.1. Irregular Grid Spacing

In order to share the same spatial aliasing characteristics and avoid the intrinsic periodicity of regular layout, an irregular array is determined on the irregular grid. Irregular grid is built on coaxial ring array which is circumferential equal interval and radial nonuniformly spacing. The spacings of irregular grid are utilized to control the smallest actual microphone spacing for suppressing spatial aliasing. The unequal radius is derived from the performance synthesis of a uniform coaxial ring array, which is shown in Figure 2. Coaxial ring array maintains the rotational symmetry as the wheel array and pizza array. It benefits for avoiding redundant spatial two-dimensional sampling of the array. Moreover, based on the geometric symmetry, it is easily amplifying array aperture through increasing the number of rings or ring spacing, and the array aperture is increasing with twice the radius length. Therefore, an irregular coaxial ring array is generated, and its irregular grid spacings are determined.

First of all, circumferential direction grids are segmented evenly with the equal arc length interval d which should satisfy 0.25λ ≤ d ≤ 0.5 λ to ensure that the array can reduce mutual coupling effects in the upper frequency limit. The normalized d by a wavelength λ is expressed as . Furthermore, the maximum ring radius equals to the aperture, and the adjacent ring radius difference fits to 0.5λ ≤ Δρ ≤ λ. The normalized by wavelength λ is expressed as . The minimized microphone spacing is effectively guaranteed by controlling the radius of the inner ring. For moving source measure, the innermost ring radius ρ1 is determined by . Secondly, based on the uniform coaxial ring array, the unequal rings’ radius can be deduced by the sinusoidal transformation of a regular coaxial ring array’s pattern. Uniform coaxial ring array beampattern is given aswhere N is the number of coaxial rings, Mi is the number of elements in the ith ring, Pij is the sound pressure response of the element j in ring i, and ρi is the radial distance to the ith ring from the center of the array. k is the wave number. For simplification of deducing, the sound pressure response is assumed to be uniform, i.e., Pij = 1. The elements in each ring are also assumed to be uniformly distributed around the circular ring, i.e., . After considerable manipulation, we can obtainwhere u = sin(θ) and is the Bessel function of zero order.

Sinusoidal transformation of equation (3) can be expressed as

Assuming that and using the properties of J0, we can obtain

Combination with arranging equations (4) and (5), ap is determined by the auto regression.

pth ring radius is also obtained by

Finally, once the circumferential spacing and rings’ radius are defined, the underlying irregular grid points can be generated, which is shown in Figure 2. In the figure, the black dots depict grid points and each grid point is a potential position of the array microphone.

2.2. Irregular Grid Point

In this section, the number of grid point on each ring Mi (i = 1, 2, …, N), the total number of grid point Mt with given ring’s number N, and the quantitative relationship between grid point increment Δm and ring radius increment are derived.

The arc length interval di and the ring radius ρi are normalized by wavelength λ. They are defined as

The difference of the grid point’s number in adjacent rings is named grid point increment Δm which is defined by

Normalized difference of radius in adjacent rings is called radius increment which is defined as

Assuming the arc length intervals of all rings are consistent, i.e., , we can obtain the quantitative relationship between grid point increment Δm and ring radius increment , where

In the case of , grid point increment Δm can be depicted aswhere int expresses getting an integer.

The total grid point’s number Mt of a coaxial rings array with N rings can be expressed aswhere Δmi is the grid point’s increment of the ring i and M1 is the grid point number of the inner ring. The concrete relations among Δm, , and are plotted in Figure 3.

3. Adaptive Partition Filtering

The performance of an irregular array, the mainlobe width, and the peak sidelobe level depends on the aperture size and the way of the microphones disposed on the grid points. To make sure that the microphones are spatial well distributed after fixed an appropriate grid size, the entire aperture into smaller subsections was considered. Also, quantitative array spatial filter conditions and performance evaluating indicators were proposed in the paper.

In the partition procedure, microphones are first grouped on a segment-by-segment basis, and each group of microphones is positioned randomly on the grid points within each subsection. Taking into account the measurement of a wide frequency band, one microphone is always positioned at the center of the array’s aperture, named zero section, and the rest of the microphones were positioned randomly within the various subsections. There is equal number of microphone and grid point in each subsection. The microphones in each subsection is placed at least four, i.e., the microphone number in the outmost ring is the same as the subsection number C. The subsection number is strongly related to array microphones’ number MC, grid points’ number Mt, circular rings’ number N, and incident wave length. A novel adaptive algorithm is introduced to describe this relation.

In order to guarantee the uniformity of grid points’ distribution, the ratio between microphones’ number and grid points’ number in each subsection is defined as

Another constraint is imposed on the grid points in each ring to ensure the large array aperture and wide range of measurement frequency, which is defined bywhere Qi is the number of polar angle in the ith ring in each subsection.

Once given the subsections’ number C and grid points’ number J in each subsection, the mathematical model of array’s grid points is generated. Based on the geometric symmetry, the coaxial ring array can be transformed into the matrix expression of a polar diameter and the polar angle of the polar coordinate system. A grid point can be described by a pair of polar angle and polar radius. The grid points of the entire underlying coaxial ring array are depicted by the polar angles’ matrix and polar radius’s matrix. The mathematical model is described as

The columns of matrix and depict the subsections, and their rows represent the polar radius and polar angle of grid points over array area, respectively. The polar angle is named subsection angle which starts on one side of the subsection’s boundary line, not in the positive direction of the x-axis of the array. They are distributed in each subsection randomly, and all the angles are not equal.

An irregular array can be generated using the polar radius and polar angle matrixes, which are generated by positioning randomly several nonzero elements in each column of matrices and in equations (16) and (17). The polar radius and polar angle matrixes are shown in the following equations:

The above matrix operations will produce a mass of polar radius and polar angle matrices, i.e., plentiful array configurations. It resorts to a tedious trial and error cycle. In order to overcome this problem, some novel spatial filter conditions are proposed to filter out array configurations having relatively larger mainlobe width for the same sidelobe level.

3.1. Filtering Condition

There are two aspects need to be considered for imposing geometric constraints on the configuration for array preselecting. Firstly, since the regularity of microphone position leads to redundant spatial 2D sampling interval, we employed the microphone position’s general angle and circumferential general angle difference in the same ring to avoid regularity. They are given in Condition 1. Secondly, uniform density of microphones across the array area is the main factor in the variation of the measurement distances of moving sources. When the array is used at sufficiently long measurement distance, typically down to the distance equal to array diameter, the nonuniform density ensures good array performance over a wide frequency range. However, if the distance becomes much smaller than that, the nonuniform density of the microphones across the array area starts to have the effect that different points on the source plane will get very diverse exposure from the array. In this case, a more uniform density might be better, and numerical simulations have verified that hypothesis—specifically for the so-called partition array described in the present work. When the same array has to be used at very small measurement distances, a more uniform density is even more necessary. In order to quantitatively evaluate the uniformity density of microphones, a novel geometry descriptor is put forward to capture the properties of microphone distribution showing their impact on array performance. They are shown in Condition 2.

Condition 1. Constraints are imposed on the general angle and circumferential general angle difference.
General angle is an angle of the coordinate with the positive direction of the polar axis as the starting position. It can be defined from the subsection angle as follows:where q is the index of subsections number (q = 1, 2, …, C), φkq is the angle k in subsection q (k = 1, 2, …, J), and expresses the general angle k in subsection q. The array’s general angle can be expressed as the form of matrix as follows:where represents the general angle vector of the grid points in subsection q.
Recombination matrix obtains a matrix as follows:The column vector , where is the kth general angle in ring i and subsection q, which expresses the general angle with equal radius. Since the number of microphones in each ring is different, we use zeros to express subsections without microphones in ring i, and in case of Mi < Mmax, row zero should be padded in . Transform the matrix . Firstly, circumferential general angle difference in the same ring for instance, assuming to show that seven subsections, four microphones in ring i and microphones are placed in subsections 1, 2, 4, and 7, respectively. The transformation is defined aswhere express circumferential general angle differences among different subsections in the same ring, i.e., is angle difference between 1st and 2nd subsections and other analogies. The elements of angle difference matrix should satisfySecondly, when the number of microphones in the subsection is less than or equal to the rings’ number N, i.e., Ms ≤ N, there is only one microphone placed in the same ring and same subsection. For example, assume is a row vector of matrix . It shows that there are 7 rings and 5 microphones randomly placed in 1st, 2nd, 4th, 5th, and 6th rings in subsection q. The transformation is defined bywhere express angle differences among different rings in the same subsection, i.e., is angle difference between 1st and 2nd rings and other analogies. The elements of angle difference matrix should satisfyFurthermore, in the case of Ms > N, i.e., the number of microphone Ms is more than the rings’ number N in the same subsection. This means that there are at least two microphones in the same ring and same subsection. The general circumferential angle difference of adjacent microphones in the same ring should abide by the condition of equation (26) .

Condition 2. Geometric descriptor assesses uniformity of microphone’s distribution. The mass center of array is denoted aswhere express the ith microphone position. Its polar moment is defined asBased on equation (28), a definition of aperture size for irregular array can be achieved. The aperture can be regarded as approximately proportional to the dispersion of the microphones. The dispersion factor used to evaluate microphone dispersion is presented. It is defined that the average microphone polar moment is computed byFurthermore, Pielou’s evenness index EI, which is normalized Shannon entropy, is introduced to numerically assess the uniformity of microphone distribution.where s is the total number of Li, is the percentage of Li within a, H is the Shannon entropy, and Hmax is the maximum possible entropy which represents an ideal uniform distribution of Li. As the value of EI increases, microphones tend to spread evenly over the array area.

3.2. Performance Evaluating Indicator

The main parameters influencing the performance of an array are the mainlobe width (MW) and sidelobe level. Sidelobe level consists of the maximum sidelobe level (MSL) and sidelobe suppression ratio (SSR). All parameters present here are on the basis of the so-called array pattern. The mainlobe width represents the spatial resolution of the array and the ability of the array to distinguish different sound sources. The sidelobe level represents the performance of the array in suppressing nonsonic or false sources. The narrower mainlobe width and the smaller sidelobe level, the better the performance of the array.

3.2.1. Mainlobe Width

Mainlobe width represents array’s spatial resolution, that is to say, the ability of the array to distinguish different sound sources. It is a common parameter to assess the resolution of array. The narrower the mainlobe width is, the higher the resolution has. The mainlobe width, in particular, is strongly related to an array configuration. There are different definitions of mainlobe width. We adopt 3 dB down mainlobe width in this paper.

It is known that an array’s mainlobe width generally depends on the array’s aperture size. Aperture sizes of irregular array to some extent can be controlled by the size of the underlying coaxial rings. Here is a definition of “aperture size” for two-dimensional arrays. It can be expressed as the average dispersity of the polar moment of an array’s microphone positions on the array area. The diffusivity a is determined by equation (29). In the simulation section, the relationship between the mainlobe width and aperture size is investigated based on statistical analysis of arrays generated for different segmenting schemes.

3.2.2. Sidelobe Level

The sidelobe level is related to the noise rejection capability of an array. And, it is usually an important factor of the performance of noise. Maximum sidelobe level (MSL) is one of evaluative indicators for the sidelobe level. It defines the dynamic range of the ability of the array to separate sources in different directions. The MSL is a significant parameter when choosing an optimal array configuration. On the contrary, when interpreting results obtained by a given array, it is important to know about MSL. Therefore, a good array design can be characterized by having lower MSL, measured relative to the mainlobe level. Based on the profile of the array pattern shown in Figure 4, we define the maximum sidelobe level function as

It is known that the peak sidelobe levels of irregular arrays tend to be higher than those of equivalent fully populated arrays, and the shape of sidelobe profile is more complex than it. In order to comprehensively assess the performance for an array configuration, sidelobe suppression ratio (SSR) is presented. It is defined aswhere and hp are the amplitude of maximum sidelobe peak and the amplitude of mainlobe peak, respectively.

The numerical simulation results show that the irregular arrays’ sidelobe suppression ratio is strongly related to the number of array microphones. The quantitative relationship can be described asand this value increases as the number of microphones increases.

4. Simulations

The adaptive partition algorithm convergence depends on the irregular grid size, the geometric conditions, and the performance cost functions. The way to verify the ability of the algorithm to provide optimized and repeatable solutions to the selected parameters is to investigate the statistics of various optimized solutions. The performance repeatability is investigated for 2000 arrays optimized in MW, MSL, and SSR, with two types of partition schemes. The objective is to validate the optimization process and also to identify trends. The parameters used for this simulation are shown in Table 1.

First of all, the focus is done on the performance of the partition arrays in terms of EI, MW, MSL, and SSR with different partition schemes. And then, the optimal partition array configuration is investigated and determined. Finally, frequency sensitivity, the resolution, and Doppler effect correction capability of optimal partition array are studied.

4.1. Partition Array Generation

As optimized arrays produce some repeatability in their performance, the purpose is to discern preferential positioning for microphones, which is associated with the given performance indicators that need to be optimized by the adaptive partition optimization algorithm. In the case of the number of microphones in each subsection is more than four, i.e., Ms ≥ 4, 64-microphone array configurations with different partition scheme are plotted in Figure 5. In Figure 5, all outmost radiuses of configurations are the same and the “o” highlights the position of microphones. The values of EI, MW, MSL, and SSR that are taken from 2000 optimized arrays in each partition scheme are plotted in Figure 6.

The best choice of an array configuration from amongst all the partition arrays is reproduced below. Firstly, the array partition is conducted by adaptive partition. Odd number segments include subsections 7 and 9 with a center microphone, which are shown in Figures 5(a) and 5(b). Even number segments consisting of subsections 8 and 16 have no center microphone, which are plotted in Figures 5(c) and 5(d), respectively. And then, candidate arrays are produced by preselecting from different partition arrays using geometrical filter conditions. If an array configuration of (17) does not meet Conditions 1 and 2, the polar radiuses’ and polar angles’ matrices must be randomly generated again until figure out those matrices of arrays satisfying conditions. Those arrays are called candidate arrays. This procedure is conducted by Matlab random generation. Evenness index EI simulation results are shown Figure 6. Although the introduction of geometrical filter conditions in the generation of partition arrays helps to filter out configurations without fine structural characteristics, the number of candidate arrays is large. And, it is difficult to decide which array is the best. Therefore, to find out the optimal partition array configuration, we need to conduct further assessment by using the performance of the candidate arrays.

Finally, performance indicators have been predefined in Section 3, such as MW, MSL, and SSR. Here, a different compromise principle is set up. It sounds like an appropriate choice of an array configuration is from those arrays having the smallest possible value of MW with the same SSR and the largest EI. Distribution of MW and SSR values of four different partition schemes is plotted in Figures 6(a)6(d). First of all, the range of MW values becomes narrow as the partition number increases from 7.2, 5.4° (see Figures 6(a)6(c)) to 5.4° only (see Figure 6(d)). Moreover, in the same values range, the percentage of the smaller value of MW increases with the increase in number of partitions; the percentage of 5.4 degree is from 60.0% (Figure 6(a)) up to 90% (Figure 6(c)). This is explained by the fact that the partition schemes can enlarge the array’s effective aperture. That means as the number of uniform subsections increases, microphones tend to spread evenly over the grid. This is consistent with our original assumption.

On the other hand, range of SSR values becomes narrower and the maximum value is less. The percentage of the value of overtaking 20 drops from 48.5% (Figure 6(a)) to12.5% (Figure 6(d)). This indicates that the increase in the sidelobe level is at the expense of widening MW. This tendency is more obvious in odd number segment schemes because the center microphone is used (Figures 66(a) and 6(b)) to enhance the interference suppression. Furthermore, as can be seen from Figures 6(a)6(d), Pielou’s evenness index EI is relative stabilization and the variety is smaller. The mean value is from 3.2 to 2.4, and the variance is only 0.5 (Figures 6(a) and 6(d)). The value of EI decreases with the increase in partition number approximately. The values of EI of odd number partitions are bigger than those of even number segments. The maximum value of EI is obtained when the partition number is seven. This shows that the seven partition scheme overtakes others schemes on the aspect of the uniformity of microphones distribution. Therefore, the seven partition scheme is our focus. An optimal partition array configuration is generated from above analysis results. It is plotted in Figure 7(c), and its characteristic parameters are given in Table 2.

4.2. Frequency Sensitivity

As arrays are designed for a frequency bandwidth, the evolution of performance indicators with frequency is of great interest to the geometry. Maximum sidelobe level (MSL) and sidelobe suppression ratio (SSR) are two predefined performance indicators which are functions of frequency and directly related to the frequency characteristics of the moving sound sources. A partition array is optimized over the frequency range of 500 Hz to 3000 Hz, which is shown in Figure 7(c). The same tests are performed using the wheel array and the pizza array to compare the performance, which are shown in Figures 7(a) and 7(b), respectively. The MSL and SSR curve of three different optimized arrays is calculated, which is shown in Figure 8. These arrays are composed of 64 microphones with 2 m apertures. Simulation geometrical parameters in our experiment are listed in Tables 1 and 2.

Distributions of SSR and MSL of three different arrays are plotted in Figures 8(a) and 8(b), respectively. As can be seen from Figure 8(a), three arrays have SSR value stability in the frequency range of 800 Hz to 3000 Hz. Partition array has the maximum SSR value among three arrays. As shown in Figure 8(b), MSL values of partition array are greater than −10 dB on the bandwidth of 800 Hz to 3000 Hz, especially above 1000 Hz reaching −8 dB. According to Christensen and Hald [19], a MSL above 10 dB is a good performance. Therefore, we can obtain that partition array has good false sound source rejection and robustness on the overall radiation bandwidth of moving sound sources, and its frequency performance is superior to log-spiral wheel array and pizza array.

4.3. Source Resolution

The partition array is carried out to assess its usefulness in terms of the resolution. The power of resolution is investigated on two aspects by Matlab random simulation. One is the partition array sources localization, and the other is angle resolution testing. They are checked at four frequencies (800 Hz, 1200 Hz, 2000 Hz, and 3000 Hz) for two static harmonic sources with the same level at different positions. The positions of two sources are characterized by the azimuth angle and elevation angles: one position is (45°, 45°) and the other one is (−45°, −45°).

In this study, the partition array is optimized, which with the geometry is shown in Figure 7(c). The same tests are performed using wheel array and pizza array, which are shown in Figures 7(a) and 7(b), respectively, to compare a source resolution. In each case, white noise is added in the simulated acoustic signals with a signal-to-noise ratio (SNR) of 10 dB and performs 2000 simulations.

Localized acoustic source maps of partition array are plotted in Figure 9. In the figure, two source positions are depicted using red points on four source maps. Blue portions under those red points are the calculated source position based on the measured data of the partition array. The area of the blue portion is approximately proportional to the magnitude of the source. As shown in Figure 9(a), the area of the blue portion is bigger than others in Figures 9(b)9(d). This is because the measurement at this time is at low frequency (800 Hz). The main lobe width is wide, and the sidelobe rejection ratio is small. However, as the frequency increases, the array performance keeps getting better and better. That is to say, areas of blue portions get smaller. This phenomenon has a good agreement with the simulation results shown in Figures 6 and 8. From Figures 9(a)9(d), we can see that partition array is clearly good in terms of source location accuracy and suppresses the “ghost” of universal source locations. Therefore, it can be concluded that the partition array exhibits a superior ability to separate sources especially for broadband sources.

On the other hand, angle resolution is proposed to further assess the source resolution. The angle resolution is defined by the minimum angle to distinguish the two sources. This procedure is conducted by Matlab simulation. A source is randomly fixed, and the additional source approaches the first source with an angular step size. When the angular step size is less than determined angle, the two sources roll up into one. In this case, the determined angle is the smallest angle. To simplify, we introduce the separation rate to estimate the resolution of three optimized arrays at frequencies of 800 Hz, 1200 Hz, 2000 Hz, and 3000 Hz, respectively. The simulation results are plotted in Figures 10(a)10(d). The first source is placed on (45°, 45°) of the azimuth and elevation plane. The second source is approaching to the first source with the angular step size of 1.8°.

In Figure 10, the values of y-coordinate indicate the percentage of effectual separation of two sources. For example, 1 means 100% separation of two sources. As we can see from Table 3, it is the smallest angle to separate two sources of three arrays at different frequencies, and the partition array has the smallest separated angle at all frequencies. Compared with wheel array and pizza array, the partition array has a significant advantage at frequency of 800–2000 Hz to separate two sources that are close to each other. Furthermore, from Figures 10(a)10(d), the separation rates of all arrays increase as frequencies increase, especially at 3000 Hz. While all arrays separation rates are higher than ever before, partition array and pizza array have the same smallest separated angle (3.6°). However, the angle separation rate of partition array is higher than that of pizza array. Although the incident wavelength becomes short as the frequency increases, the array aperture becomes larger and the WM becomes narrower. Then, its resolution is improved. The results show that the partition array provides the best source resolution overall the whole bandwidth than the others.

4.4. Doppler Effect Correction

In this part, the partition array is examined in terms of the ability to correct the Doppler effect. Since the Doppler frequency in the de-Dopplerised spectrum is related to the separation ability and variation of the arraypattern, a Matlab simulation is proposed for estimating the emitting frequency of a moving source at the different frequency. We use the bandwidth of the frequency shift around the center frequencies of 800 Hz, 1200 Hz, and 2000 Hz for wheel array, pizza array, and partition array to compare the performance. The source trajectory is at the 7.5 m distance from the array plane. The position of the source is randomly placed. The source moves with the speed of 120 km h−1 along a straight line which is parallel to the source plane. The center of array is placed at point (16.5, 7.5, 0). Starting point of source movement is the coordinate zero point (0, 0, 0). The signal sample length of microphones is 1 s, and the sampling frequency is 8192 Hz. During signal processing, white noise is added to the simulated acoustic signals with a signal-to-noise ratio (SNR) of 10 dB. In this paper, we use the median spectrum to estimate the frequency shift for wheel array, pizza array, and partition array at frequencies of 800 Hz, 1200 Hz, and 2000 Hz, respectively. The results are plotted in Figure 11.

Doppler effect in motion source recognition is reflected in two aspects, which are amplitude change and frequency shift. In order to further evaluate the advantages of the partition array in frequency shift, amplitude correction was made to the collected data in the process of de-Dopplerised, and frequency shift information was retained. The processed results are presented in Figure 11. As can be seen from Figures 11(a)11(c), the processing results of the partition array have the best consistency with that of the sound source at a different frequency. Although the measurements from other arrays were also processed similarly, their measured amplitude differed greatly from the actual amplitude of the sound source. In addition, to further illustrate this point, the partition array was used to localize one moving source at 800 Hz with the speed of 120 km·h−1. The acoustic maps at different seconds are shown in Figure 12. In Figures 12(a)12(d), the red points depict the source positions at every second, and blue portions under each red point are the calculated source position based on the processing data of the partition array. The area of the blue portion is approximately proportional to the magnitude of the source. At every second, the partition array exhibits a good source position fit. This reflects the invariance of the array beampattern. From this point of view, the partition array has the merit of measuring the moving sound source.

On the other hand, the median spectrum is in place to estimate array performance on the aspect of the frequency shift. Here, the median spectrum is characterized as the average value of the processing spectrums. As shown in Figures 11(a)11(c), the partition array has the narrowest frequency shift bandwidth at every frequency. When compared with the wheel array and pizza array, the frequency shift width of the partition array decreases by about 40% of their shift width. The wheel array and pizza array have almost the same characteristics in the frequency shift. The results show that the partition array configuration optimization may enhance the invariance of array beampattern and a better correction of the Doppler effect in the measure of motion source.

5. Experiments

In this section, we have conducted experiments to validate the proposed array configuration method. The experimental setup photo is shown in Figure 13(a). A partition array was set up 7.5 m from the car surface. The array center is at point (16.5, 0, 1.2). The x-axis is horizontal, parallel to the line along with the car moved, the y-axis is vertical, pointing downward to the ground, and the z-axis is horizontal, pointing toward the car.

Figure 13(b) shows the microphone array in detail. Microphone array configuration was built based on the simulation results in Section 4. There are 64 microphones and 10 rings, and each ring radius was the same as the optimal partition array shown in Figure 7(c). The array aperture is 2 m. A microphone was placed at the center of the array.

During the measurement, the wind speed was less than 3 m s−1, and there was no rain or snow. The sample frequency was 8192 Hz, and the sampling length was 1 s. The data were processed with a moving focus beamforming algorithm. This algorithm was based on the assumption that a source was moving in front of the array of microphones on an arbitrary trajectory with variable speed. The basic idea was to focus on an assumed source position on reconstruction plane.

There are two important experiments. Firstly, the examination of the localization of one moving source at 800 Hz with the speed of 120 km·h−1 using partition array. A loudspeaker was placed in a moving car near the left rear side window, which is generating 800 Hz tone and the amplitude is 74 dB. Figure 14 shows localization results for loudspeaker at different moment.

In Figure 14, the color maps are utilized to show the amplitude variation from 72 dB to 74 dB, which corresponds to color scale from blue to red. As can be observed in the parts (a) to (d) of Figure 14, the areas of acoustic maps are small. It covers the loudspeaker almost completely at every second. Moreover, the maps are quite clear and not “ghost” sources appeared. This means that acoustic maps can focus on the source position accurately. Compared with the numerical simulating results in Figure 12, the results of moving source localization have a good agreement at the 800 Hz.

Secondly, to assess the resolution of the partition array, two loudspeakers were placed in a moving car near the left front and rear side window, respectively. The distance between them is the smallest separated angle. It is 3.6° at high frequencies of 3000 Hz, which is shown in Table 3. Within a meter scale, 3.6° equals to 0.0628 m. Loudspeakers generate two sound waves with the same frequency of 3000 Hz and sound pressure level of 67 dB. The localization results are shown in Figure 15. As showed in the figure, the acoustic map areas of two sources are separated and not “ghost” sources appeared around them. The maps cover the loudspeakers almost completely. It is demonstrated that the partition array can locate and separate two closer sources clearly.

These experiments have highlighted that the localization of moving sources can be improved using partition array. Interference has been eliminated, and the localization seems more accurate. The experiment results verified the performance found on the simulation.

6. Conclusions

A novel irregular array configuration method by utilizing partition filtering was put forward in this paper. Partition filtering conditions, which takes into account the sound field characteristics and array properties, were determined to quantify the relationship between the array configuration and performance. Those relations are significant to resolve the inevitable variance of array beampattern and the Doppler effect, especially for making moving wideband sources measure possible. The problem of array configuration was formulated into position matrix determinants and solved fast by employing the adaptive partition algorithm.

This method involves many practical advantages. Adaptive partition procedure and quantitative relations help to preselect array configurations. This can significantly reduce the computation speed, and it is more suitable for array build especially large number microphones. Array irregular layout can effectively extend array aperture and reduce redundant spatial samplings to achieve superior array performance, such as high resolution (narrowest MW) and interference suppression (the larger MSL and SSR) by adjusting grid spacings and partition number. Meanwhile, the array configuration can be easily generated with optimal MW, MSL, and SSR characteristics for a specific acoustic test. It is fitted to achieve high quality beamformer outputs for a small-scale array over a wideband. In addition, since partition filtering can sufficiently capture the spatial variation of beampattern by an adaptive procedure, the partition array has stronger capacity of Doppler effect correction. It is more suitable for the localization of moving sources.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest regarding the publication of this paper.

Acknowledgments

This research was funded by the National Natural Science Foundation of China, grant nos. 61871447, 61671262, and 51806114.