Research Article

An Adaptive Method Based on Multiscale Dilated Convolutional Network for Binaural Speech Source Localization

Table 3

Localization accuracy (%) of different approaches in the noisy and reverberant scenes.

RT60/DRR0.1 s/−1.44 dB0.3 s/−2.02 dB0.5 s/−2.58 dB
Noise/SNRAvg.-/-White/15 dB-/-White/15 dB-/-White/15 dB

MLP [8]28.8743.2424.4633.4224.1923.8424.05
DNN [19]67.6992.1478.1174.9453.5163.8143.65
Regular CNN61.4085.2679.7358.2352.1649.4043.65
Dilation-2 CNN57.6977.1575.4156.0250.1443.7443.65
Dilation-5 CNN84.0394.5989.4692.1475.9586.6265.41
Cascaded DCNN73.1691.1577.8484.5256.6279.2549.59
Ours 78.8693.1287.9783.7871.0876.5060.68
Ours 83.4894.5989.0590.6677.7085.0863.81