Research Article

An Adaptive Method Based on Multiscale Dilated Convolutional Network for Binaural Speech Source Localization

Table 2

Localization accuracy (%) of different approaches in additively noisy environments.

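| Approach | Avg. | −10 dB, White | −10 dB, F16 | −10 dB, M109 | 0 dB, White | 0 dB, F16 | 0 dB, M109 | 10 dB, White | 10 dB, F16 | 10 dB, M109 | 20 dB, White | 20 dB, F16 | 20 dB, M109 | 30 dB, White | 30 dB, F16 | 30 dB, M109 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MLP [8] | 83.77 | 62.93 | 53.21 | 67.41 | 72.65 | 71.37 | 82.37 | 81.62 | 86.75 | 95.51 | 89.42 | 96.26 | 99.15 | 98.18 | 99.89 | 99.89 |
| DNN [19] | 82.56 | 43.16 | 38.25 | 53.42 | 70.30 | 57.16 | 86.32 | 97.65 | 92.31 | 100.0 | 99.89 | 99.89 | 100.0 | 100.0 | 100.0 | 100.0 |
| Regular CNN | 84.65 | 54.38 | 41.99 | 65.60 | 73.61 | 69.76 | 86.43 | 89.96 | 90.81 | 99.04 | 98.72 | 99.79 | 99.79 | 99.89 | 100.0 | 100.0 |
| Dilation-2 CNN | 87.46 | 45.30 | 54.17 | 75.75 | 70.94 | 77.67 | 97.76 | 97.33 | 93.59 | 99.68 | 99.89 | 99.89 | 100.0 | 100.0 | 100.0 | 100.0 |
| Dilation-5 CNN | 90.14 | 62.61 | 54.17 | 80.34 | 83.55 | 75.85 | 99.15 | 97.86 | 98.61 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 |
| Cascaded DCNN | 89.62 | 57.05 | 54.38 | 87.61 | 76.92 | 76.50 | 99.25 | 95.51 | 97.22 | 100.0 | 99.79 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 |
| Ours | 89.34 | 59.83 | 47.54 | 78.63 | 84.19 | 74.47 | 98.61 | 98.40 | 98.50 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 |
| Ours | 91.85 | 68.16 | 56.62 | 90.06 | 86.00 | 80.24 | 99.36 | 98.61 | 98.72 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 |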