Research Article

Speech Attribute Detection to Recognize Arabic Broadcast Speech in Industrial Networks

Table 5

Best models of all Arabic attributes for both manner and place of articulation, with DBN-DNNs or CNNs with hidden layers from 1 up to 10 and 512 or 1024 units in each layer.

AttributeCNNCNNDBN-DNNDBN-DNN

Classes

Manner of articulation
 Stop12.229.1413.1112.34
 Affricates11.1010.2112.1711.44
 Fricatives13.109.3312.8113.14
 Nasals13.149.8012.6213.11
 Laterals12.1210.7113.5111.80
 Trills11.0010.5210.1210.55
 Approximants11.319.1112.3311.61
Place of articulation

 Labial13.2013.7014.1114.31
 Labiodental11.3512.9813.1714.44
 Interdental11.4212.1114.8114.86
 Alveolar12.1213.9714.3614.50
 Palatal13.1113.9813.8113.95
 Velar13.1313.2213.6214.11
 Uvular12.6112.6614.6214.22
 Pharyngeal11.5112.7113.6213.55
 Glottal13.1013.8814.2214.55