Research Article

Multitask Learning with Local Attention for Tibetan Speech Recognition

Table 3

Dialect ID recognition accuracy (%) of two-task models.

ArchitectureModelLhasa-Ü-TsangChangdu-KhamAmdo Pastoral

DialectID model97.8892.2497.9
WaveNet-CTC with dialect IDD-S98.5795.2399.6
S-D99.0197.6199.41

Attention (5)-WaveNet-CTCD-S10089.2894.52
S-D000

WaveNet-Attention (5)-CTCD-S10098.899.41
S-D10094.0498.06