Research Article
ASLNet: An Encoder-Decoder Architecture for Audio Splicing Detection and Localization
Table 1
Illustration of audio clips in each dataset.
| Dataset | Language | Duration (seconds) | Num. of audio clips | Original | Spliced | Total |
| ENSet2s | English | 2 | 9,898 | 15,173 | 25,071 | ENSet3s | English | 3 | 4,089 | 19,783 | 23,872 | CNSet2s | Chinese | 2 | 44,727 | 86,073 | 130,800 | CNSet3s | Chinese | 3 | 44,669 | 85,865 | 130,534 |
|
|