Research Article

Identification of Weakly Pitch-Shifted Voice Based on Convolutional Neural Network

Table 2

Detection performance of strongly pitch-shifted voice in binary classification.

Pitch shifting softwareTraining datasetTesting datasetDetecting method
[6] LFCC + GMM[8] MFCC + GMMProposed
RateFARRateFARRateFAR

AuditionTIMITTIMIT99.860.0299.880.0299.540.10
TIMITUME97.601.1098.061.1995.891.52
UMETIMIT99.520.3698.580.0297.511.45
UMEUME99.790.1599.790.1299.490.12

GoldWaveTIMITTIMIT99.970.0099.940.0199.580.05
TIMITUME97.930.7596.822.0496.291.53
UMETIMIT99.720.0598.450.0198.441.17
UMEUME99.870.0299.700.0799.120.36

AudacityTIMITTIMIT99.980.0099.970.0099.970.00
TIMITUME99.130.4497.572.1099.780.07
UMETIMIT99.970.0198.720.0099.960.01
UMEUME99.970.0099.950.0099.840.11

Bold values represent the best performance under same circumstances (in same row) of the three methods. For criteria detection rate (Rate), higher is better. For criteria false alarm rate (FAR), lower is better.