Research Article

A Voice Cloning Method Based on the Improved HiFi-GAN Model

Table 2

Speaker encoder model parameters based on x-vector.

Initial learning rate0.0001

Model embedding size256
Model hidden layer size256
Model layers3
Speaker batch size32
Number of utterances per speaker10