Research Article
A Voice Cloning Method Based on the Improved HiFi-GAN Model
Table 2
Speaker encoder model parameters based on x-vector.
| Parameter | Value |
| --- | --- |
| Initial learning rate | 0.0001 |
| Model embedding size | 256 |
| Model hidden layer size | 256 |
| Model layers | 3 |
| Speaker batch size | 32 |
| Number of utterances per speaker | 10 |
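
As a minimal sketch, the settings in Table 2 could be collected in a single configuration object. The class name `SpeakerEncoderConfig` and all field names are hypothetical and chosen here for illustration. The `utterances_per_batch` property assumes that each training batch contains every sampled speaker's utterances (speakers × utterances per speaker), which the table does not state explicitly.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SpeakerEncoderConfig:
    """Hypothetical container for the Table 2 hyperparameters."""

    initial_learning_rate: float = 1e-4  # "Initial learning rate"
    embedding_size: int = 256            # "Model embedding size"
    hidden_size: int = 256               # "Model hidden layer size"
    num_layers: int = 3                  # "Model layers"
    speakers_per_batch: int = 32         # "Speaker batch size"
    utterances_per_speaker: int = 10     # "Number of utterances per speaker"

    @property
    def utterances_per_batch(self) -> int:
        # Assumption: a batch holds all utterances of all sampled speakers.
        return self.speakers_per_batch * self.utterances_per_speaker


config = SpeakerEncoderConfig()
print(config.utterances_per_batch)  # 32 * 10 = 320
```

Freezing the dataclass keeps the values immutable during a run, so any change to a hyperparameter has to be made explicitly by constructing a new configuration.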