Research Article
Utterance Clustering Using Stereo Audio Channels
Figure 2
t-SNE visualization of the feature vectors (d-vectors) of seven speakers when the audio contains overlapping speech. Different colors represent different speakers. The panels show the d-vector clusters for (a) the speakers' mono signals, (b) the mstack-processed signals, (c) the hstack-processed signals, and (d) the sumdif-processed signals.