speech-synthesis

Vietnamese Voice Conversion

Overview This thesis develops a voice conversion model for Vietnamese based on the Phoneme Hallucinator model with 2 adoptions: (1) Add a Text2SSL module to get more context information before performing the KNN algorithm, (2) To create a more diverse dataset we apply spectrogram-resize (SR) based data augmentation idea from Free-VC model which distorts speaker information without changing content information to generate more ”speakers”.

Minh Nguyen Le

Mar 9, 2024 1 min read Speech, speech-synthesis, voice-conversion

Vietnamese Voice Conversion

KNN-VC vs Phoneme Hallucinator [09/03/2024] ?

Overview Comparing different methods This section compares Phoneme Hallucinator kNN-VC and Phoneme Hallucinator. Source Target kNN-VC Phoneme Hallucinator

Minh Nguyen Le

Last updated on Aug 25, 2022 1 min read speech-synthesis, voice-conversion

KNN-VC vs Phoneme Hallucinator [09/03/2024] ?

KNN-VC vs Phoneme Hallucinator [23/03/2024] ?

Overview Comparing different methods This section compares Phoneme Hallucinator kNN-VC and Phoneme Hallucinator. Source Target Phoneme Hallucinator Phoneme Hallucinator + Text2SSL

Minh Nguyen Le

Mar 9, 2022 1 min read speech-synthesis, voice-conversion

KNN-VC vs Phoneme Hallucinator [23/03/2024] ?