Abstract: Scaling Zero-shot Text-to-speech (TTS) to large-scale datasets has been demonstrated as an effective method for improving the diversity and naturalness of synthesized speech. At the high ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results