Low Resource Text-to-Speech Using Specific Data and Noise Augmentation
Multi-Speaker Text-to-Speech Using ForwardTacotron with Improved Duration Prediction
Low-Resource Text-to-Speech Synthesis Using Noise-Augmented Training of ForwardTacotron
Benchmarking Neural Speech Codec Intelligibility with SITool