Created 02/03/2020 at 11:50PM
- SqueezeWave: Extremely Lightweight Vocoders for On-device Speech Synthesis (github)
- Mellotron: Multispeaker expressive voice synthesis by conditioning on rhythm, pitch and global style tokens (github)
- Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram (github)
- High Fidelity Speech Synthesis with Adversarial Networks (arxiv)