English

Acoustic Models

| Model | Sample Rate (kHz) | Tokenizer | Dataset | 
|---|---|---|---|
| FastSpeech2 EN v2 | 22.05 | Character-based | Azure | 
| FastSpeech2 EN v3 | 44.10 | g2p_en (ARPA) | Azure | 
| FastSpeech2 MFA EN v2 | 22.05 | g2p_en (ARPA) | Azure | 
| FastSpeech2 MFA EN v3 | 44.10 | gruut (IPA) | Azure | 
| FastSpeech2 MFA EN v4 | 44.10 | gruut (IPA) | Azure (Mastered) | 
| FastSpeech2 MFA EN ESD Angry | 44.10 | gruut (IPA) | Emotional Speech Dataset - Angry | 
| LightSpeech MFA EN | 44.10 | gruut (IPA) | Azure (Mastered) | 
| LightSpeech MFA EN v2 | 44.10 | gruut (IPA) | Azure (Mastered) | 
| LightSpeech MFA EN v3 | 44.10 | gruut (IPA) | Azure (Mastered) | 
| LightSpeech MFA EN ESD | 44.10 | gruut (IPA) | Emotional Speech Dataset - 0013 | 

Vocoder Models

| Model | Sample Rate (kHz) | Dataset | 
|---|---|---|
| MB-MelGAN EN | 22.05 | Azure | 
| MB-MelGAN HiFi EN | 22.05 | Azure | 
| MB-MelGAN HiFi PostNets EN | 22.05 | Azure | 
| MB-MelGAN HiFi PostNets EN v2 | 22.05 | Azure | 
| MB-MelGAN HiFi PostNets EN v3 | 44.10 | Azure | 
| MB-MelGAN HiFi PostNets EN v5 | 44.10 | Azure | 
| MB-MelGAN HiFi PostNets EN v6 | 44.10 | Azure (Mastered) | 
| MB-MelGAN HiFi PostNets EN v7 | 44.10 | Azure (Mastered) | 
| MB-MelGAN HiFi PostNets EN v8 | 44.10 | Azure (Mastered) | 
| MB-MelGAN HiFi PostNets EN ESD Angry | 44.10 | Emotional Speech Dataset - Angry | 
| MB-MelGAN HiFi PostNets EN ESD | 44.10 | Emotional Speech Dataset - 0013 |
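
When pairing an acoustic model with a vocoder, the sample rates in the two tables generally need to agree, and a vocoder trained on the same dataset is usually the more natural choice. The sketch below illustrates one way to filter candidates using a few rows copied from the tables above. The pairing rule and the helper name `compatible_vocoders` are assumptions made for illustration, not something this project defines.

```python
# A few rows copied from the tables above: (name, sample rate in kHz, dataset).
ACOUSTIC_MODELS = [
    ("FastSpeech2 EN v2", 22.05, "Azure"),
    ("LightSpeech MFA EN v2", 44.10, "Azure (Mastered)"),
    ("LightSpeech MFA EN ESD", 44.10, "Emotional Speech Dataset - 0013"),
]
VOCODERS = [
    ("MB-MelGAN EN", 22.05, "Azure"),
    ("MB-MelGAN HiFi PostNets EN v3", 44.10, "Azure"),
    ("MB-MelGAN HiFi PostNets EN v8", 44.10, "Azure (Mastered)"),
    ("MB-MelGAN HiFi PostNets EN ESD", 44.10, "Emotional Speech Dataset - 0013"),
]


def compatible_vocoders(acoustic_name):
    """Hypothetical helper: list vocoders with the same sample rate as the
    given acoustic model, placing same-dataset vocoders first."""
    by_name = {name: (rate, dataset) for name, rate, dataset in ACOUSTIC_MODELS}
    rate, dataset = by_name[acoustic_name]
    # Assumption: a matching sample rate is the minimum requirement for pairing.
    matches = [v for v in VOCODERS if v[1] == rate]
    # Stable sort: False (same dataset) sorts before True (different dataset).
    matches.sort(key=lambda v: v[2] != dataset)
    return [name for name, _, _ in matches]


if __name__ == "__main__":
    for name, _, _ in ACOUSTIC_MODELS:
        print(name, "->", compatible_vocoders(name))
```

Extending the two lists with the remaining table rows would cover every model; whether a given pair actually sounds good still has to be checked by listening.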