Indonesian
Acoustic Models
| Model | SR (kHz) | Tokenizer | Dataset | 
|---|---|---|---|
| FastSpeech2 ID | 22.05 | Character-based | Azure + WaveNet + Weildan | 
| FastSpeech2 MFA ID v4 | 44.10 | g2p_id (IPA) | Weildan | 
| FastSpeech2 MFA ID v5 | 44.10 | g2p_id (IPA) | Weildan (Mastered) | 
| FastSpeech2 MFA ID v7 | 44.10 | g2p_id (IPA) | Azure | 
| LightSpeech MFA ID | 44.10 | g2p_id (IPA) | Azure | 
| LightSpeech MFA ID v2 | 22.05 | g2p_id (IPA) | Azure | 
| LightSpeech MFA ID v3 | 32.00 | g2p_id (IPA) | Azure | 
| LightSpeech MFA ID v5 | 44.10 | g2p_id (IPA) | Althaf | 
| Tacotron2 ID | 22.05 | Character-based | Azure | 
| Tacotron2 ID v2 | 44.10 | Character-based | Azure | 
Vocoder Models
| Model | SR (kHz) | Dataset | 
|---|---|---|
| MB-MelGAN HiFi PostNets ID | 22.05 | Azure + WaveNet + Weildan | 
| MB-MelGAN HiFi PostNets ID v4 | 44.10 | Weildan | 
| MB-MelGAN HiFi PostNets ID v5 | 44.10 | Weildan (Mastered) | 
| MB-MelGAN HiFi PostNets ID v7 | 44.10 | Azure | 
| MB-MelGAN HiFi PostNets ID v8 | 44.10 | Azure | 
| MB-MelGAN HiFi PostNets ID v9 | 22.05 | Azure | 
| MB-MelGAN HiFi PostNets ID v10 | 32.00 | Azure | 
| MB-MelGAN HiFi PostNets ID v12 | 44.10 | Althaf |