๐ฆ๐ช Emirati FastPitch v1.0
A fine-tuned FastPitch text-to-speech model for Emirati Arabic with ArabicโEnglish bilingual code-switching support.
Designed for natural prosody, dialectal Emirati pronunciation, and seamless AR/EN mixing in real-world speech.
๐ Audio Samples
๐ฆ๐ช Full Emirati Arabic
Reference text:
ุดู ุฑุฃูู ูู ูุนู
ู ูุฑุดุฉ ุตุบูุฑุฉ ููุฃุทูุงู ุจุงูู
ุฏุฑุณุฉ ุนู ุงููููู ุงููุฏููุฉุ ุชุฑู ุงูุฃุทูุงู ูุญุจูู ุงูุญุฑูุงุช ูุฐู ูุจูุดุฌุนูู
ุนูู ุงูุฅุจุฏุงุนุ ุฅูุช ู
ูุงูู ุชุณุงุนุฏูุง ุจุงูู
ูุงุฏ ูุงูุฃููุงุฑ ููุง ู
ุจ ูุงุถู
๐ Emirati + English (Code-Switching)
Reference text:
ุงูููู
ุฌูุณูุง ููุณู
ูุง ุงูู
ูุงู
ูู ุงูู Project Management ุนุดุงู ูุญุงูุธ ุนูู ุงูู
ุญู
ูุงุชุ ุงูู AI ูุณุงุนุฏ ูู ุฑุตุฏ ุงูุญููุงูุงุช ูุงูุทููุฑ ูุงูู Dashboard ูุนุทููุง ุชูุจููุงุชุ ุดู ุชุชููุนูู ุจูุตูุฑ ูู ู
ุง ุณูููุง Update ุณุฑูุนุ ุชุฑู ุงููุถุน ู
ูู
ูุงูุฏ
๐บ๐ธ English Only
Reference text:
Basically, pair programming feels awkward at first, but you know, it actually speeds up problem solving and learning
๐ก All samples were synthesized using Emirati FastPitch v1.0 + Emirati HiFi-GAN bilingual vocoder at 22,050 Hz.
โจ Features
- Emirati Dialect Focus: Trained on authentic Emirati Arabic speech
- Bilingual Phonemes: AR_ and EN_ prefixed phoneme inventory
- Code-Switching Ready: Natural transitions between Arabic and English
- Custom Tokenizer: Space-delimited, multi-character phoneme tokens
- High Prosody Control: Duration + pitch prediction (FastPitch)
๐ง Model Details
| Item | Value |
|---|---|
| Model Type | FastPitch |
| Framework | NVIDIA NeMo |
| Sample Rate | 22,050 Hz |
| Epochs | 1,059 |
| Training Data | 10+ hours (Emirati Arabic + English) |
| Phoneme Inventory | 200+ AR_/EN_ tokens |
๐ Vocoder (Required)
This model is designed to be used with:
Emirati HiFi-GAN Bilingual v1.0
https://huggingface.co/vadimbelsky/emirati-hifigan-bilingual-v1.0
The vocoder is trained on matching mel-spectrogram configuration, ensuring:
- clean consonants
- stable Arabic emphatics
- natural English sibilants
- smooth AR/EN transitions
๐ Quick Start (NeMo)
1. Load FastPitch
from nemo.collections.tts.models import FastPitchModel
fastpitch = FastPitchModel.from_pretrained(
"vadimbelsky/emirati-fastpitch-bilingual-v1.0"
)
fastpitch.eval()
- Convert Text โ Phonemes
from emirati_phoneme_converter import EmiratiPhonemeConverter
converter = EmiratiPhonemeConverter()
phonemes = converter.convert(
"ุงูููู
ุนูุฏูุง meeting ู
ูู
with the AI team"
)
- Generate Mel-Spectrogram
tokens = fastpitch.parse(phonemes)
mel = fastpitch.generate_spectrogram(tokens=tokens)
- Synthesize Audio with HiFi-GAN
from nemo.collections.tts.models import HifiGanModel
vocoder = HifiGanModel.from_pretrained(
"vadimbelsky/emirati-hifigan-bilingual-v1.0"
)
vocoder.eval()
audio = vocoder.convert_spectrogram_to_audio(mel)
๐ Supported Languages โข Emirati Arabic (dialect) โข English (US-style phonemes) โข Mixed ArabicโEnglish speech (code-switching)
๐ License
This model is released under CC-BY-4.0.
๐ Citation
Emirati FastPitch v1.0 โ Bilingual Emirati Arabic TTS Vadim Belsky, 2026
- Downloads last month
- 2
Model tree for vadimbelsky/emirati-fastpitch-bilingual-v1.0
Unable to build the model tree, the base model loops to the model itself. Learn more.
Evaluation results
- MOS on Your Eval Dataset Nameself-reportedyour-mos-score