๐Ÿ‡ฆ๐Ÿ‡ช Emirati FastPitch v1.0

A fine-tuned FastPitch text-to-speech model for Emirati Arabic with Arabicโ€“English bilingual code-switching support.
Designed for natural prosody, dialectal Emirati pronunciation, and seamless AR/EN mixing in real-world speech.

๐Ÿ”Š Audio Samples

๐Ÿ‡ฆ๐Ÿ‡ช Full Emirati Arabic

Reference text:
ุดูˆ ุฑุฃูŠูƒ ู„ูˆ ู†ุนู…ู„ ูˆุฑุดุฉ ุตุบูŠุฑุฉ ู„ู„ุฃุทูุงู„ ุจุงู„ู…ุฏุฑุณุฉ ุนู† ุงู„ูู†ูˆู† ุงู„ูŠุฏูˆูŠุฉุŒ ุชุฑู‰ ุงู„ุฃุทูุงู„ ูŠุญุจูˆู† ุงู„ุญุฑูƒุงุช ู‡ุฐูŠ ูˆุจู†ุดุฌุนู‡ู… ุนู„ู‰ ุงู„ุฅุจุฏุงุนุŒ ุฅู†ุช ู…ูˆุงูู‚ ุชุณุงุนุฏู†ุง ุจุงู„ู…ูˆุงุฏ ูˆุงู„ุฃููƒุงุฑ ูˆู„ุง ู…ุจ ูุงุถูŠ


๐ŸŒ Emirati + English (Code-Switching)

Reference text:
ุงู„ูŠูˆู… ุฌู„ุณู†ุง ูˆู‚ุณู…ู†ุง ุงู„ู…ู‡ุงู… ููŠ ุงู„ู€ Project Management ุนุดุงู† ู†ุญุงูุธ ุนู„ู‰ ุงู„ู…ุญู…ูŠุงุชุŒ ุงู„ู€ AI ูŠุณุงุนุฏ ููŠ ุฑุตุฏ ุงู„ุญูŠูˆุงู†ุงุช ูˆุงู„ุทูŠูˆุฑ ูˆุงู„ู€ Dashboard ูŠุนุทูŠู†ุง ุชู†ุจูŠู‡ุงุชุŒ ุดูˆ ุชุชูˆู‚ุนูˆู† ุจูŠุตูŠุฑ ู„ูˆ ู…ุง ุณูˆูŠู†ุง Update ุณุฑูŠุนุŒ ุชุฑู‰ ุงู„ูˆุถุน ู…ู‡ู… ูˆุงูŠุฏ


๐Ÿ‡บ๐Ÿ‡ธ English Only

Reference text:
Basically, pair programming feels awkward at first, but you know, it actually speeds up problem solving and learning

๐Ÿ’ก All samples were synthesized using Emirati FastPitch v1.0 + Emirati HiFi-GAN bilingual vocoder at 22,050 Hz.

โœจ Features

  • Emirati Dialect Focus: Trained on authentic Emirati Arabic speech
  • Bilingual Phonemes: AR_ and EN_ prefixed phoneme inventory
  • Code-Switching Ready: Natural transitions between Arabic and English
  • Custom Tokenizer: Space-delimited, multi-character phoneme tokens
  • High Prosody Control: Duration + pitch prediction (FastPitch)

๐Ÿง  Model Details

Item Value
Model Type FastPitch
Framework NVIDIA NeMo
Sample Rate 22,050 Hz
Epochs 1,059
Training Data 10+ hours (Emirati Arabic + English)
Phoneme Inventory 200+ AR_/EN_ tokens

๐Ÿ” Vocoder (Required)

This model is designed to be used with:

Emirati HiFi-GAN Bilingual v1.0
https://huggingface.co/vadimbelsky/emirati-hifigan-bilingual-v1.0

The vocoder is trained on matching mel-spectrogram configuration, ensuring:

  • clean consonants
  • stable Arabic emphatics
  • natural English sibilants
  • smooth AR/EN transitions

๐Ÿš€ Quick Start (NeMo)

1. Load FastPitch

from nemo.collections.tts.models import FastPitchModel

fastpitch = FastPitchModel.from_pretrained(
    "vadimbelsky/emirati-fastpitch-bilingual-v1.0"
)
fastpitch.eval()
  1. Convert Text โ†’ Phonemes
from emirati_phoneme_converter import EmiratiPhonemeConverter

converter = EmiratiPhonemeConverter()
phonemes = converter.convert(
    "ุงู„ูŠูˆู… ุนู†ุฏู†ุง meeting ู…ู‡ู… with the AI team"
)
  1. Generate Mel-Spectrogram
tokens = fastpitch.parse(phonemes)
mel = fastpitch.generate_spectrogram(tokens=tokens)
  1. Synthesize Audio with HiFi-GAN
from nemo.collections.tts.models import HifiGanModel

vocoder = HifiGanModel.from_pretrained(
    "vadimbelsky/emirati-hifigan-bilingual-v1.0"
)
vocoder.eval()

audio = vocoder.convert_spectrogram_to_audio(mel)

๐ŸŒ Supported Languages โ€ข Emirati Arabic (dialect) โ€ข English (US-style phonemes) โ€ข Mixed Arabicโ€“English speech (code-switching)

๐Ÿ“œ License

This model is released under CC-BY-4.0.

๐Ÿ“Œ Citation

Emirati FastPitch v1.0 โ€“ Bilingual Emirati Arabic TTS Vadim Belsky, 2026

Downloads last month
2
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for vadimbelsky/emirati-fastpitch-bilingual-v1.0

Unable to build the model tree, the base model loops to the model itself. Learn more.

Evaluation results