HiFiGAN Arabic Vocoder

A standalone implementation of HiFiGAN vocoder for Arabic text-to-speech, based on the paper "HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis" (https://arxiv.org/pdf/2010.05646.pdf).

Usage

from hifigan_ar_v2 import HiFiGANArabicGenerator
import torch

# Load the model
model = HiFiGANArabicGenerator.from_pretrained("generator.ckpt", "config.json")

# Generate audio from mel spectrogram
mel = torch.rand(1, 80, 122)  # Example mel spectrogram
audio = model(mel)  # Shape: [1, 1, 8448]

Model Details

Sample Rate: 22050 Hz
Input: Mel spectrogram (80 channels)
Output: Audio waveform (1 channel)

Downloads last month: 11

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Paper for muhtasham/hifigan-ar-v2

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Paper • 2010.05646 • Published Oct 12, 2020