HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Paper
•
2010.05646
•
Published
A standalone implementation of HiFiGAN vocoder for Arabic text-to-speech, based on the paper "HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis" (https://arxiv.org/pdf/2010.05646.pdf).
from hifigan_ar_v2 import HiFiGANArabicGenerator
import torch
# Load the model
model = HiFiGANArabicGenerator.from_pretrained("generator.ckpt", "config.json")
# Generate audio from mel spectrogram
mel = torch.rand(1, 80, 122) # Example mel spectrogram
audio = model(mel) # Shape: [1, 1, 8448]