Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

JCTN
/
MMAudio

video-to-audio
Model card Files Files and versions
xet
Community
1

This repository contains the model described in Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis.

Code: https://github.com/hkchengrex/MMAudio.

autor: https://huggingface.co/hkchengrex/MMAudio

Repositorio: https://github.com/Navezjt/MMAudio

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Paper for JCTN/MMAudio

Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Paper • 2412.15322 • Published Dec 19, 2024 • 20
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs