TADA: A Generative Framework for Speech Modeling via Text-Acoustic Dual Alignment

Text-Acoustic Dual-Alignment Large Language Model

TADA is a unified speech-language model that synchronizes speech and text into a single, cohesive stream via 1:1 alignment. Through a novel tokenizer and architectural design, TADA achieves high-fidelity speech synthesis and generation at a fraction of the computational overhead of traditional models.
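
To make the idea of a 1:1 aligned stream concrete, here is a minimal toy sketch in Python. It is not the TADA tokenizer or its API: the names `AlignedStep`, `mean_pool`, and `align_one_to_one` are hypothetical, and mean pooling is only a stand-in, assumed for illustration, for whatever the actual tokenizer does. The sketch shows just the shape of the result: one acoustic vector per text token, so the combined stream is as long as the text rather than as long as the frame-rate audio.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class AlignedStep:
    """One position in the combined stream: a text token plus one acoustic vector."""
    token: str
    acoustic: List[float]


def mean_pool(frames: Sequence[Sequence[float]]) -> List[float]:
    """Collapse a variable number of frames into a single vector.

    Assumption: plain averaging, used here only as a placeholder.
    """
    if not frames:
        raise ValueError("each token needs at least one acoustic frame")
    dim = len(frames[0])
    return [sum(frame[i] for frame in frames) / len(frames) for i in range(dim)]


def align_one_to_one(
    tokens: Sequence[str],
    frames_per_token: Sequence[Sequence[Sequence[float]]],
) -> List[AlignedStep]:
    """Pair every text token with exactly one pooled acoustic vector."""
    if len(tokens) != len(frames_per_token):
        raise ValueError("need one group of frames per token")
    return [
        AlignedStep(token, mean_pool(frames))
        for token, frames in zip(tokens, frames_per_token)
    ]


if __name__ == "__main__":
    tokens = ["hello", "world"]
    # Hypothetical 2-dimensional frames: two frames for "hello", one for "world".
    frames = [[[1.0, 2.0], [3.0, 4.0]], [[5.0, 6.0]]]
    for step in align_one_to_one(tokens, frames):
        print(step.token, step.acoustic)
```

In this toy, three audio frames collapse into two stream positions, one per token. The sequence a language model would have to process then scales with the number of text tokens instead of the number of audio frames, which is one plausible reading of where the reduced computational overhead comes from.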

⭐️ arxiv: https://arxiv.org/abs/2602.23068
⭐️ demo1: https://huggingface.co/spaces/fffiloni/tada-dual-alignment-tts-demo
⭐️ demo2: https://huggingface.co/spaces/HumeAI/tada
⭐️ github: https://github.com/HumeAI/tada
⭐️ blog post: https://www.hume.ai/blog/opensource-tada
