Voice Clone Multilingual TTS

Speaker Selection
0.1 1
0.5 2

Voice Cloning Guidelines:

  • Use around 7-10 seconds of clear, noise-free audio
  • For transcription interface will use Whisper turbo to transcribe the audio file
  • Longer audio clips will reduce maximum output length
  • Custom speaker overrides speaker selection