DramaBox is an expressive text-to-speech and voice cloning project from Resemble AI built on top of the LTX-2.3 audio branch. It generates speech from prompts that control not only the spoken text, but also speaker identity, emotion, delivery style, laughs, sighs, pauses, and transitions. Users can optionally provide a voice reference of around 10 seconds or more to clone the target timbre while still guiding performance through scene-style prompting. The project includes a warm inference server, a CLI workflow, and a Gradio app for interactive generation. It also supports additional LoRA training on top of DramaBox, making it possible to adapt the model for a specific speaker, language flavor, or performance style. DramaBox is aimed at developers, researchers, and audio creators who need highly expressive English TTS for character dialogue, narrative audio, prototyping, or voice experimentation.

Features

  • Prompt-driven expressive TTS
  • Optional voice reference cloning
  • Emotion and delivery-style control
  • CLI, server, and Gradio workflows
  • LoRA fine-tuning support
  • Automatic neural audio watermarking

Project Samples

Project Activity

See All Activity >

Follow DramaBox

DramaBox Web Site

Other Useful Business Software
Go From AI Idea to AI App Fast Icon
Go From AI Idea to AI App Fast

One platform to build, fine-tune, and deploy ML models. No MLOps team required.

Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
Try Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of DramaBox!

Additional Project Details

Registered

2026-05-14