Matcha-TTS is a non-autoregressive neural text-to-speech architecture that uses conditional flow matching to synthesize speech quickly without sacrificing naturalness. It treats speech generation as solving an ordinary differential equation (ODE), and training with conditional flow matching lets the solver reach high-quality audio in only a few synthesis steps, which cuts latency relative to diffusion approaches trained with score matching. The model is fully probabilistic, so it can produce different realizations of the same text while remaining stable and intelligible.

The repository provides an end-to-end TTS pipeline: a PyTorch/Lightning training stack, configuration files, pre-trained checkpoints, a command-line interface, and a Gradio app for interactive testing. Users can train on standard datasets such as LJSpeech or plug in their own corpora, with helper tools for computing dataset statistics, extracting phoneme durations, and running multi-GPU training.
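
To give a feel for why an ODE-based sampler can get away with few synthesis steps, here is a minimal toy sketch in plain Python. It is not the project's code: it assumes the straight-line probability path often paired with conditional flow matching, uses a closed-form conditional vector field in place of a trained network, and treats a three-value list as a stand-in for one mel-spectrogram frame. The names `conditional_vector_field`, `euler_sample`, `draw_noise`, and `SIGMA_MIN` are hypothetical.

```python
import random

SIGMA_MIN = 1e-4  # assumed small residual noise level at t = 1


def conditional_vector_field(x, t, x1, sigma_min=SIGMA_MIN):
    """Toy closed-form field that carries a noise sample x toward target x1.

    A real model would replace this with a neural network conditioned on text.
    """
    scale = 1.0 - (1.0 - sigma_min) * t
    return [(b - (1.0 - sigma_min) * a) / scale for a, b in zip(x, x1)]


def euler_sample(x0, x1, n_steps):
    """Integrate dx/dt = v(x, t) from t = 0 to t = 1 with fixed-step Euler."""
    x = list(x0)
    dt = 1.0 / n_steps
    for i in range(n_steps):
        t = i * dt
        v = conditional_vector_field(x, t, x1)
        x = [a + dt * b for a, b in zip(x, v)]
    return x


def draw_noise(dim, seed):
    """Draw a standard-normal starting point; the seed selects the realization."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(dim)]


target = [0.5, -1.0, 2.0]  # stand-in for one mel-spectrogram frame
x0 = draw_noise(len(target), seed=0)

coarse = euler_sample(x0, target, n_steps=2)    # very few solver steps
fine = euler_sample(x0, target, n_steps=50)     # many solver steps
other = euler_sample(draw_noise(len(target), seed=1), target, n_steps=2)

print("2 steps :", coarse)
print("50 steps:", fine)
print("new seed:", other)
```

In this idealized setting every trajectory is a straight line, so 2 Euler steps and 50 Euler steps end at the same point up to rounding, and a different noise seed yields a slightly different sample. A learned vector field is only approximately straight, so real systems still trade step count against quality.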

Features

  • Non-autoregressive TTS architecture based on conditional flow matching for fast synthesis
  • Probabilistic speech generation with natural-sounding, high-quality audio outputs
  • Ready-to-use CLI and Gradio app for text-to-speech from the terminal or browser
  • Full training pipeline with Hydra configs, Lightning runner, and multi-GPU support
  • ONNX export and ONNX Runtime inference, with optional end-to-end vocoder integration
  • Utilities for dataset normalization, phoneme alignment extraction, and custom-dataset training
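
As an illustration of what dataset normalization typically involves, the sketch below computes a global mean and standard deviation over a few toy mel-spectrograms and uses them to standardize each utterance. This is an assumption about the general technique, not the repository's utility: the function names `mel_statistics` and `normalize` and the nested-list data layout are hypothetical.

```python
import math


def mel_statistics(mels):
    """Return the global mean and standard deviation over all values in all utterances."""
    count = 0
    total = 0.0
    total_sq = 0.0
    for mel in mels:              # mel: list of frames
        for frame in mel:         # frame: list of per-band values
            for value in frame:
                count += 1
                total += value
                total_sq += value * value
    mean = total / count
    # Clamp at zero to guard against tiny negative values from rounding.
    variance = max(total_sq / count - mean * mean, 0.0)
    return mean, math.sqrt(variance)


def normalize(mel, mean, std):
    """Standardize one utterance with the dataset-level statistics."""
    return [[(v - mean) / std for v in frame] for frame in mel]


# Two toy "utterances" with two mel bands each (values are made up).
toy_mels = [
    [[-4.0, -2.0], [-6.0, -4.0]],
    [[-5.0, -3.0]],
]

mean, std = mel_statistics(toy_mels)
normalized = [normalize(m, mean, std) for m in toy_mels]
print(mean, std)
```

Computing the statistics once over the whole training set, and reusing them at inference time, keeps the model's inputs and outputs on a consistent scale.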

Categories

Text to Speech

License

MIT License

Additional Project Details

Programming Language

Python

Related Categories

Python Text to Speech Software

Registered

2025-11-28