MockingBird is an open-source voice cloning and real-time speech generation toolkit that lets you clone a speaker’s voice from a short audio sample (reportedly as little as 5 seconds) and then synthesize arbitrary speech in that voice. It builds on deep-learning based TTS / voice-cloning technology (in the lineage of projects such as Real-Time-Voice-Cloning), but extends it with support for Mandarin Chinese and multiple Chinese speech datasets — broadening its applicability beyond English. The codebase is implemented in Python (with PyTorch) and includes modules for encoder, synthesizer, vocoder, preprocessing, and inference, as well as demo scripts and a web-server interface for easier experimentation or deployment. MockingBird supports both using pretrained models and training your own synthesizer (with custom datasets), giving flexibility for voice-cloning or custom-voice synthesis depending on your needs.

Features

  • Zero-shot voice cloning: generate speech in a target voice from just a short reference sample (≈ 5 seconds)
  • Support for Mandarin Chinese (and tested on multiple Chinese speech datasets) in addition to standard English TTS, broadening voice-cloning language support
  • Full TTS pipeline implemented: encoder, synthesizer, vocoder, preprocessing, training and inference modules, plus ready-made demo tools
  • Ability to use pretrained encoder/vocoder while training or fine-tuning the synthesizer to speed up customization
  • Optional web-server interface plus CLI/demo scripts for easy local testing, deployment or integration in applications
  • Cross-platform support (Windows, Linux, community-documented compatibility with Apple-Silicon/M1) and MIT-licensed for free reuse

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow Mocking Bird

Mocking Bird Web Site

Other Useful Business Software
Try Google Cloud Risk-Free With $300 in Credit Icon
Try Google Cloud Risk-Free With $300 in Credit

No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
Start Free
Rate This Project
Login To Rate This Project

User Ratings

★★★★★
★★★★
★★★
★★
0
0
0
0
1
ease 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 1 / 5
features 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 1 / 5
design 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 1 / 5
support 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 1 / 5

User Reviews

  • it does Not even install - because there is No exe or setup file - completely useless !!! waste of Time/Data to download !!!
Read more reviews >

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Python

Related Categories

Python Text to Speech Software, Python Voice Cloning Software

Registered

2023-03-23