MockingBird is an open-source voice cloning and real-time speech generation toolkit that lets you clone a speaker’s voice from a short audio sample (reportedly as little as 5 seconds) and then synthesize arbitrary speech in that voice. It builds on deep-learning based TTS / voice-cloning technology (in the lineage of projects such as Real-Time-Voice-Cloning), but extends it with support for Mandarin Chinese and multiple Chinese speech datasets — broadening its applicability beyond English. The codebase is implemented in Python (with PyTorch) and includes modules for encoder, synthesizer, vocoder, preprocessing, and inference, as well as demo scripts and a web-server interface for easier experimentation or deployment. MockingBird supports both using pretrained models and training your own synthesizer (with custom datasets), giving flexibility for voice-cloning or custom-voice synthesis depending on your needs.

Features

  • Zero-shot voice cloning: generate speech in a target voice from just a short reference sample (≈ 5 seconds)
  • Support for Mandarin Chinese (and tested on multiple Chinese speech datasets) in addition to standard English TTS, broadening voice-cloning language support
  • Full TTS pipeline implemented: encoder, synthesizer, vocoder, preprocessing, training and inference modules, plus ready-made demo tools
  • Ability to use pretrained encoder/vocoder while training or fine-tuning the synthesizer to speed up customization
  • Optional web-server interface plus CLI/demo scripts for easy local testing, deployment or integration in applications
  • Cross-platform support (Windows, Linux, community-documented compatibility with Apple-Silicon/M1) and MIT-licensed for free reuse

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow Mocking Bird

Mocking Bird Web Site

Other Useful Business Software
MongoDB Atlas runs apps anywhere Icon
MongoDB Atlas runs apps anywhere

Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Start Free
Rate This Project
Login To Rate This Project

User Ratings

★★★★★
★★★★
★★★
★★
0
0
0
0
1
ease 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 1 / 5
features 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 1 / 5
design 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 1 / 5
support 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 1 / 5

User Reviews

  • it does Not even install - because there is No exe or setup file - completely useless !!! waste of Time/Data to download !!!
Read more reviews >

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Python

Related Categories

Python Text to Speech Software, Python Voice Cloning Software

Registered

2023-03-23