MockingBird is an open-source voice cloning and real-time speech generation toolkit that lets you clone a speaker’s voice from a short audio sample (reportedly as little as 5 seconds) and then synthesize arbitrary speech in that voice. It builds on deep-learning based TTS / voice-cloning technology (in the lineage of projects such as Real-Time-Voice-Cloning), but extends it with support for Mandarin Chinese and multiple Chinese speech datasets — broadening its applicability beyond English. The codebase is implemented in Python (with PyTorch) and includes modules for encoder, synthesizer, vocoder, preprocessing, and inference, as well as demo scripts and a web-server interface for easier experimentation or deployment. MockingBird supports both using pretrained models and training your own synthesizer (with custom datasets), giving flexibility for voice-cloning or custom-voice synthesis depending on your needs.

Features

  • Zero-shot voice cloning: generate speech in a target voice from just a short reference sample (≈ 5 seconds)
  • Support for Mandarin Chinese (and tested on multiple Chinese speech datasets) in addition to standard English TTS, broadening voice-cloning language support
  • Full TTS pipeline implemented: encoder, synthesizer, vocoder, preprocessing, training and inference modules, plus ready-made demo tools
  • Ability to use pretrained encoder/vocoder while training or fine-tuning the synthesizer to speed up customization
  • Optional web-server interface plus CLI/demo scripts for easy local testing, deployment or integration in applications
  • Cross-platform support (Windows, Linux, community-documented compatibility with Apple-Silicon/M1) and MIT-licensed for free reuse

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow Mocking Bird

Mocking Bird Web Site

Other Useful Business Software
Auth0 B2B Essentials: SSO, MFA, and RBAC Built In Icon
Auth0 B2B Essentials: SSO, MFA, and RBAC Built In

Unlimited organizations, 3 enterprise SSO connections, role-based access control, and pro MFA included. Dev and prod tenants out of the box.

Auth0's B2B Essentials plan gives you everything you need to ship secure multi-tenant apps. Unlimited orgs, enterprise SSO, RBAC, audit log streaming, and higher auth and API limits included. Add on M2M tokens, enterprise MFA, or additional SSO connections as you scale.
Sign Up Free
Rate This Project
Login To Rate This Project

User Ratings

★★★★★
★★★★
★★★
★★
0
0
0
0
1
ease 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 1 / 5
features 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 1 / 5
design 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 1 / 5
support 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 1 / 5

User Reviews

  • it does Not even install - because there is No exe or setup file - completely useless !!! waste of Time/Data to download !!!
Read more reviews >

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Python

Related Categories

Python Text to Speech Software, Python Voice Cloning Software

Registered

2023-03-23