Compare the Top Voice Cloning Software that integrates with GitHub as of November 2025

This a list of Voice Cloning software that integrates with GitHub. Use the filters on the left to add additional filters for products that have integrations with GitHub. View the products that work with GitHub in the table below.

What is Voice Cloning Software for GitHub?

Voice cloning software is a type of software that enables a user to create a voice that sounds very similar to their own. It utilizes artificial intelligence, speech synthesis, and voice deep learning algorithms to produce accurate copies of audio recordings. This technology has been used in various applications such as video games, films, and advertising campaigns. It can also be used to create virtual assistants for customer service or other uses. Compare and read user reviews of the best Voice Cloning software for GitHub currently available using the table below. This list is updated regularly.

  • 1
    Zyphra Zonos
    Zyphra is excited to announce the release of Zonos-v0.1 beta, featuring two expressive and real-time text-to-speech models with high-fidelity voice cloning. We are releasing our 1.6B transformer and 1.6B hybrid under an Apache 2.0 license. It is difficult to quantitatively measure quality in the audio domain; we find that Zonos’ generation quality matches or exceeds that of leading proprietary TTS model providers. Further, we believe that openly releasing models of this caliber will significantly advance TTS research. Zonos model weights are available on Huggingface, and sample inference code for the models is available on our GitHub. You can also access Zonos through our model playground and API with simple and competitive flat-rate pricing. We have found that quantitative evaluations struggle to measure the quality of outputs in the audio domain, so for demonstration, we present a number of samples of Zonos vs both proprietary models.
    Starting Price: $0.02 per minute
  • 2
    Chirp 3

    Chirp 3

    Google

    ​Google Cloud's Text-to-Speech API introduces Chirp 3, enabling users to create personalized voice models using their own high-quality audio recordings. This feature facilitates the rapid generation of custom voices, which can be utilized to synthesize audio through the Cloud Text-to-Speech API, supporting both streaming and long-form text. Access to this voice cloning capability is restricted to allow-listed users due to safety considerations; interested parties should contact the sales team to be added to the allowed list. Instant Custom Voice creation and synthesis are supported in various languages, including English (US), Spanish (US), and French (Canada), among others. It is available in multiple Google Cloud regions, and supported output formats include LINEAR16, OGG_OPUS, PCM, ALAW, MULAW, and MP3, depending on the API method used.
  • Previous
  • You're on page 1
  • Next