intelligence free download

OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model

OpenVoice is a versatile instant voice cloning system that can replicate a speaker’s tone color from just a short audio clip and then generate speech in multiple languages. It is designed not only to match the timbre of the reference voice, but also to give granular control over style parameters such as emotion, accent, rhythm, pauses, and intonation. The model supports cross-lingual and even zero-shot cross-lingual voice cloning, so a speaker recorded in one language can be made to speak...

Downloads: 91 This Week

Last Update: 2025-11-28

See Project

GPT-SoVITS

1 min voice data can also be used to train a good TTS model

GPT‑SoVITS is a state-of-the-art voice conversion and TTS system that enables zero‑shot and few‑shot synthesis based on a short vocal sample (e.g., 5 seconds). It supports cross‑lingual speech synthesis across English, Chinese, Japanese, Korean, Cantonese, and more. It's powered by VITS architecture enhanced for few‑sample adaptation and real‑time usability.

Downloads: 32 This Week

Last Update: 2025-07-29

See Project

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model

PaddleSpeech is an open-source toolkit on PaddlePaddle platform for a variety of critical tasks in speech and audio, with state-of-art and influential models. Via the easy-to-use, efficient, flexible and scalable implementation, our vision is to empower both industrial application and academic research, including training, inference & testing modules, and deployment process. Low barriers to install, CLI, Server, and Streaming Server is available to quick-start your journey. We provide...

Downloads: 0 This Week

Last Update: 2025-03-04

See Project

elevenlabs-api

elevenlabs-api is an open source Java wrapper around the ElevenLabs

Elevenlabs-api is an open-source Java wrapper around the ElevenLabs Voice Synthesis and Cloning Web API. Compiled JARs are available via the Releases tab. To access your ElevenLabs API key, head to the official website, you can view your xi-API-key using the 'Profile' tab on the website. To set up your ElevenLabs API key, you must register it with the ElevenLabsAPI Java API. For any public repository security, you should store your API key in an environment variable, or external from your...

Downloads: 1 This Week

Last Update: 2023-12-25

See Project

Lyrebird

Simple and powerful voice changer for Linux, written with Python & GTK

Simple and powerful voice changer for Linux, written with Python & GTK.

Downloads: 44 This Week

Last Update: 2024-06-27

See Project

lora-svc

Singing voice change based on whisper, lora for singing voice clone

singing voice change based on whisper, and lora for singing voice clone. You will feel the beauty of the code from this project. Uni-SVC main branch is for singing voice clone based on whisper with speaker encoder and speaker adapter. Uni-SVC main target is to develop lora for SVC. With lora, maybe clone a singer just need 10 stence after 10 minutes train. Each singer is a plug-in of the base model.

Downloads: 0 This Week

Last Update: 2023-06-12

See Project

VoiceSmith

[WIP] VoiceSmith makes training text to speech models easy

VoiceSmith makes it possible to train and infer on both single and multispeaker models without any coding experience. It fine-tunes a pretty solid text to speech pipeline based on a modified version of DelightfulTTS and UnivNet on your dataset. Both models were pretrained on a proprietary 5000 speaker dataset. It also provides some tools for dataset preprocessing like automatic text normalization. Windows (only CPU supported currently) or any Linux based operating system. If you want to run...

Downloads: 0 This Week

Last Update: 2023-03-24

See Project

VoiceOver

VoiceOver is a web application that allows you to transcribe audio

VoiceOver is a web application that allows you to transcribe English audio and listen to it in another voice. Choose a source, an audio file (.wav) in English only. Transcribe audio, several algorithms will take care of it. Listen to the generated transcription, a man or a woman, it's up to you!

1 Review

Downloads: 0 This Week

Last Update: 2023-03-24

See Project

Mocking Bird

Clone a voice in 5 seconds to generate arbitrary speech in real-time

MockingBird is an open-source voice cloning and real-time speech generation toolkit that lets you clone a speaker’s voice from a short audio sample (reportedly as little as 5 seconds) and then synthesize arbitrary speech in that voice. It builds on deep-learning based TTS / voice-cloning technology (in the lineage of projects such as Real-Time-Voice-Cloning), but extends it with support for Mandarin Chinese and multiple Chinese speech datasets — broadening its applicability beyond English....

1 Review

Downloads: 2 This Week

Last Update: 2023-03-23

See Project

Multilingual Speech Synthesis

An implementation of Tacotron 2 that supports multilingual experiments

This repository provides synthesized samples, training and evaluation data, source code, and parameters for the paper One Model, Many Languages: Meta-learning for Multilingual Text-to-Speech. It contains an implementation of Tacotron 2 that supports multilingual experiments and that implements different approaches to encoder parameter sharing. It presents a model combining ideas from Learning to speak fluently in a foreign language: Multilingual speech synthesis and cross-language voice...

Downloads: 0 This Week

Last Update: 2023-03-24

See Project

Search Results for "intelligence"

Showing 10 open source projects for "intelligence"

OpenVoice

GPT-SoVITS

PaddleSpeech

elevenlabs-api

Lyrebird

lora-svc

VoiceSmith

VoiceOver

Mocking Bird

Multilingual Speech Synthesis

Search Results for "intelligence"

Showing 10 open source projects for "intelligence"

OpenVoice

GPT-SoVITS

PaddleSpeech

elevenlabs-api

Lyrebird

lora-svc

VoiceSmith

VoiceOver

Mocking Bird

Multilingual Speech Synthesis

Related Searches

Related Categories