Showing 41 open source projects for "tts voice cloning"

  • 1
    TTS Voice Wizard

    Speech to Text to Speech, sends text as OSC messages

    Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) Use TTS Voice Wizard's accessibility features to improve your VRChat experience (it works outside of VRChat too!) You can convert your Speech-to-Text and back to Speech through various Speech Recognition and Text-to-Speech methods. You can send what you say as OSC messages to VRChat to be displayed on your avatar using KillFrenzyAvatarText or VRChats...
    Downloads: 18 This Week
  • 2
    Coqui TTS

    A deep learning toolkit for Text-to-Speech, battle-tested in research

    TTS is a library for advanced Text-to-Speech generation. Built on the latest research, it was designed to achieve the best trade-off among ease of training, speed, and quality. TTS comes with pre-trained models and tools for measuring dataset quality, and is already used in 20+ languages for products and research projects. High-performance Deep Learning models for Text2Speech tasks. Text2Spec models (Tacotron, Tacotron2, Glow-TTS, SpeedySpeech). Speaker Encoder to compute speaker embeddings...
    Downloads: 20 This Week
  • 3
    Voice Cloning App

    A Python/PyTorch app for easily synthesizing human voices

    A Python/PyTorch app for easily synthesizing human voices. If you are using a language other than English, you can add it to the app. First, you'll need to find a deep speech model for your language by going to Coqui. You'll then need to download the model.pbmm and alphabet.txt files for your language. Requires Windows 10 or Ubuntu 20.04+, 5GB+ of disk space, and (optionally) an NVIDIA GPU with at least 4GB of memory and driver version 456.38+. Automatic dataset generation (with...
    Downloads: 0 This Week
  • 4
    MARS5-TTS

    MARS5 is a fully open-source, hyper-realistic text-to-speech (TTS).

    CAMB.AI introduces MARS5, a fully open-source (commercially usable) TTS with breakthrough prosody and realism, available on our GitHub (https://www.github.com/camb-ai/mars5-tts). MARS5 can replicate performances (from 2-3s of reference audio) in 140+ languages, even in extremely tough prosodic scenarios like sports commentary, movies, anime and more: hard prosody that most closed-source and open-source TTS models struggle with today. We're excited for you to try, build on and use...
    Downloads: 0 This Week
  • 5
    Lyrebird

    Simple and powerful voice changer for Linux, written with Python & GTK.
    Downloads: 7 This Week
  • 6
    lora-svc

    Singing voice change based on Whisper, LoRA for singing voice clone

    Singing voice change based on Whisper, with LoRA for singing voice cloning. You will feel the beauty of the code from this project. The Uni-SVC main branch is for singing voice cloning based on Whisper with a speaker encoder and speaker adapter. Uni-SVC's main target is to develop LoRA for SVC. With LoRA, cloning a singer may need just 10 sentences and about 10 minutes of training. Each singer is a plug-in of the base model.
    Downloads: 2 This Week
  • 7
    PaddleSpeech

    Easy-to-use Speech Toolkit including Self-Supervised Learning model

    ...-speed and ultra-lightweight models, and also cutting-edge technology. We provide production-ready streaming ASR and streaming TTS systems. Our frontend contains Text Normalization and Grapheme-to-Phoneme (G2P, including Polyphone and Tone Sandhi). Moreover, we use self-defined linguistic rules to adapt to the Chinese context.
    Downloads: 0 This Week
  • 8
    elevenlabs-api

    elevenlabs-api is an open source Java wrapper around the ElevenLabs

    elevenlabs-api is an open-source Java wrapper around the ElevenLabs Voice Synthesis and Cloning Web API. Compiled JARs are available via the Releases tab. To find your ElevenLabs API key (xi-API-key), head to the official website and open the 'Profile' tab. To set up your ElevenLabs API key, you must register it with the ElevenLabsAPI Java API. For security in any public repository, you should store your API key in an environment variable, or external from your...
    Downloads: 1 This Week
  • 9
    NVIDIA NeMo

    Toolkit for conversational AI

    NVIDIA NeMo, part of the NVIDIA AI platform, is a toolkit for building new state-of-the-art conversational AI models. NeMo has separate collections for Automatic Speech Recognition (ASR), Natural Language Processing (NLP), and Text-to-Speech (TTS) models. Each collection consists of prebuilt modules that include everything needed to train on your data. Every module can easily be customized, extended, and composed to create new conversational AI model architectures. Conversational AI...
    Downloads: 3 This Week
  • 10
    Amiga Memories

    A walk along memory lane

    Amiga Memories is a project (started & released in 2013) that aims to make video programmes that can be published on the internet. The images and sound produced by Amiga Memories are 100% automatically generated. The generator itself is implemented in Squirrel, the 3D rendering is done on GameStart 3D. An Amiga Memories video is mostly based on a narrative. The purpose of the script is to define the spoken and written content. The spoken text will be read by a voice synthesizer (Text To Speech...
    Downloads: 0 This Week
  • 11
    VoiceSmith

    [WIP] VoiceSmith makes training text to speech models easy

    VoiceSmith makes it possible to train and infer on both single and multispeaker models without any coding experience. It fine-tunes a pretty solid text to speech pipeline based on a modified version of DelightfulTTS and UnivNet on your dataset. Both models were pretrained on a proprietary 5000 speaker dataset. It also provides some tools for dataset preprocessing like automatic text normalization. Requires Windows (only CPU supported currently) or any Linux-based operating system. If you want to run...
    Downloads: 0 This Week
  • 12
    Pearl Desktop (PDE) 12

    Stable Solid Compiz Powered Multimedia Workhorse

    ... Piper handles great sounding natural Text to Speech Voice Models.
    Downloads: 302 This Week
  • 13
    eGuideDog free software for the blind
    eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.
    Downloads: 221 This Week
  • 14
    PBXinaFlash 3/ IncrediblePBX
    Longing for the good old days of Asterisk@Home? Welcome back to the steroid-enhanced version. PBX in a Flash 3.0 & Incredible PBX 2020/2021/2022/2027 are the latest Lean, Mean Asterisk Machines, high-performance, turnkey Asterisk PBXs that are easy to upgrade. Features include Rocky8, CentOS/SL 7.x, Ubuntu 22.04 & 20.04, Debian 10 and Raspbian 10 support with Asterisk 20/18/16 and FreePBX 16/15 GPL modules. Add-ons include one-click installs of Incredible Fax and many other Asterisk...
    Downloads: 111 This Week
  • 15
    Pearl MATE Studio 12

    OSX Styled Powerful Audio Workstation

    Pearl MATE Studio 12 runs on the Ubuntu 24.04 base with no snap support. This release does, however, support Flatpak. Text to speech on Pearl is managed with Pied, which downloads and selects natural voice models; the keyboard shortcut <Alt>+s plays back highlighted text. This release of Pearl MATE Studio is a lot lighter on preinstalled software, so the user may choose what they want by installing through our software manager, Gdebi, or Synaptic which are pre...
    Downloads: 72 This Week
  • 16
    SpeakLogPSU
    SpeakLogPSU can speak chat messages with an individual voice if the NPC or player was configured, or with a default one otherwise. You will never miss it when someone talks to you. Voice cloning can be accomplished with Coqui in less than five minutes without a GPU. The result is archived and can be reused the next time in game. Some TTS projects have already started to add tag support to speak text with emotions or sing it. If a game designer has that in mind, with a good chat log she can voice her game...
    Downloads: 0 This Week
  • 17
    AI Models Downloads

    Open Source AI Models: Your One-Stop Destination for AI Model Downloads

    The download is a sample Voice Pack trained and used with Coqui TTS. AIModels.org provides a comprehensive directory of AI models for download, ranging from text-to-speech and large language models to text-to-image models. Our goal is to provide you with easy access to a vast array of pre-trained models, complete with examples and demonstrations. Discover the Power of AI Models: AI models have come a long way, empowering various applications that improve our lives on a daily basis. Here...
    Downloads: 1 This Week
  • 18
    RemoteTTS

    Tool to remotely activate Text-To-Speech (TTS) on a server

    The tool provides a simple TCP/UDP interface to let a remote machine perform TTS outputs.
    Downloads: 0 This Week
  • 19
    VoiceOver

    VoiceOver is a web application that allows you to transcribe audio

    VoiceOver is a web application that allows you to transcribe English audio and listen to it in another voice. Choose a source: an audio file (.wav), in English only. Transcribe the audio; several algorithms will take care of it. Listen to the generated transcription in a man's or a woman's voice; it's up to you!
    Downloads: 0 This Week
  • 20
    Parakeet

    PAddle PARAllel text-to-speech toolKIT

    PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN) Parakeet aims to provide a flexible, efficient and state-of-the-art text-to-speech toolkit for the open-source community. It is built on PaddlePaddle dynamic graph and includes many influential TTS models. In order to facilitate exploiting the existing TTS models directly and developing the new ones, Parakeet selects typical models and provides...
    Downloads: 0 This Week
  • 21
    Mocking Bird

    Clone a voice in 5 seconds to generate arbitrary speech in real-time

    AI voice imitation: clone a voice in 5 seconds to generate arbitrary speech in real-time. 🌍 Chinese: supports Mandarin and is tested with multiple datasets: aidatatang_200zh, magicdata, aishell3, data_aishell, etc. 🤩 PyTorch: works with PyTorch, tested on version 1.9.0 (the latest as of August 2021) with Tesla T4 and GTX 2060 GPUs. 🌍 Windows + Linux: runs on both Windows and Linux (even on M1 macOS). 🤩 Easy & Awesome: good results with only a newly trained synthesizer, by reusing...
    Downloads: 0 This Week
  • 22
    Multilingual Speech Synthesis

    An implementation of Tacotron 2 that supports multilingual experiments

    This repository provides synthesized samples, training and evaluation data, source code, and parameters for the paper "One Model, Many Languages: Meta-learning for Multilingual Text-to-Speech". It contains an implementation of Tacotron 2 that supports multilingual experiments and that implements different approaches to encoder parameter sharing. It presents a model combining ideas from Learning to speak fluently in a foreign language: Multilingual speech synthesis and cross-language voice cloning...
    Downloads: 0 This Week
  • 23
    Olivia

    Your new best friend powered by an artificial neural network

    Olivia is an open-source chatbot built in Golang using Machine Learning technologies. Its goal is to provide a free and open-source alternative to big services like DialogFlow. You can chat with her by speaking (STT) or writing; she replies with a text message, but you can enable her voice (TTS). Olivia starts listening when you say “Hey Olivia” or click the central button. She speaks her replies unless you've disabled her voice. Olivia respects your privacy. All the data used by Olivia...
    Downloads: 1 This Week
  • 24
    Ohod Quiz Game

    quiz game with spin wheel

    - quiz game - works on any system - multi-language (UTF-8)
    Downloads: 3 This Week
  • 25
    vocoder_chung
    vocoder chung is a small educational vocoder using a discrete Fourier transform (FFT) spectrum, written in easy, fast, compiled FreeBASIC. (24/12/2019) Uses the fast and accurate FFTdll.dll. (28/03/2020) Algorithmic voice cloning / change / morphing experiment added.
    Downloads: 0 This Week