Showing 63 open source projects for "voice synthesis"

View related business solutions
  • Red Hat Enterprise Linux on Microsoft Azure Icon
    Red Hat Enterprise Linux on Microsoft Azure

    Deploy Red Hat Enterprise Linux on Microsoft Azure for a secure, reliable, and scalable cloud environment, fully integrated with Microsoft services.

    Red Hat Enterprise Linux (RHEL) on Microsoft Azure provides a secure, reliable, and flexible foundation for your cloud infrastructure. Red Hat Enterprise Linux on Microsoft Azure is ideal for enterprises seeking to enhance their cloud environment with seamless integration, consistent performance, and comprehensive support.
  • ContractSafe: Contract Management Software Icon
    ContractSafe: Contract Management Software

    Take Control Of Your Contracts Without Wrecking The Budget

    Ditch those spreadsheets, shared drives & crazy-expensive solutions with too many bells & whistles. ContractSafe offers the simplest way to manage your contracts efficiently without breaking the bank.
  • 1
    Alan AI

    Alan AI

    In-App assistant SDK to build a multimodal conversational UX websites

    ...-backend powered by the industry’s best Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Speech Synthesis. The Alan Cloud provisions and handles the infrastructure required to maintain your voice deployments and perform all the voice processing tasks. To voice enable your app, you only need to get the Alan Client SDK and drop it to your app. No need to plan for, deploy and maintain any infrastructure or speech components - the Alan Platform does the bulk of the work.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    elevenlabs-api

    elevenlabs-api

    elevenlabs-api is an open source Java wrapper around the ElevenLabs

    Elevenlabs-api is an open-source Java wrapper around the ElevenLabs Voice Synthesis and Cloning Web API. Compiled JARs are available via the Releases tab. To access your ElevenLabs API key, head to the official website, you can view your xi-API-key using the 'Profile' tab on the website. To set up your ElevenLabs API key, you must register it with the ElevenLabsAPI Java API. For any public repository security, you should store your API key in an environment variable, or external from your...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 3
    Alan AI for iOS

    Alan AI for iOS

    In-App assistant SDK to build a multimodal conversational UX for iOS

    ...-backend powered by the industry’s best Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Speech Synthesis. The Alan Cloud provisions and handles the infrastructure required to maintain your voice deployments and perform all the voice processing tasks. Voice enable your app, you only need to get the Alan Client SDK and drop it into your app. No need to plan for, deploy and maintain any infrastructure or speech components - the Alan Platform does the bulk of the work.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    PyGPT

    PyGPT

    Open source personal AI Assistant for Linux, Windows and Mac

    PyGPT is a desktop application that allows you to talk to OpenAI's LLM models such as GPT4 and GPT3 using your own computer and OpenAI API. It allows you to talk in chat mode and in completion mode, as well as generate images using DALL-E 2. PyGPT also adds access to the Internet for GPT via Google Custom Search API and Wikipedia API and includes voice synthesis using Microsoft Azure Text-to-Speech API. Moreover, the application has implemented context memory support, context storage, history...
    Downloads: 12 This Week
    Last Update:
    See Project
  • Case Management Software for Social Services Icon
    Case Management Software for Social Services

    For human services organizations looking for case management software

    Collaborate is customizable case management software for non-profits and social services agencies with teams of 5+ staff.
  • 5
    Alan AI for Android

    Alan AI for Android

    Assistant SDK to build a multimodal conversational UX for Android

    ...-backend powered by the industry’s best Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Speech Synthesis. The Alan Cloud provisions and handles the infrastructure required to maintain your voice deployments and perform all the voice processing tasks. Voice enable your app, you only need to get the Alan Client SDK and drop it into your app. No need to plan for, deploy and maintain any infrastructure or speech components - the Alan Platform does the bulk of the work.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    RHVoice

    RHVoice

    Free open source speech synthesizer for Russian and other languages

    RHVoice is a free and open-source multilingual speech synthesizer. Its developers hope to give more visually impaired people the ability to use a good free synthesis voice reading in their native language with their screen reader. We are especially interested in supporting those languages for which there are currently no good voices that could be used with a screen reader. The creator of RHVoice, Olga Yakovleva, is blind herself. Many of the contributors to the RHVoice project, both programmers...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Alan AI for Flutter

    Alan AI for Flutter

    SDK to build a multimodal conversational UX for Flutter apps

    ...-backend powered by the industry’s best Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Speech Synthesis. The Alan Cloud provisions and handles the infrastructure required to maintain your voice deployments and perform all the voice processing tasks. Voice enable your app, you only need to get the Alan Client SDK and drop it into your app. No need to plan for, deploy and maintain any infrastructure or speech components - the Alan Platform does the bulk of the work.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    eGuideDog free software for the blind
    eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.
    Leader badge
    Downloads: 289 This Week
    Last Update:
    See Project
  • 9
    UTAU

    UTAU

    Singing voice synthesis tool

    UTAU is singing voice synthesis software created for Windows. It has the ability to assemble a song based on a sampled audio library, and you can create your own audio library (original sound file set) in addition to the included audio. Currently, there are a large number of audio libraries created for UTAU on the internet, and by downloading and using them, you can create singing songs with your favorite voices.
    Downloads: 19 This Week
    Last Update:
    See Project
  • AI-based, Comprehensive Service Management for Businesses and IT Providers Icon
    AI-based, Comprehensive Service Management for Businesses and IT Providers

    Modular solutions for change management, asset management and more

    ChangeGear provides IT staff with the functions required to manage everything from ticketing to incident, change and asset management and more. ChangeGear includes a virtual agent, self-service portals and AI-based features to support analyst and end user productivity.
  • 10
    byzorgan

    byzorgan

    Specialized sound synthesizer with Byzantine Church music scales

    This software integrates a small, specialized synthesizer and vocal processor. It can be used to learn Byzantine Church singing. You can play from the keyboard, mouse or touch screen. MIDI input is also available. Voice functions include: pitch highlighting, synthesizer control by voice, pitch correction and voice-to-ison conversion. On the screen there are labels with symbols of Byzantine notes. There is a metronome. The program is oriented on the Chrysanthos tuning of the diatonic scale: 9/8...
    Downloads: 38 This Week
    Last Update:
    See Project
  • 11
    ATC-pie

    ATC-pie

    Air traffic control tower and radar simulator (solo + multi-player)

    ATC-pie is an air traffic control simulation program. It features solo, multi-player and teacher-student sessions, rendering 3D views of airports through FlightGear. It is essentially designed for realism, and simulates real-life ATC tasks and equipment such as strip racks and sequence management, handovers to/from neighbouring controllers, flight plans, primary & secondary radars, RDF, CPDLC, ATIS recording...
    Leader badge
    Downloads: 28 This Week
    Last Update:
    See Project
  • 12
    Maia
    MAIA (MyApp Intelligence Artificial) is designed to provide a foundation for building your own voice-controlled assistant with Python. It uses various libraries and modules for speech recognition, text-to-speech synthesis, and custom functionality.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    MARS5-TTS

    MARS5 is a fully open-source, hyper-realistic text-to-speech (TTS).

    CAMB.AI introduces MARS5, a fully open-source (commercially usable) TTS with break-through prosody and realism available on our Github: https://www.github.com/camb-ai/mars5-tts MARS5 is able to replicate performances (from 2-3s of audio reference) in 140+ languages, even for extremely tough prosodic scenarios like sports commentary, movies, anime and more; hard prosody that most closed-source and open-source TTS models struggle with today. We're excited for you to try, build on and use...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    InstrumentalMusic

    InstrumentalMusic

    Application which detects musical notes from the microphone.

    Application which detects musical notes from the microphone. It allows listening to the microphone and play the detected notes to output (in midi). Multilanguage support. Zoom Dark mode option JDK-17 compatibility With v1.2 it includes a pitch shifter (making voice lower or sharper through a slider) There is a demo video which shows how it works (the demo video can be visited from Help menu of the application) You can also see the pitch-shifter demo version here: https...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Alan AI for Cordova

    Alan AI for Cordova

    Assistant SDK to build a multimodal conversational UX for Apache

    ...-backend powered by the industry’s best Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Speech Synthesis. The Alan Cloud provisions and handles the infrastructure required to maintain your voice deployments and perform all the voice processing tasks. Voice enable your app, you only need to get the Alan Client SDK and drop it into your app. No need to plan for, deploy and maintain any infrastructure or speech components - the Alan Platform does the bulk of the work.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Alan AI for React Native

    Alan AI for React Native

    Build a multimodal conversational UX for apps created with React

    ...-backend powered by the industry’s best Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Speech Synthesis. The Alan Cloud provisions and handles the infrastructure required to maintain your voice deployments and perform all the voice processing tasks. Voice enable your app, you only need to get the Alan Client SDK and drop it into your app. No need to plan for, deploy and maintain any infrastructure or speech components - the Alan Platform does the bulk of the work.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Parakeet

    Parakeet

    PAddle PARAllel text-to-speech toolKIT

    ... their reference implementations in PaddlePaddle. Further more, Parakeet abstracts the TTS pipeline and standardizes the procedure of data preprocessing, common module sharing, model configuration, and the process of training and synthesis. The models supported here include Text FrontEnd, end-to-end Acoustic models and Vocoders.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    CRONLOCO!

    CRONLOCO!

    User-Programmable Voice Clock

    Annoy your neighbor, antagonize your boss, or simply make everyone else smile with this insidiously customizable audio clock.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Multilingual Speech Synthesis

    Multilingual Speech Synthesis

    An implementation of Tacotron 2 that supports multilingual experiments

    This repository provides synthesized samples, training and evaluation data, source code, and parameters for the paper One Model, Many Languages: Meta-learning for Multilingual Text-to-Speech. It contains an implementation of Tacotron 2 that supports multilingual experiments and that implements different approaches to encoder parameter sharing. It presents a model combining ideas from Learning to speak fluently in a foreign language: Multilingual speech synthesis and cross-language voice cloning...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    PaddlePaddle models

    PaddlePaddle models

    Pre-trained and Reproduced Deep Learning Models

    ... detection, image segmentation, text recognition, speech synthesis, etc. An end-to-end development kit that meets the needs of enterprises for low-cost development and rapid integration. The model library of Flying Paddle is an industrial-level model library tailored around the actual R&D process of domestic enterprises, serving enterprises in many fields such as energy, finance, industry, and agriculture.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    Dragonfire

    Dragonfire

    The open-source virtual assistant for Ubuntu based Linux distributions

    .... It will contain various software packages for controlling the helmet. It will be the first of its kind. Dragonfire uses Mozilla DeepSpeech to understand your voice commands and Festival Speech Synthesis System to handle text-to-speech tasks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    vocoder_chung
    vocoder chung is a small educational vocoder using discrete fourier transform FFT spectrum written in easy fast compiled freebasic . (24/12/2019) uses fast and accurate FFTdll.dll (28/03/2020) algorythmic voice cloning / change / morphing experiment added
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    Deepvoice3_pytorch

    Deepvoice3_pytorch

    PyTorch implementation of convolutional neural networks

    An open source implementation of Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    Ham Radio Projects

    KB1OIQ Ham Radio Projects

    This area contains files from a variety of ham radio related projects including: * Software and hardware mods for the BITX40 QRP SSB transceiver from India, including primitive voice synthesis from the Arduino Talkie library. * Software and hardware mods for the uBITX QRP SSB/CW transceiver from India, including voice synthesis using the EMIC-2 chip. * Hardware and software project to create an 80m ARDF transmitter using an Arduino and a Cricket 80a * My Arduino sketches...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25

    Voice Choir Modulator with PureData

    Technique of Vocal Tract model, Subtractive synthesis + Effects

    This is a voice modulator implemented in PureData. Differents techniques are applied in order to find the best result for choir (Vocal tract, subtractive synthesis, AM, FM, etc) Also included some effects like Vibrato, Tremolo and Reverb. The folder also has a description of the project with block diagrams and an user manual. The modulator can also be used with a midi controller.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next