python voice synthesis free download

Showing 52 open source projects for "python voice synthesis"

View related business solutions

Multimedia Mac Clear Filters & Widen Search

MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
AI-generated apps that pass security review
Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.

Try Retool free
1

FFsubsync

Automagically synchronize subtitles with video

...First, make sure ffmpeg is installed. Make sure ffmpeg is on your path and can be referenced from the command line! Next, grab the script. It should work with both Python 2 and Python 3. There may be occasions where you have a correctly synchronized srt file in a language you are unfamiliar with, as well as an unsynchronized srt file in your native language. In this case, you can use the correctly synchronized srt file directly as a reference for synchronization, instead of using the video as the reference. ffsubsync uses the file extension to decide whether to perform voice activity detection on the audio or to directly extract speech from an srt file. ffsubsync usually finishes in 20 to 30 seconds, depending on the length of the video.

Downloads: 51 This Week

Last Update: 2025-11-24
See Project
2

PersonaPlex

PersonaPlex code

PersonaPlex is an open-source real-time conversational speech AI model that goes beyond traditional text chat by providing full-duplex speech-to-speech interaction, meaning it can listen and talk at the same time instead of waiting for you to finish speaking before responding. This architectural approach eliminates awkward pauses and makes conversations feel much more human-like, with natural behaviors such as overlapping speech, interruptions, and fluent turn-taking, traits that traditional...

Downloads: 5 This Week

Last Update: 2026-03-02
See Project
3

RHVoice

Free open source speech synthesizer for Russian and other languages

RHVoice is a free and open-source multilingual speech synthesizer. Its developers hope to give more visually impaired people the ability to use a good free synthesis voice reading in their native language with their screen reader. We are especially interested in supporting those languages for which there are currently no good voices that could be used with a screen reader. The creator of RHVoice, Olga Yakovleva, is blind herself. Many of the contributors to the RHVoice project, both programmers and non-programmers, are blind or partially sighted.

Downloads: 43 This Week

Last Update: 2026-03-31
See Project
4

VCClient

Software that uses AI to perform real-time voice conversion

VCClient is a real-time voice conversion system that uses machine learning models to transform a speaker’s voice into another voice with minimal latency. It is designed for live applications such as streaming, gaming, and virtual communication, where immediate feedback is essential. The system supports multiple voice conversion models, including RVC and other neural network-based approaches, allowing users to switch between different voices or customize their output. It provides both a...

Downloads: 20 This Week

Last Update: 2026-03-23
See Project
$300 in Free Credit Towards Top Cloud Services
Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.

Get Started
5

Speakr

Speakr is a personal, self-hosted web application

Speakr is an open-source, real-time text-to-speech (TTS) web application that allows users to convert written text into natural-sounding speech in just a few clicks. It provides a clean, user-friendly interface where users can input text, choose a voice style or language, and immediately hear the output, making it ideal for accessibility, content creation, and learning applications. Behind the scenes, Speakr leverages modern TTS engines and streaming audio technologies to deliver smooth and...

Downloads: 0 This Week

Last Update: 2026-03-18
See Project
6

Podcastfy.ai

Transforming Multimodal Content into Captivating Multilingual Audio

Podcastfy is an open-source Python package that transforms multi-modal content (text, images) into engaging, multi-lingual audio conversations using GenAI. Input content includes websites, PDFs, youtube videos as well as images. Unlike UI-based tools focused primarily on note-taking or research synthesis (e.g. NotebookLM), Podcastfy focuses on the programmatic and bespoke generation of engaging, conversational transcripts and audio from a multitude of multi-modal sources enabling customization and scale.

Downloads: 10 This Week

Last Update: 2024-11-16
See Project
7

ML Sharp

Sharp Monocular View Synthesis in Less Than a Second

ML Sharp is a research code release that turns a single 2D photograph into a photorealistic 3D representation that can be rendered from nearby viewpoints. Instead of requiring multi-view input, it predicts the parameters of a 3D Gaussian scene representation directly from one image using a single forward pass through a neural network. The core idea is speed: the 3D representation is produced in under a second on a standard GPU, and then the resulting scene can be rendered in real time to...

Downloads: 4 This Week

Last Update: 2026-01-29
See Project
8

AudioNotes

Extract audio and video content and organize it into a Markdown note

AudioNotes is an application (or proof-of-concept) that likely combines audio recording or playback with note-taking or annotation functionality — enabling users to record voice or audio and attach textual or timestamped notes, making it ideal for lectures, interviews, meetings, or personal memos. Such a tool offers a more expressive and flexible way to capture and revisit information: instead of just typed notes or raw audio, users get both audio context and structured notes. As an...

Downloads: 2 This Week

Last Update: 2025-12-04
See Project
9

byzorgan

Specialized sound synthesizer with Byzantine Church music scales

This software integrates a small, specialized synthesizer and vocal processor. It can be used to learn Byzantine Church singing. You can play from the keyboard, mouse or touch screen. MIDI input is also available. Voice functions include: pitch highlighting, synthesizer control by voice, pitch correction and voice-to-ison conversion. On the screen there are labels with symbols of Byzantine notes. There is a metronome. The program is oriented on the Chrysanthos tuning of the diatonic scale:...

4 Reviews

Downloads: 23 This Week

Last Update: 2025-07-28
See Project
AI-powered service management for IT and enterprise teams
Enterprise-grade ITSM, for every business

Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.

Try it Free
10

Audio Satanifier 666

Easily apply cool gnarly voice filters to your audio files

Transform pure innocent audio files, speech, music, etc into unholy demonic abominations. Audio Satanifier 666 is a fun easy-to-use browser-based tool forged in the pits of hell, for voice actors, musicians, sound designers, for memes, for creative projects or anyone else who want to twist their sound into something absolutely diabolical! Layperson friendly - you'll be able to apply cool effects to your audio file even if you know nothing about audio engineering. Theres also a...

Downloads: 1 This Week

Last Update: 2025-07-27
See Project
11

InstrumentalMusic

Application which detects musical notes from the microphone.

Application which detects musical notes from the microphone. It allows listening to the microphone and play the detected notes to output (in midi). Multilanguage support. Zoom Dark mode option JDK-17 compatibility With v1.2 it includes a pitch shifter (making voice lower or sharper through a slider) There is a demo video which shows how it works (the demo video can be visited from Help menu of the application) You can also see the pitch-shifter demo version...

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
12

DALL-E 2 - Pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch. The main novelty seems to be an extra layer of indirection with the prior network (whether it is an autoregressive transformer or a diffusion network), which predicts an image embedding based on the text embedding from CLIP. Specifically, this repository will only build out the diffusion prior network, as it is the best performing variant (but which incidentally involves a causal transformer as...

Downloads: 14 This Week

Last Update: 2023-10-19
See Project
13

CRONLOCO!

User-Programmable Voice Clock

Annoy your neighbor, antagonize your boss, or simply make everyone else smile with this insidiously customizable audio clock.

Downloads: 0 This Week

Last Update: 2022-01-16
See Project
14

VAD

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM

...Post-processing modules (e.g. smoothing, thresholds). The toolkit supports both MATLAB and Python/TensorFlow components (for feature extraction, classification, postprocessing). Acoustic feature extraction (multi-resolution cochleagram, MRCG). Provided real-world dataset with manual annotations.

Downloads: 1 This Week

Last Update: 2025-10-02
See Project
15

Pyo Synth

A GUI to help with pyo synthesizer scripts manipulation.

Pyo Synth is an open source application that makes the manipulation of pyo scripts easier by letting you control it with a midi keyboard. The interface allows you to setup every control on your keyboard and link them to parameters in your script during runtime. It is also possible to save your progress directly in the pyo script. See manual for more explanation on features.

Downloads: 0 This Week

Last Update: 2017-03-27
See Project
16

Loris

C++ class library for sound analysis, synthesis, and morphing

Loris is a library for sound analysis, synthesis, and morphing, developed by Kelly Fitz and Lippold Haken at the CERL Sound Group. Loris includes a C++ class library, Python module, C-linkable interface, command line utilities, and documentation.

1 Review

Downloads: 5 This Week

Last Update: 2016-08-23
See Project
17

The MusicKit

The MusicKit & SndKit is an object-oriented software system for building music, sound, signal processing & MIDI applications. The distribution is a comprehensive package that includes on-line documentation, code examples, utilities, applications & scores

Downloads: 2 This Week

Last Update: 2016-05-23
See Project
18

Sinsy

HMM-based singing voice synthesis system

Sinsy is an HMM-based singing voice synthesis system. This software is released under the Modified BSD license.

4 Reviews

Downloads: 21 This Week

Last Update: 2016-03-23
See Project
19

Nsound

A C++ library and Python module for audio synthesis featuring dynamic digital filters. Nsound lets you easily shape waveforms and write to disk or plot them. Nsound aims to be as powerful as Csound but easy to use.

Downloads: 0 This Week

Last Update: 2015-12-12
See Project
20

sparsedoppler

Continuous choice of string resonance at each point live music in CPU

Nonlinearly change frequencies and echos for live music by CPU. I found a way to normalize 1d wavefunction amplitude so this hack and its random heat vibrations are still unitary, even while microphone vibrating adds energy to part of 1d string of position and speed scalar arrays. The sparse part is, while the arrays are perfectly dense and linear, time is sparse when some springs vibrate with a larger multiplier of position subtracted from speed. In other words, this hack is a quanta level...

1 Review

Downloads: 0 This Week

Last Update: 2015-11-05
See Project
21

Text to Speech for Video

create wav files for video character speech by typing in dialogue

Choose from the "voices" available, and type in what you want the computer to say. A wave file called sounds.wav is stored to the output sub folder. Output is intended primarily for users who need speech for animated characters in videos.

Downloads: 0 This Week

Last Update: 2015-10-16
See Project
22

marsyas

Marsyas (Music Analysis, Retrieval and Synthesis for Audio Signals) is a framework for developing systems for audio processing. It provides an general architecture for connecting audio, soundfiles, signal processing blocks and machine learning. Source code at SF is outdated! Marsyas is now hosted at GitHub: https://github.com/marsyas/marsyas Downloads are now provided at Bintray: https://bintray.com/marsyas

6 Reviews

Downloads: 0 This Week

Last Update: 2014-11-25
See Project
23

Steel TTS

A cross-platform wrapper for common text-to-speech engines in Python

Steel is a cross-platform package for using common text-to-speech (speech synthesis) engines in Python. Steel currently supports the following TTS software: - Microsoft Speech API 5 (SAPI5) - eSpeak - NS Speech Synthesis - FreeTTS Documentation: http://sourceforge.net/p/steeltts/wiki/ Bug Tracker: http://sourceforge.net/p/steeltts/tickets/ If you are interested in contributing to the Steel TTS codebase, or would like to make a feature-request, please contact the lead developer, Jasper Danielson, at jrd4@rice.edu.

Downloads: 1 This Week

Last Update: 2016-03-15
See Project
24

Simpl

Simpl is an open source library for sinusoidal modelling written in the Python programming language and making use of SciPy.

Downloads: 0 This Week

Last Update: 2014-02-23
See Project
25

pyespeak

Python to eSpeak speech synthesis

ctypes Python module for eSpeak http://espeak.sf.net speech synthesis

Downloads: 0 This Week

Last Update: 2017-10-28
See Project