python voice synthesis free download

Showing 75 open source projects for "python voice synthesis"

1

PersonaPlex

PersonaPlex code

PersonaPlex is an open-source real-time conversational speech AI model that goes beyond traditional text chat by providing full-duplex speech-to-speech interaction, meaning it can listen and talk at the same time instead of waiting for you to finish speaking before responding. This architectural approach eliminates awkward pauses and makes conversations feel much more human-like, with natural behaviors such as overlapping speech, interruptions, and fluent turn-taking, traits that traditional...

Downloads: 5 This Week

Last Update: 2026-03-02
See Project
2

RHVoice

Free open source speech synthesizer for Russian and other languages

RHVoice is a free and open-source multilingual speech synthesizer. Its developers hope to give more visually impaired people the ability to use a good free synthesis voice reading in their native language with their screen reader. We are especially interested in supporting those languages for which there are currently no good voices that could be used with a screen reader. The creator of RHVoice, Olga Yakovleva, is blind herself. Many of the contributors to the RHVoice project, both programmers and non-programmers, are blind or partially sighted.

Downloads: 43 This Week

Last Update: 2026-03-31
See Project
3

VCClient

Software that uses AI to perform real-time voice conversion

VCClient is a real-time voice conversion system that uses machine learning models to transform a speaker’s voice into another voice with minimal latency. It is designed for live applications such as streaming, gaming, and virtual communication, where immediate feedback is essential. The system supports multiple voice conversion models, including RVC and other neural network-based approaches, allowing users to switch between different voices or customize their output. It provides both a...

Downloads: 20 This Week

Last Update: 2026-03-23
See Project
4

Speakr

Speakr is a personal, self-hosted web application

Speakr is an open-source, real-time text-to-speech (TTS) web application that allows users to convert written text into natural-sounding speech in just a few clicks. It provides a clean, user-friendly interface where users can input text, choose a voice style or language, and immediately hear the output, making it ideal for accessibility, content creation, and learning applications. Behind the scenes, Speakr leverages modern TTS engines and streaming audio technologies to deliver smooth and...

Downloads: 0 This Week

Last Update: 2026-03-18
See Project
5

Podcastfy.ai

Transforming Multimodal Content into Captivating Multilingual Audio

Podcastfy is an open-source Python package that transforms multi-modal content (text, images) into engaging, multi-lingual audio conversations using GenAI. Input content includes websites, PDFs, and YouTube videos, as well as images. Unlike UI-based tools focused primarily on note-taking or research synthesis (e.g. NotebookLM), Podcastfy focuses on the programmatic and bespoke generation of engaging, conversational transcripts and audio from a multitude of multi-modal sources, enabling customization and scale.

Downloads: 10 This Week

Last Update: 2024-11-16
See Project
6

ML Sharp

Sharp Monocular View Synthesis in Less Than a Second

ML Sharp is a research code release that turns a single 2D photograph into a photorealistic 3D representation that can be rendered from nearby viewpoints. Instead of requiring multi-view input, it predicts the parameters of a 3D Gaussian scene representation directly from one image using a single forward pass through a neural network. The core idea is speed: the 3D representation is produced in under a second on a standard GPU, and then the resulting scene can be rendered in real time to...

Downloads: 4 This Week

Last Update: 2026-01-29
See Project
7

AudioNotes

Extract audio and video content and organize it into a Markdown note

AudioNotes is an application (or proof-of-concept) that likely combines audio recording or playback with note-taking or annotation functionality — enabling users to record voice or audio and attach textual or timestamped notes, making it ideal for lectures, interviews, meetings, or personal memos. Such a tool offers a more expressive and flexible way to capture and revisit information: instead of just typed notes or raw audio, users get both audio context and structured notes. As an...

Downloads: 2 This Week

Last Update: 2025-12-04
See Project
8

byzorgan

Specialized sound synthesizer with Byzantine Church music scales

This software integrates a small, specialized synthesizer and vocal processor. It can be used to learn Byzantine Church singing. You can play from the keyboard, mouse or touch screen. MIDI input is also available. Voice functions include: pitch highlighting, synthesizer control by voice, pitch correction and voice-to-ison conversion. On the screen there are labels with symbols of Byzantine notes. There is a metronome. The program is oriented on the Chrysanthos tuning of the diatonic scale:...

4 Reviews

Downloads: 23 This Week

Last Update: 2025-07-28
See Project
9

Audio Satanifier 666

Easily apply cool gnarly voice filters to your audio files

Transform pure, innocent audio files (speech, music, etc.) into unholy demonic abominations. Audio Satanifier 666 is a fun, easy-to-use browser-based tool forged in the pits of hell for voice actors, musicians, sound designers, memes, creative projects, or anyone else who wants to twist their sound into something absolutely diabolical! Layperson friendly: you'll be able to apply cool effects to your audio file even if you know nothing about audio engineering. There's also a...

Downloads: 1 This Week

Last Update: 2025-07-27
See Project
10

eGuideDog free software for the blind

eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.

16 Reviews

Downloads: 199 This Week

Last Update: 17 hours ago
See Project
11

InstrumentalMusic

Application which detects musical notes from the microphone.

Application which detects musical notes from the microphone. It listens to the microphone and plays the detected notes to the output (as MIDI). Features include multilanguage support, zoom, a dark mode option, and JDK 17 compatibility. As of v1.2 it includes a pitch shifter (making the voice lower or higher through a slider). A demo video shows how it works (it can be opened from the Help menu of the application). You can also see the pitch-shifter demo version...
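
As a rough illustration of the note-detection and pitch-shifting ideas described above, the sketch below maps a frequency to the nearest MIDI note and shifts a frequency by a number of semitones. It is not taken from InstrumentalMusic (a Java application); the function names are hypothetical, and it assumes 12-tone equal temperament with A4 = 440 Hz.

```python
import math

# Pitch-class names for one octave, starting at C.
NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def frequency_to_midi(freq_hz, a4_hz=440.0):
    """Return the nearest MIDI note number for a frequency in Hz.

    Assumes equal temperament, with MIDI note 69 tuned to a4_hz.
    """
    if freq_hz <= 0:
        raise ValueError("frequency must be positive")
    return int(round(69 + 12 * math.log2(freq_hz / a4_hz)))


def midi_to_name(midi_note):
    """Return a name such as 'A4' for a MIDI note number (60 is C4)."""
    octave = midi_note // 12 - 1
    return f"{NOTE_NAMES[midi_note % 12]}{octave}"


def shift_frequency(freq_hz, semitones):
    """Shift a frequency up (positive) or down (negative) by semitones."""
    return freq_hz * 2 ** (semitones / 12)


if __name__ == "__main__":
    detected = 261.63  # hypothetical detected frequency in Hz
    note = frequency_to_midi(detected)
    print(note, midi_to_name(note))
```

A real detector would first have to estimate the frequency from microphone samples, which this sketch leaves out.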

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
12

Internet DJ Console

A feature packed DJ console and internet radio client for Linux users

Conceived as an internet radio Shoutcast/Icecast client and DJ console, IDJC has two main media players, a background track player, effects buttons, a crossfader, WebM, AAC, Ogg, and MP3 streaming, stream automation timers, aux input, and voice and VoIP integration. Supported media file formats include mp3, ogg, flac, wma, wav, m4a, m3u, xspf, pls, and cue sheets. It also offers IRC track and station announcements and uses the JACK Audio Connection Kit to provide a flexible audio chain. This list of features is by no...

32 Reviews

Downloads: 11 This Week

Last Update: 2026-01-10
See Project
13

DALL-E 2 - Pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch. The main novelty seems to be an extra layer of indirection with the prior network (whether it is an autoregressive transformer or a diffusion network), which predicts an image embedding based on the text embedding from CLIP. Specifically, this repository will only build out the diffusion prior network, as it is the best performing variant (but which incidentally involves a causal transformer as...

Downloads: 14 This Week

Last Update: 2023-10-19
See Project
14

CRONLOCO!

User-Programmable Voice Clock

Annoy your neighbor, antagonize your boss, or simply make everyone else smile with this insidiously customizable audio clock.

Downloads: 0 This Week

Last Update: 2022-01-16
See Project
15

VAD

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM

...Post-processing modules (e.g. smoothing, thresholds). The toolkit supports both MATLAB and Python/TensorFlow components (for feature extraction, classification, postprocessing). Acoustic feature extraction (multi-resolution cochleagram, MRCG). Provided real-world dataset with manual annotations.
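
To make the thresholding and smoothing post-processing steps mentioned above concrete, here is a minimal sketch. It uses a simple frame-energy detector rather than the toolkit's DNN, bDNN, or LSTM models; the function names and parameters are hypothetical and are not the toolkit's API.

```python
def frame_energies(samples, frame_len):
    """Mean squared amplitude of each full, non-overlapping frame."""
    return [
        sum(x * x for x in samples[i:i + frame_len]) / frame_len
        for i in range(0, len(samples) - frame_len + 1, frame_len)
    ]


def threshold_decisions(energies, threshold):
    """Mark a frame as speech when its energy exceeds the threshold."""
    return [energy > threshold for energy in energies]


def smooth_decisions(decisions, radius=1):
    """Majority vote over a window of up to 2 * radius + 1 frames.

    The window is clipped at the edges; ties count as non-speech.
    This removes isolated one-frame flips in the raw decisions.
    """
    smoothed = []
    for i in range(len(decisions)):
        window = decisions[max(0, i - radius):i + radius + 1]
        smoothed.append(sum(window) * 2 > len(window))
    return smoothed


if __name__ == "__main__":
    # Hypothetical signal: silence, one loud frame, silence.
    signal = [0.0] * 4 + [1.0] * 4 + [0.0] * 4
    raw = threshold_decisions(frame_energies(signal, 4), threshold=0.5)
    print(raw, smooth_decisions(raw))
```

In a model-based detector, the threshold would be applied to per-frame speech probabilities instead of raw energies, and the smoothing step would work the same way.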

Downloads: 1 This Week

Last Update: 2025-10-02
See Project
16

Swami Project

A SoundFont editor and other software for editing, managing and sharing sample based MIDI instrument files for computer music composition. Support for other formats is planned.

3 Reviews

Downloads: 8 This Week

Last Update: 2019-03-09
See Project
17

Loris

C++ class library for sound analysis, synthesis, and morphing

Loris is a library for sound analysis, synthesis, and morphing, developed by Kelly Fitz and Lippold Haken at the CERL Sound Group. Loris includes a C++ class library, Python module, C-linkable interface, command line utilities, and documentation.

1 Review

Downloads: 5 This Week

Last Update: 2016-08-23
See Project
18

The MusicKit

The MusicKit & SndKit is an object-oriented software system for building music, sound, signal processing & MIDI applications. The distribution is a comprehensive package that includes on-line documentation, code examples, utilities, applications & scores

Downloads: 2 This Week

Last Update: 2016-05-23
See Project
19

Sinsy

HMM-based singing voice synthesis system

Sinsy is an HMM-based singing voice synthesis system. This software is released under the Modified BSD license.

4 Reviews

Downloads: 21 This Week

Last Update: 2016-03-23
See Project
20

Nsound

A C++ library and Python module for audio synthesis featuring dynamic digital filters. Nsound lets you easily shape waveforms and write to disk or plot them. Nsound aims to be as powerful as Csound but easy to use.

Downloads: 0 This Week

Last Update: 2015-12-12
See Project
21

sparsedoppler

Continuous choice of string resonance at each point live music in CPU

Nonlinearly change frequencies and echoes for live music by CPU. I found a way to normalize 1d wavefunction amplitude so this hack and its random heat vibrations are still unitary, even while microphone vibrating adds energy to part of 1d string of position and speed scalar arrays. The sparse part is, while the arrays are perfectly dense and linear, time is sparse when some springs vibrate with a larger multiplier of position subtracted from speed. In other words, this hack is a quanta level...

1 Review

Downloads: 0 This Week

Last Update: 2015-11-05
See Project
22

Text to Speech for Video

Create WAV files for video character speech by typing in dialogue

Choose from the "voices" available, and type in what you want the computer to say. A WAV file called sounds.wav is saved to the output subfolder. Output is intended primarily for users who need speech for animated characters in videos.

Downloads: 0 This Week

Last Update: 2015-10-16
See Project
23

marsyas

Marsyas (Music Analysis, Retrieval and Synthesis for Audio Signals) is a framework for developing systems for audio processing. It provides a general architecture for connecting audio, soundfiles, signal processing blocks and machine learning. Source code at SF is outdated! Marsyas is now hosted at GitHub: https://github.com/marsyas/marsyas Downloads are now provided at Bintray: https://bintray.com/marsyas

6 Reviews

Downloads: 0 This Week

Last Update: 2014-11-25
See Project
24

Steel TTS

A cross-platform wrapper for common text-to-speech engines in Python

Steel is a cross-platform package for using common text-to-speech (speech synthesis) engines in Python. Steel currently supports the following TTS software:
- Microsoft Speech API 5 (SAPI5)
- eSpeak
- NS Speech Synthesis
- FreeTTS

Documentation: http://sourceforge.net/p/steeltts/wiki/
Bug Tracker: http://sourceforge.net/p/steeltts/tickets/

If you are interested in contributing to the Steel TTS codebase, or would like to make a feature-request, please contact the lead developer, Jasper Danielson, at jrd4@rice.edu.
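
A cross-platform wrapper of this kind typically has to decide which engine to use on the current operating system. The sketch below shows one way that selection could look. It is not Steel's actual API: the engine identifiers, the preference table, and the `pick_engine` function are all hypothetical, chosen only to mirror the engine list above.

```python
import sys

# Hypothetical preference table, not taken from Steel's source code.
# Keys are prefixes of sys.platform; values are engines in order of preference.
ENGINES_BY_PLATFORM = {
    "win32": ["sapi5", "espeak", "freetts"],
    "darwin": ["nsspeech", "espeak", "freetts"],
    "linux": ["espeak", "freetts"],
}


def pick_engine(available, platform=None):
    """Return the preferred engine name that is available, or None.

    `available` is a collection of engine names detected on the machine;
    how they are detected is outside the scope of this sketch.
    """
    platform = sys.platform if platform is None else platform
    for prefix, candidates in ENGINES_BY_PLATFORM.items():
        if platform.startswith(prefix):
            for name in candidates:
                if name in available:
                    return name
    return None


if __name__ == "__main__":
    print(pick_engine({"espeak", "freetts"}))
```

Keeping the preference order in a table, rather than in branching code, makes it easy to add an engine or change the fallback order per platform.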

Downloads: 1 This Week

Last Update: 2016-03-15
See Project
25

Simpl

Simpl is an open source library for sinusoidal modelling written in the Python programming language and making use of SciPy.
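
For readers unfamiliar with sinusoidal modelling, the idea is to represent a sound as a sum of sinusoids (partials), each with its own frequency and amplitude. The sketch below shows only the resynthesis half, with fixed partials, using the standard library. It does not use Simpl or SciPy, and the `synthesize` function and its parameters are hypothetical.

```python
import math


def synthesize(partials, num_samples, sample_rate=8000):
    """Sum fixed-frequency sinusoids into a list of samples.

    `partials` is a list of (frequency_hz, amplitude) pairs. A full
    sinusoidal model would let both values vary over time.
    """
    return [
        sum(
            amp * math.sin(2 * math.pi * freq * n / sample_rate)
            for freq, amp in partials
        )
        for n in range(num_samples)
    ]


if __name__ == "__main__":
    # Hypothetical tone: a fundamental plus a quieter second harmonic.
    tone = synthesize([(220.0, 0.6), (440.0, 0.3)], num_samples=8000)
    print(len(tone), max(abs(s) for s in tone))
```

The analysis half, which estimates the partials from a recording (for example by picking spectral peaks and tracking them across frames), is the harder part and is not shown here.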

Downloads: 0 This Week

Last Update: 2014-02-23
See Project