Search Results for "python voice synthesis" - Page 2

Showing 290 open source projects for "python voice synthesis"

1

Style-Bert-VITS2

Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles

Style-Bert-VITS2 is a text-to-speech system based on Bert-VITS2 that focuses on highly controllable voice styles and emotional expression. It takes the original Bert-VITS2 v2.1 and its Japanese-Extra variant and extends them so you can control emotion and speaking style with fine-grained intensity, not just choose a generic tone. The project targets both power users and beginners: Windows users without Git or Python can install and run it using bundled .bat scripts, while advanced users can work with virtual environments, uv, and Python tooling. ...

Downloads: 7 This Week

Last Update: 2025-11-28
See Project
2

CosyVoice

Multi-lingual large voice generation model, providing inference

CosyVoice is a multilingual large voice generation model that offers a full-stack solution for training, inference, and deployment of high-quality TTS systems. The model supports multiple languages, including Chinese, English, Japanese, Korean, and a range of Chinese dialects such as Cantonese, Sichuanese, Shanghainese, Tianjinese, and Wuhanese. It is designed for zero-shot voice cloning and cross-lingual or mix-lingual scenarios, so a single reference voice can be used to synthesize speech...

Downloads: 2 This Week

Last Update: 2025-11-30
See Project
3

Amazing-Python-Scripts

Curated collection of Amazing Python scripts

Amazing-Python-Scripts is a collaborative repository that collects a wide variety of Python scripts designed to demonstrate practical programming techniques and automation tasks. The project includes scripts ranging from beginner-level utilities to more advanced applications involving machine learning, data processing, and system automation. Its goal is to provide developers with useful coding examples that can solve everyday problems, automate repetitive tasks, or serve as learning exercises. ...

Downloads: 3 This Week

Last Update: 2026-03-11
See Project
4

Alan AI

In-App assistant SDK to build a multimodal conversational UX for websites

...A powerful web-based IDE where you can write, test and debug dialog scenarios for your voice assistant or chatbot. Alan's AI backend is powered by the industry’s best Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Speech Synthesis. The Alan Cloud provisions and handles the infrastructure required to maintain your voice deployments and perform all the voice processing tasks. To voice-enable your app, you only need to get the Alan Client SDK and drop it into your app. ...

Downloads: 6 This Week

Last Update: 4 days ago
See Project
5

Fun Audio Chat

Large Audio Language Model built for natural interactions

Fun Audio Chat is an interactive voice-first conversational AI platform designed to let users engage in natural spoken dialogue with large language models in real time, turning speech into context-aware responses while maintaining a smooth back-and-forth experience. It combines speech recognition, audio processing, and AI generation so users can speak simply and receive spoken replies, enabling applications such as virtual assistants, voice bots, and hands-free chat interfaces. The system...

Downloads: 0 This Week

Last Update: 2026-02-27
See Project
6

FireRedTTS-2

Long-form streaming TTS system for multi-speaker dialogue generation

...FireRedTTS2 supports multilingual output and speaker flexibility, enabling scenarios that involve language switching, cross-lingual voice cloning, and expressive dialogue generation that maintains consistency over longer utterances.

Downloads: 3 This Week

Last Update: 2026-02-16
See Project
7

Applio

A simple, high-quality voice conversion tool focused on ease of use

Applio is a high-quality voice conversion toolkit designed to make modern RVC/VITS-based voice cloning accessible to non-experts. It focuses strongly on ease of use: installation scripts for Windows, Linux, and macOS set up dependencies and then launch a browser-based Gradio interface. Within that interface, users can train and run voice conversion models for tasks like singing conversion, speech-to-speech transformation, and voice cloning. The project is structured to be flexible through...

Downloads: 111 This Week

Last Update: 2026-02-18
See Project
8

ElatoAI

Realtime AI Voice Agents with SoTA Multimodal AI models on Arduino ESP

ElatoAI is a real-time AI voice agent platform built around IoT hardware (ESP32) that enables continuous speech-to-speech conversations using state-of-the-art multimodal voice models with minimal latency and global performance via edge computing. The system integrates voice synthesis and recognition by connecting an ESP32 device through secure WebSockets to edge server functions written in Deno, allowing users to speak naturally with AI agents hosted through cloud APIs including OpenAI’s Realtime API, Gemini’s Live API, xAI’s Grok Voice Agent API, and others. ...

Downloads: 1 This Week

Last Update: 4 days ago
See Project
9

Hugging Face - Speech To Speech

Open speech-to-speech models and pipelines toolkit by Hugging Face

...Overall, it highlights an emerging approach to voice technology that reduces latency and preserves more of the original speech characteristics.

Downloads: 0 This Week

Last Update: 2026-03-18
See Project
10

VibeVoice ComfyUI

ComfyUI integration for Microsoft's VibeVoice text-to-speech model

VibeVoice ComfyUI is a comprehensive wrapper that integrates Microsoft’s VibeVoice text-to-speech models directly into ComfyUI workflows. It exposes VibeVoice as a set of custom nodes so you can build single-speaker and multi-speaker voice generation pipelines visually, combining TTS with other audio or generative components. The integration supports high-quality single-speaker synthesis as well as scripted multi-speaker conversations, with optional voice cloning from audio samples for each speaker. It includes advanced control over generation parameters like attention backend, diffusion steps, sampling temperature, guidance scale, and quantization settings, allowing users to tune the trade-offs between quality, VRAM usage, and speed. ...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
11

kokoro-onnx

TTS with kokoro and onnx runtime

...It supports multiple languages and voices, with a curated voice list and configuration via a VOICES file hosted alongside the models. The package is distributed on PyPI, meaning you can integrate it directly into applications or scripts using standard Python tooling. It also recommends pairing with an external G2P package to improve pronunciation quality, especially for more complex languages or names, and is licensed under permissive MIT and Apache-style licenses.

Downloads: 167 This Week

Last Update: 2025-11-28
See Project
12

MLX-Audio

A text-to-speech, speech-to-text and speech-to-speech library

...Because it uses MLX and targets Apple Silicon, inference is fast and can take advantage of hardware acceleration and quantization for efficient on-device performance. The project provides a straightforward CLI (mlx_audio.tts.generate) as well as a Python API for programmatic generation of audio, including parameters for voice choice, speed, language hints, output format, and sample rate. It includes examples such as audiobook generation to demonstrate long-form synthesis and joined audio segments. On top of that, MLX-Audio offers a modern web interface powered by FastAPI, with real-time waveform and 3D visualizations, file upload, and audio management.

Downloads: 7 This Week

Last Update: 2026-03-30
See Project
13

Bolna

Conversational voice AI agents

Bolna is an end-to-end open-source platform for building conversational voice AI agents, enabling developers to create voice-first conversational assistants efficiently.

Downloads: 3 This Week

Last Update: 2 days ago
See Project
14

RunAnywhere

Production ready toolkit to run AI locally

RunAnywhere SDKs are a set of cross-platform development tools that enable applications to run artificial intelligence models directly on user devices instead of relying on cloud infrastructure. The toolkit allows developers to integrate language models, speech recognition, and voice synthesis capabilities into mobile or desktop applications while keeping all computation local. By running models entirely on device, the platform eliminates network latency and protects user data because information does not leave the device. The SDK supports popular open-source models such as Llama, Mistral, and Qwen, enabling developers to build AI-powered features such as chat interfaces and voice assistants with minimal external dependencies. ...

Downloads: 0 This Week

Last Update: 2 days ago
See Project
15

Alan AI for Android

Assistant SDK to build a multimodal conversational UX for Android

...A powerful web-based IDE where you can write, test and debug dialog scenarios for your voice assistant or chatbot. Alan's AI backend is powered by the industry’s best Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Speech Synthesis. The Alan Cloud provisions and handles the infrastructure required to maintain your voice deployments and perform all the voice processing tasks. To voice-enable your app, you only need to get the Alan Client SDK and drop it into your app. ...

Downloads: 0 This Week

Last Update: 2024-07-01
See Project
16

VideoChat

Real-time voice interactive digital human

...It is built as a Gradio Python demo, exposing a web interface where users can talk to an animated avatar that lip-syncs to synthesized speech while responding intelligently. The system is customizable: you can define your own avatar appearance and voice, and it supports voice cloning so you can generate a new voice from a short 3–10 second reference sample. The tech stack integrates FunASR for speech recognition, Qwen for language understanding, multiple TTS engines like GPT-SoVITS, CosyVoice, or edge-tts, and MuseTalk for talking-head generation.

Downloads: 2 This Week

Last Update: 2025-12-18
See Project
17

Alan AI for iOS

In-App assistant SDK to build a multimodal conversational UX for iOS

...A powerful web-based IDE where you can write, test and debug dialog scenarios for your voice assistant or chatbot. Alan's AI backend is powered by the industry’s best Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Speech Synthesis. The Alan Cloud provisions and handles the infrastructure required to maintain your voice deployments and perform all the voice processing tasks. To voice-enable your app, you only need to get the Alan Client SDK and drop it into your app. ...

Downloads: 0 This Week

Last Update: 2024-07-01
See Project
18

ChatTTS webUI & API

A simple native web interface that uses ChatTTS to synthesize text

ChatTTS-ui is a local web interface and API wrapper around the ChatTTS speech synthesis system, designed to make advanced TTS models easy to use from a browser. It runs a small backend server (Python + Torch + ffmpeg) and exposes a simple webpage where you can type text, adjust parameters, and generate audio. The project supports Chinese, English, and mixed text with digits and control symbols, making it suitable for bilingual content and numerically heavy text like announcements or prompts. ...

Downloads: 10 This Week

Last Update: 2025-11-28
See Project
19

Yandex Smart Home

Adds support for Yandex Smart Home (Alice voice assistant)

Adds support for Yandex Smart Home (Alice voice assistant) to Home Assistant. The component lets you add devices from Home Assistant to the Yandex smart home platform and manage them from any device with Alice. It requires Home Assistant version 2023.2 or later.

Downloads: 6 This Week

Last Update: 2026-03-31
See Project
20

RealtimeSTT

A robust, efficient, low-latency speech-to-text library

RealtimeSTT is a Python-based realtime speech-to-text engine emphasizing low latency, wake-word detection, voice activity detection, and automatic speech segmentation. It provides asynchronous callbacks, nanosecond-precision timestamps, and CLI tools, suitable for building voice assistants, meeting transcribers, or live caption systems.

Downloads: 3 This Week

Last Update: 2025-07-03
See Project
21

TADA

Open Source Speech Language Model

TADA is an open-source speech-language modeling framework designed to unify spoken audio and text representations within a single generative architecture. The system focuses on aligning speech and text streams using a dual-alignment mechanism that synchronizes the acoustic signal with its textual representation. By modeling both modalities together, the framework allows developers to build systems capable of generating, understanding, and transforming speech and language simultaneously. This...

Downloads: 0 This Week

Last Update: 2026-03-24
See Project
22

Fooocus

Focus on prompting and generating

Fooocus is open-source image generation software that simplifies the process of creating images from text prompts. Built on Gradio and leveraging Stable Diffusion XL, Fooocus eliminates the need for manual parameter tweaking, allowing users to focus solely on crafting prompts. It offers a user-friendly interface with minimal setup, making advanced image synthesis accessible to a broader audience.

Downloads: 250 This Week

Last Update: 2025-06-03
See Project
23

ElevenLabs UI

Component library and custom registry built on top of shadcn/ui

...It is tightly aligned with ElevenLabs’ ecosystem, allowing seamless integration with their voice synthesis and conversational AI tools. The project also includes a CLI that simplifies the process of adding components directly into existing projects, streamlining development workflows.

Downloads: 0 This Week

Last Update: 2026-03-19
See Project
24

XiaoZhi AI Chatbot

Build your own AI friend

xiaozhi-esp32 is an open-source project that guides users in building their own AI-powered conversational companion using the ESP32 microcontroller. The project provides detailed instructions on assembling the hardware, setting up the software, and integrating AI models to enable natural language interactions. This DIY approach offers an accessible entry point into AI and hardware development.

Downloads: 120 This Week

Last Update: 2 days ago
See Project
25

ebook2audiobook

Generate audiobooks from e-books, voice cloning & 1107+ languages

ebook2audiobook is a tool to convert legally obtained eBooks (non-DRM) into fully narrated audiobooks, complete with chapters and metadata. It automates the pipeline: it reads the eBook file, splits it into appropriate segments (chapters, paragraphs), uses text-to-speech (TTS) models to synthesize audio, optionally applies voice cloning, and outputs a final audiobook — ideal for people who prefer listening over reading, or for accessibility purposes. The tool supports a wide array of...

Downloads: 29 This Week

Last Update: 20 hours ago
See Project


© 2026 Slashdot Media. All Rights Reserved.
