Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence
Text to Speech Software
Search Results

Search Results for "file-studio"

x

Sort By:

Relevance

Clear All Filters

OS

Linux 26
Mac 24
Windows 24
More...
BSD 9
ChromeOS 8
Mobile Operating Systems 2
Desktop Operating Systems 1

Category

Artificial Intelligence 26
- Text to Speech 26
- Voice Assistants 1
Multimedia 4
Internet 2
Business 1
Education 1
Games 1
Scientific/Engineering 1
Text Editors 1

License

OSI-Approved Open Source 24

Translations

English 2
Bengali 1
Brazilian Portuguese 1

Programming Language

Python 10
JavaScript 4
TypeScript 3
Java 2
More...
C++ 1
C# 1
Unix Shell 1

Status

Production/Stable 4
Beta 3
Alpha 2

Showing 26 open source projects for "file-studio"

View related business solutions

Text to Speech Linux Clear Filters & Widen Search

Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
AI-generated apps that pass security review
Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.

Try Retool free
1

Voicebox

The open-source voice synthesis studio powered by Qwen3-TTS

Voicebox is a local-first voice synthesis studio that aims to bring professional, DAW-like voice generation workflows to a desktop app while keeping models and voice data entirely on your machine. It positions itself as an open-source alternative to cloud voice platforms by emphasizing privacy, offline use, and freedom from subscriptions or usage caps. The tool supports downloading voice models, cloning voices from short audio samples, and generating speech locally, then organizing the results using studio-oriented editing concepts. ...

Downloads: 117 This Week

Last Update: 2 days ago
See Project
2

VoxCPM2

Tokenizer-Free TTS for Multilingual Speech Generation

VoxCPM2 is an advanced open-source text-to-speech system that redefines speech synthesis by eliminating traditional tokenization and instead generating continuous speech representations through a diffusion-based autoregressive architecture. Built on top of the MiniCPM model family, it enables highly natural, expressive, and context-aware speech generation that adapts tone, emotion, and pacing directly from input text. The system is trained on massive multilingual datasets, enabling support...

Downloads: 24 This Week

Last Update: 2026-04-13
See Project
3

kokoro-onnx

TTS with kokoro and onnx runtime

...It focuses on running efficiently on commodity hardware, including macOS with Apple Silicon, while still delivering near real-time performance for many use cases. The project ships prebuilt model files and a simple example script, so you can go from installation to producing an audio.wav file in just a few steps. It supports multiple languages and voices, with a curated voice list and configuration via a VOICES file hosted alongside the models. The package is distributed on PyPI, meaning you can integrate it directly into applications or scripts using standard Python tooling. It also recommends pairing with an external G2P package to improve pronunciation quality, especially for more complex languages or names, and is licensed under permissive MIT and Apache-style licenses.

Downloads: 137 This Week

Last Update: 2025-11-28
See Project
4

ebook2audiobook

Generate audiobooks from e-books, voice cloning & 1107+ languages

ebook2audiobook is a tool to convert legally obtained eBooks (non-DRM) into fully narrated audiobooks, complete with chapters and metadata. It automates the pipeline: it reads the eBook file, splits it into appropriate segments (chapters, paragraphs), uses text-to-speech (TTS) models to synthesize audio, optionally applies voice cloning, and outputs a final audiobook — ideal for people who prefer listening over reading, or for accessibility purposes. The tool supports a wide array of underlying TTS backends (XTTSv2, Bark, VITS, Fairseq, Tacotron2, YourTTS and more), which gives flexibility depending on hardware availability, voice preference, and language. ...

Downloads: 26 This Week

Last Update: 2 days ago
See Project
Go From AI Idea to AI App Fast
One platform to build, fine-tune, and deploy ML models. No MLOps team required.

Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.

Try Free
5

gTTS

Python library and CLI tool to interface with Google Translate

gTTS (Google Text-to-Speech) is a Python library and command-line tool that wraps the speech functionality of Google Translate. It lets you send text to the Google Translate TTS endpoint and receive spoken audio back as MP3 data, either written to a file, a file-like object, or standard output. The library is designed to handle long texts, using a speech-specific sentence tokenizer that keeps intonation and punctuation natural while splitting requests into acceptable chunks. It supports customizable text pre-processors, which can correct pronunciations, tweak formatting, or handle domain-specific vocabulary before sending it to the API. gTTS is primarily aimed at developers who want a quick way to add cloud-backed speech to scripts, apps, or pipelines without managing any model weights locally. ...

Downloads: 6 This Week

Last Update: 2025-11-28
See Project
6

Voice-Pro

Comprehensive Gradio WebUI for audio processing

Voice-Pro is the best gradio WebUI for transcription, translation and text-to-speech. It can be easily installed with one click. Create a virtual environment using Miniconda, running completely separate from the Windows system (fully portable). Supports real-time transcription and translation, as well as batch mode.

1 Review

Downloads: 31 This Week

Last Update: 2025-12-05
See Project
7

MLX-Audio

A text-to-speech, speech-to-text and speech-to-speech library

...It includes examples such as audiobook generation to demonstrate long-form synthesis and joined audio segments. On top of that, MLX-Audio offers a modern web interface powered by FastAPI, with real-time waveform and 3D visualizations, file upload, and audio management.

Downloads: 5 This Week

Last Update: 2026-03-30
See Project
8

VibeVoice ComfyUI

ComfyUI integration for Microsoft's VibeVoice text-to-speech model

VibeVoice ComfyUI is a comprehensive wrapper that integrates Microsoft’s VibeVoice text-to-speech models directly into ComfyUI workflows. It exposes VibeVoice as a set of custom nodes so you can build single-speaker and multi-speaker voice generation pipelines visually, combining TTS with other audio or generative components. The integration supports high-quality single-speaker synthesis as well as scripted multi-speaker conversations, with optional voice cloning from audio samples for each...

Downloads: 5 This Week

Last Update: 2025-11-28
See Project
9

Auto Synced & Translated Dubs

Automatically translates the text of a video based on a subtitle file

Auto-Synced-Translated-Dubs is a toolchain that automatically translates and re-dubs videos using AI voices while keeping the new speech aligned to the original timing via subtitle files. It assumes you have a human-made SRT (or similar) subtitle file; the script then uses translation services such as Google Cloud or DeepL to generate translated subtitle tracks in one or more target languages. Using the timestamps of each subtitle line, it computes the required duration of each spoken segment and synthesizes audio via neural TTS services, producing one audio clip per subtitle entry. ...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
Fully Managed MySQL, PostgreSQL, and SQL Server
Automatic backups, patching, replication, and failover. Focus on your app, not your database.

Cloud SQL handles your database ops end to end, so you can focus on your app.

Try Free
10

IMS Toucan

Controllable and fast Text-to-Speech for over 7000 languages

...IMS-Toucan ships with several ready-to-run scripts, including GUIs for interactive demos, prosody override tools, zero-shot language embedding injection, and text-to-audio file generation. Pretrained models are automatically downloaded when needed, and there is an online demo instance hosted on GPU that anyone can try.

Downloads: 1 This Week

Last Update: 2025-11-28
See Project
11

EasyVoice

Open source text-to-speech tool, supports extra-long text

easyVoice is an open-source text-to-speech platform aimed at turning long-form text and novels into high-quality audio, with a strong focus on usability and scalability. It provides a web interface where users can paste or upload large texts and generate speech and subtitles in a single workflow, even for works exceeding 100,000 characters. The system supports multi-role voice acting, letting users assign different neural voices to different characters or narrative roles and configure...

Downloads: 1 This Week

Last Update: 2026-01-26
See Project
12

SpeakLogPSU

SpeakLogPSU can speak chat messages with an individual voice if the NPC or player was configured or with a default one. You will never miss if someone talks to you. Voice cloning can be accomplished with Coqui in less than five minutes without GPU. The result is archived and can be used the next time in game. Some TTS projects already started to add tag support to speak text with emotions or sing it. If a game designer has that in mind with a good chat log she can voiced her...

Downloads: 0 This Week

Last Update: 2024-10-14
See Project
13

Voice Accounting For Blind & Mute People

Free & Easy AI Voice Accounting Software For Blind & Speechless People

Just download the above zip file, extract it and then open the index.html file on internet browsers like Firefox ( preferable ) or Google Chrome. Also, please view and download my full collection of softwares for people with disabilities, here : https://sourceforge.net/projects/softwares-for-disabled-people/ This full collection also includes the Voice Accounting Software as well.

Downloads: 0 This Week

Last Update: 2024-04-30
See Project
14

Softwares For Blind, Deaf, Handicap

Easy AI Softwares for Blind, Deaf, Handicapped, Disabled People

Just download the above zip file, extract it first and then open the index.html file on internet browsers like Firefox ( preferable ) or Google Chrome. Also, keep NumLock ON while using the Numeric Keypad of any Keyboard. Can also attach an external USB keyboard, with seperate Numeric Keypad, if required. I have added some general guidelines for students, using these softwares, on the Wiki Page of this website.

Downloads: 0 This Week

Last Update: 2026-01-18
See Project
15

TTS-Vue

Microsoft speech synthesis tool, built with Electron

...For long texts or big files, TTS-Vue automatically slices content into manageable segments, converts them separately, and then stitches them back into a single audio file, avoiding the usual length or timeout issues with TTS APIs.

Downloads: 41 This Week

Last Update: 2025-11-28
See Project
16

edge-TTS-record

Tool that can record speech synthesis

...Users can type or paste text, preview the speech, and then trigger the recorder; the system automatically captures the audio output from the browser and writes it to a WAV file. The tool includes a small GUI (built with Aardio) and aims to be plug-and-play — after downloading the .exe you can immediately start using it without deep configuration. It is cloud-based in the sense that it relies on Edge’s online TTS service, so internet connection is required; but once recorded, the audio is local.

Downloads: 1 This Week

Last Update: 2025-11-28
See Project
17

Ohod Speed Reading

Speed Reading

برنامج القراءة السريعة برنامج للتدريب على القراءة السريعة جدا

Downloads: 0 This Week

Last Update: 2019-08-04
See Project
18

Text to Speech - Revamped Open Source

Coder Revolt - Text to Speech is a completely Free to use software for commercial/non-commercial rights from business to personal projects. The software itself is designed to be very simple but yet loaded with incredible features that are targeted at solving your problems. Open Source files will be released this week (6/30/2016) and they will be in the full version of Visual Basic and converted version in CSharp C#.

Downloads: 0 This Week

Last Update: 2016-06-27
See Project
19

Text to Speech for Video

create wav files for video character speech by typing in dialogue

Choose from the "voices" available, and type in what you want the computer to say. A wave file called sounds.wav is stored to the output sub folder. Output is intended primarily for users who need speech for animated characters in videos.

Downloads: 0 This Week

Last Update: 2015-10-16
See Project
20

Java-TTS Converter

Text To Speech converter

This application can convert the given text into speech.The speech may converted into seperate audio file for future use. we can give .txt,.doc,.docx text file as a input can convert the text in to audible .wav file.

Downloads: 0 This Week

Last Update: 2013-11-25
See Project
21

Text to Voice

'Text to Voice' or 'Text to Speech' is 1 of the coolest Firefox add-ons. It gives ur brwsr the pwr of speech. Select txt, clck the bttn on the bttm rite & this add-on spks the selectd txt 4 u. Isn't it brllant? Moreovr odio file cn b dnloaded as

Downloads: 3 This Week

Last Update: 2014-04-25
See Project
22

Abhyaas, which means to study in Marathi

...It aims at displaying images and words corrosponding to alphabets but it integrates a "Text-To-Speech" (TTS) system to say out the words. The word list and corresponding images will be loaded from a XML file.

Downloads: 0 This Week

Last Update: 2014-02-21
See Project
23

Concrete Voice

Concrete Voice is a text to speech program. It can read the time, anounce weather, read text file, save text files to audio files, open any text file (supports all text encoding formats) and many more advance stuff!

Downloads: 0 This Week

Last Update: 2016-01-31
See Project
24

FacialDAS

This project aims to distribute a facial animation system with speech, developed to brazilian portuguese case. This system is composed by many modules: movement extraction, facial animation and speech, through a text-to-speech system.

Downloads: 0 This Week

Last Update: 2015-09-22
See Project
25

Kathak (bangla text to speech synthesis)

Kathak is a Bangla text to speech synthesizer which can produce the speech from unicode bangla text input. We are developing the system based on Festvox framework.The Festival Speech Synthesis System was used as a base for developing Kathak.

Downloads: 0 This Week

Last Update: 2013-03-13
See Project

Previous
You're on page 1
2
Next

Related Searches

voice cloning

audio visualization vlc

speech

daw

whisper-windows-x64.exe

demucs

tts

ai pro free

voice to text

subtitle

Related Categories

Artificial Intelligence

Multimedia

Internet

Business

Education

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise