Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence
Text to Speech Software
Search Results

Search Results for "open .dae files"

x

Sort By:

Relevance

Clear All Filters

OS

Linux 28
Windows 26
Mac 25
More...
BSD 14
ChromeOS 11
Desktop Operating Systems 1

Category

Artificial Intelligence 28
- Text to Speech 28
Multimedia 6
Scientific/Engineering 3
Communications 1
Internet 1
Mobile 1
Text Editors 1

License

OSI-Approved Open Source 27

Translations

English 4
Arabic 1

Programming Language

Python 13
C 2
C++ 2
C# 2
More...
Go 2
Java 2
TypeScript 2
Unix Shell 2
AWK 1
Visual Basic .NET 1

Status

Beta 4
Alpha 2
Pre-Alpha 1
Production/Stable 1

Showing 28 open source projects for "open .dae files"

View related business solutions

Text to Speech Linux Clear Filters & Widen Search

MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
1

kokoro-onnx

TTS with kokoro and onnx runtime

kokoro-onnx is a text-to-speech toolkit that wraps the Kokoro neural TTS model in an easy-to-use ONNX Runtime interface, so you can generate speech from Python with minimal setup. It focuses on running efficiently on commodity hardware, including macOS with Apple Silicon, while still delivering near real-time performance for many use cases. The project ships prebuilt model files and a simple example script, so you can go from installation to producing an audio.wav file in just a few steps....

Downloads: 247 This Week

Last Update: 2025-11-28
See Project
2

edge-tts

Use Microsoft Edge's online text-to-speech service from Python

edge-tts is a Python module and command-line tool that gives you direct access to Microsoft Edge’s online text-to-speech service without needing the Edge browser, Windows, or any API key. It wraps the same cloud voices used by Edge, exposing them through a simple CLI (edge-tts, edge-playback) and a Python API, so you can script high-quality speech generation in your own applications. The tool lets you list available voices, specify locale and voice name, and generate audio files in common...

Downloads: 16 This Week

Last Update: 2026-03-22
See Project
3

pyttsx3

Offline Text To Speech synthesis for python

pyttsx3 is an offline text-to-speech library for Python that wraps native speech engines instead of calling cloud APIs. It is designed to work entirely without an internet connection, making it suitable for local automation, kiosks, accessibility tools, and embedded applications. On Windows it uses SAPI5, on Linux it typically uses eSpeak or eSpeak-NG, and on macOS it can use NSSpeechSynthesizer or AVSpeechSynthesizer, giving it broad cross-platform compatibility. The library exposes a...

Downloads: 14 This Week

Last Update: 2025-11-28
See Project
4

WhisperLive

A nearly-live implementation of OpenAI's Whisper

WhisperLive is a “nearly live” implementation of OpenAI’s Whisper model focused on real-time transcription. It runs as a server–client system in which the server hosts a Whisper backend and clients stream audio to be transcribed with very low delay. The project supports multiple inference backends, including Faster-Whisper, NVIDIA TensorRT, and OpenVINO, allowing you to target GPUs and different CPU architectures efficiently. It can handle microphone input, pre-recorded audio files, and...

Downloads: 14 This Week

Last Update: 2026-03-17
See Project
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
5

gTTS

Python library and CLI tool to interface with Google Translate

gTTS (Google Text-to-Speech) is a Python library and command-line tool that wraps the speech functionality of Google Translate. It lets you send text to the Google Translate TTS endpoint and receive spoken audio back as MP3 data, either written to a file, a file-like object, or standard output. The library is designed to handle long texts, using a speech-specific sentence tokenizer that keeps intonation and punctuation natural while splitting requests into acceptable chunks. It supports...

Downloads: 9 This Week

Last Update: 2025-11-28
See Project
6

Audiblez

Generate audiobooks from e-books

Audiblez is a tool for generating high-quality .m4b audiobooks directly from .epub e-books using the Kokoro-82M neural text-to-speech model. It focuses on making audiobook creation easy and fast: from a single command, the tool splits an e-book into chapters, synthesizes audio for each section, and then merges the results into a structured audiobook with chapter-based WAV files and a final .m4b container. The Kokoro-82M model it uses is compact (82M parameters) yet natural sounding, trained...

Downloads: 21 This Week

Last Update: 2025-11-30
See Project
7

Auto Synced & Translated Dubs

Automatically translates the text of a video based on a subtitle file

Auto-Synced-Translated-Dubs is a toolchain that automatically translates and re-dubs videos using AI voices while keeping the new speech aligned to the original timing via subtitle files. It assumes you have a human-made SRT (or similar) subtitle file; the script then uses translation services such as Google Cloud or DeepL to generate translated subtitle tracks in one or more target languages. Using the timestamps of each subtitle line, it computes the required duration of each spoken...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
8

KrillinAI

Video translation and dubbing tool powered by LLMs

KrillinAI is an end-to-end content localization, translation, and dubbing tool aimed at helping creators transform videos into multiple languages with minimal manual effort. It integrates several stages of the pipeline: video acquisition (either from local files or remote via download tools), speech recognition (ASR), subtitle segmentation and alignment, machine translation (with context-aware translation to preserve semantics), and voice cloning + text-to-speech (TTS) to produce dubbed...

Downloads: 7 This Week

Last Update: 2025-11-28
See Project
9

EasyVoice

Open source text-to-speech tool, supports extra-long text

...It offers streaming playback so audio starts almost immediately, even for very long inputs, and automatically generates subtitle files suitable for video production or translation workflows. Under the hood, easyVoice uses a modern stack with Vue 3 and Element Plus on the front end, Node.js and Express on the back end, and TTS engines such as Microsoft Azure TTS and OpenAI-compatible APIs, orchestrated through ffmpeg.

Downloads: 6 This Week

Last Update: 2026-01-26
See Project
Forever Free Full-Stack Observability | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
10

Matcha-TTS

A fast TTS architecture with conditional flow matching

Matcha-TTS is a non-autoregressive neural text-to-speech architecture that uses conditional flow matching to generate speech quickly while maintaining natural quality. It models speech as an ODE-based generative process, and conditional flow matching lets it reach high-quality audio in only a few synthesis steps, which greatly reduces latency compared to score-matching diffusion approaches. The model is fully probabilistic, so it can generate diverse realizations of the same text while still...

Downloads: 4 This Week

Last Update: 2025-11-28
See Project
11

MiniMax-MCP

Official MiniMax Model Context Protocol (MCP) server

MiniMax-MCP is the official Model Context Protocol (MCP) server for accessing MiniMax’s multimodal generative APIs from MCP-compatible clients. It acts as a bridge between tools like Claude Desktop, Cursor, Windsurf, OpenAI Agents, and the MiniMax platform, exposing capabilities such as text-to-speech, voice cloning, image generation, text-to-image, video generation, image-to-video, text-to-video, and music generation. The server is written in Python and distributed under the MIT license,...

Downloads: 3 This Week

Last Update: 2026-05-21
See Project
12

sag

Like the macOS say command, but with a modern voice

sag is a command-line text-to-speech utility inspired by the macOS say command but powered by modern ElevenLabs voice synthesis technology. The project allows users to stream synthesized speech directly to speakers, save audio files, or list and manage available voices through a lightweight terminal interface. Designed for speed and convenience, sag supports voice selection, playback rate adjustments, output format inference, and configurable API endpoints for flexible deployment. It...

Downloads: 0 This Week

Last Update: 2026-05-06
See Project
13

Lingvo

Framework for building neural networks

...Lingvo includes reference models and configurations for domains like machine translation, automatic speech recognition, language modeling, image understanding, and 3D object detection. Centralized hyperparameter configuration files allow researchers to share exact experiment setups so others can retrain and compare results reliably.

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
14

RS Media Robot Development Kit

A series of open source files and programs available to use for developing programs to work with the WowWee Robotics RSMedia Robot. These include a USB serial console, a cross-compiler, a firmware dump program, text-to-speech and source code.

Downloads: 0 This Week

Last Update: 2026-01-14
See Project
15

StyleTTS 2

Towards Human-Level Text-to-Speech through Style Diffusion

StyleTTS2 is a state-of-the-art text-to-speech system that aims for human-level naturalness by combining style diffusion, adversarial training, and large speech language models. It extends the original StyleTTS idea by introducing a style diffusion model that can sample rich, realistic speaking styles conditioned on reference speech, allowing highly expressive and diverse prosody. The architecture uses a two-stage training process and leverages an auxiliary speech language model to guide...

Downloads: 1 This Week

Last Update: 2025-11-28
See Project
16

Bert-VITS2

VITS2 backbone with multilingual-bert

Bert-VITS2 is a neural text-to-speech project that combines a VITS2 backbone with a multilingual BERT front-end to produce high-quality speech in multiple languages. The core idea is to use BERT-style contextual embeddings for text encoding while relying on a refined VITS2 architecture for acoustic generation and vocoding. The repository includes everything needed to train, fine-tune, and run the model, from configuration files to preprocessing scripts, spectrogram utilities, and training...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
17

TTS-Vue

Microsoft speech synthesis tool, built with Electron

TTS-Vue is a desktop text-to-speech application built with Electron, Vue, ElementPlus, and Vite, focused on using Microsoft’s official Speech API for high-quality neural synthesis. It wraps the Microsoft TTS WebSocket interface in a clean UI so users can paste or load text, choose voices, tweak parameters, and export audio without touching raw API calls. The app supports SSML (Speech Synthesis Markup Language), letting power users specify fine-grained control over pronunciation, pauses,...

Downloads: 73 This Week

Last Update: 2025-11-28
See Project
18

ekho

Chinese text-to-speech engine

ekho is a project with relatively sparse documentation, but from the repository it appears to be a small-scale tool for audio processing and playback, possibly with features for speech synthesis or manipulation. The repo includes scripts and configuration files suggesting interactions with media/audio handling libraries. Because of limited README detail, it seems targeted at users comfortable reading and modifying code, rather than end users expecting polished UIs. The code structure implies...

Downloads: 7 This Week

Last Update: 2025-11-28
See Project
19

edge-TTS-record

Tool that can record speech synthesis

edge-TTS-record is a Windows-based tool that records speech synthesized by the Microsoft Edge browser’s online TTS voices and saves the result as .wav audio files. The idea is simple but effective: since Edge’s online TTS voices (such as “Xiaoxiao” or “Yunyang” for Chinese) are often high-quality, this tool provides a way to “capture” them offline for later use. Users can type or paste text, preview the speech, and then trigger the recorder; the system automatically captures the audio output...

Downloads: 2 This Week

Last Update: 2025-11-28
See Project
20

Al-Mintiq: Arabic eSpeak

Arabic voice files for eSpeak system

Arabic files and voices for eSpeak Text to speech system, المنطيق : ملفات اللغة العربية لبرنامج توليد الكلام من النص إسبيك

Downloads: 3 This Week

Last Update: 2017-12-05
See Project
21

Text to Speech - Revamped Open Source

Coder Revolt - Text to Speech is a completely Free to use software for commercial/non-commercial rights from business to personal projects. The software itself is designed to be very simple but yet loaded with incredible features that are targeted at solving your problems. Open Source files will be released this week (6/30/2016) and they will be in the full version of Visual Basic and converted version in CSharp C#.

Downloads: 0 This Week

Last Update: 2016-06-27
See Project
22

Text to Speech for Video

create wav files for video character speech by typing in dialogue

Choose from the "voices" available, and type in what you want the computer to say. A wave file called sounds.wav is stored to the output sub folder. Output is intended primarily for users who need speech for animated characters in videos.

Downloads: 0 This Week

Last Update: 2015-10-16
See Project
23

Cotovía

Text-to-Speech System for Galician and Spanish

Cotovía is a unit-selection text-to-speech system for Galician and Spanish. Cotovía is distributed under the GPL3.0+ license, but each of the avaliable speaker voices has its own license. The speakers available at sourceforge are free for commercial and non-commercial uses. Another speaker, free for non-commercial uses, is avaliable through external links (see the Blog section). Cotovia has been developed by the University de Vigo and the center 'Ramón Piñeiro' for Research in Humanities,...

Downloads: 23 This Week

Last Update: 2018-01-02
See Project
24

Speect

...It is written in ANSI C and uses a plug-in mechanism for extensions. Speect also includes an extensive set of Python bindings for quick implementation of new ideas, these bindings are derived from SWIG interface files and can easily be extended for other languages supported by SWIG. Speect is free and open source software. As a collection it is distributed under a MIT license.

Downloads: 0 This Week

Last Update: 2013-05-30
See Project
25

email2wav - Text To Speech for emails

A script for producing a collection of audio files containing your emails.

Downloads: 0 This Week

Last Update: 2013-04-18
See Project

Previous
You're on page 1
2
Next

Related Searches

tts

ekho

dubbing

srt to speech

pyttsx3

srt file

tts voices

jarvis voice hindi

subtitle

mobile robot

Related Categories

Artificial Intelligence

Multimedia

Scientific/Engineering

Communications

Internet

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise