Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence
Text to Speech Software
Search Results

Search Results for "environment-modules"

x

Sort By:

Relevance

Clear All Filters

OS

Windows 25
Linux 20
Mac 19
More...
BSD 11
ChromeOS 10
Desktop Operating Systems 1
Server Operating Systems 1

Category

Artificial Intelligence 25
Desktop Environment 3
Multimedia 3
Business 2
Scientific/Engineering 1

License

OSI-Approved Open Source 23
Public Domain 1

Translations

Programming Language

Python 14
C 3
C++ 2
TypeScript 2
More...
Visual Basic 2
BASIC 1
C# 1
Java 1

Status

Production/Stable 4
Beta 3

Showing 25 open source projects for "environment-modules"

View related business solutions

Text to Speech Windows Clear Filters & Widen Search

Build Agents and Models on One Platform
Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.

Try It Free
$300 Free Credits to Build on Google Cloud
New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.

Claim $300 Free
1

Voice-Pro

Comprehensive Gradio WebUI for audio processing

Voice-Pro is the best gradio WebUI for transcription, translation and text-to-speech. It can be easily installed with one click. Create a virtual environment using Miniconda, running completely separate from the Windows system (fully portable). Supports real-time transcription and translation, as well as batch mode.

1 Review

Downloads: 15 This Week

Last Update: 2025-12-05
See Project
2

AI Runner

Offline inference engine for art, real-time voice conversations

...At the core of its LLM stack is a mode-based architecture with specialized “modes” such as Author, Code, Research, QA and General, and a workflow manager that automatically routes user requests to the right agent based on the task. The project has a strong focus on developer ergonomics, with thorough development guidelines, environment configuration using .env variables, and a clear structure for tests, tools and agents.

Downloads: 3 This Week

Last Update: 2025-12-11
See Project
3

NVIDIA NeMo

Toolkit for conversational AI

...NeMo has separate collections for Automatic Speech Recognition (ASR), Natural Language Processing (NLP), and Text-to-Speech (TTS) models. Each collection consists of prebuilt modules that include everything needed to train on your data. Every module can easily be customized, extended, and composed to create new conversational AI model architectures. Conversational AI architectures are typically large and require a lot of data and compute for training. NeMo uses PyTorch Lightning for easy and performant multi-GPU/multi-node mixed-precision training. ...

Downloads: 4 This Week

Last Update: 2026-04-22
See Project
4

openctp

Provides CTP stock options and Zhongtai Securities XTP

...Its core idea is to wrap heterogeneous stock and derivatives trading gateways such as Zhongtai XTP, Huaxin Qidian TORA, and others with CTPAPI compatible interfaces, so existing CTP programs can connect simply by swapping dynamic libraries rather than rewriting code. The project offers a comprehensive simulation environment similar to SimNow that supports futures, options, A share stocks, funds, bonds, and stock options, and even extends to Hong Kong and US markets. In addition to the core library, openctp supplies Python bindings for CTPAPI and stock options APIs, making it easier to build strategies, tools, and analytics in Python. It also develops full featured and lightweight trading clients like TickTrader, TickTraderMini, and ViTrader, which support multiple desks and markets.

Downloads: 0 This Week

Last Update: 2026-01-08
See Project
Error to trace to log to deploy. One click. No SSH.
Catch the cause before the pager goes off.

AppSignal links every error to the trace, the trace to the log, the log to the deploy that shipped it.

Free 30 days.
5

Speech-AI-Forge

Speech-AI-Forge is a project developed around TTS generation model

...At its core, it acts as a hub that wires together multiple speech-related capabilities, including TTS, speech-to-text and LLM-based control flows, behind a consistent interface. The system is designed to be deployed in several ways: you can try it online via hosted demos, spin it up in a one-click Colab environment, run it in Docker containers, or set it up locally with its environment preparation scripts. It is model-agnostic and advertises support for a variety of TTS and speech models such as ChatTTS, CosyVoice, Fish-Speech, FireredTTS and others, as well as Whisper-based ASR, giving you a flexible playground for experimenting with different speech stacks. ...

Downloads: 0 This Week

Last Update: 2026-02-02
See Project
6

OpenAI-Compatible Edge-TTS API

Free, high-quality text-to-speech API endpoint to replace OpenAI

...The server supports Server-Sent Events (SSE) for streaming audio, enabling low-latency playback in chat UIs and other interactive tools. A Docker image is provided for one-command deployment, and environment variables can be used to configure default voice, language, response format, authentication, and logging options.

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
7

EasyVoice

Open source text-to-speech tool, supports extra-long text

easyVoice is an open-source text-to-speech platform aimed at turning long-form text and novels into high-quality audio, with a strong focus on usability and scalability. It provides a web interface where users can paste or upload large texts and generate speech and subtitles in a single workflow, even for works exceeding 100,000 characters. The system supports multi-role voice acting, letting users assign different neural voices to different characters or narrative roles and configure...

Downloads: 3 This Week

Last Update: 2026-01-26
See Project
8

NVIDIA NeMo Framework

Scalable generative AI framework built for researchers and developers

NVIDIA NeMo is a scalable, cloud-native generative AI framework aimed at researchers and PyTorch developers working on large language models, multimodal models, and speech AI (ASR and TTS), with growing support for computer vision. It provides collections of domain-specific modules and reference implementations that make it easier to pre-train, fine-tune, and deploy very large models on multi-GPU and multi-node infrastructure. NeMo 2.0 introduces a Python-based configuration system, replacing YAML with more flexible, programmable configs that can be versioned and composed for different experiments. The framework builds on PyTorch Lightning–style modular abstractions, so training scripts are composed from reusable components for data loading, models, optimizers, and schedulers, which simplifies experimentation and adaptation. ...

Downloads: 0 This Week

Last Update: 2026-04-22
See Project
9

MiniMax-MCP

Official MiniMax Model Context Protocol (MCP) server

...The server is written in Python and distributed under the MIT license, with a pyproject.toml and uv-based workflow that makes installation and execution reproducible. Configuration is handled through JSON files that tell MCP clients how to launch the server (typically via uvx minimax-mcp) and which environment variables to use for the API key, host, and output directory. The README carefully explains region-specific API hosts for global and mainland users to avoid invalid-key errors, and documents both local stdio transport and SSE-based network transport modes.

Downloads: 1 This Week

Last Update: 2026-05-21
See Project
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
10

YandexStation

Management of Yandex Station and other smart home devices

YandexStation is a Home Assistant custom component that integrates Yandex-branded smart speakers and other devices with Alice into a unified smart home automation environment. It supports both local and cloud control, depending on the device type, with Yandex speakers often supporting both modes and third-party speakers typically limited to cloud control. The integration exposes playback and volume controls, as well as text-to-speech capabilities that send spoken messages in Alice’s voice directly to the speakers. ...

Downloads: 0 This Week

Last Update: 3 days ago
See Project
11

Polyglot

Cross-platform AI language practice app

Polyglot is a cross platform AI language practice application that runs as a desktop app and also offers a web version. It is built around conversational large language models and Azure based text to speech services, turning them into an interactive environment for speaking practice in multiple languages. Users can define custom AI personas, choose languages, and configure their own OpenAI and Azure keys so they retain control over which backends they use. The app supports speech recognition with quick keyboard shortcuts, allowing learners to hold down a key to speak and release it to submit for recognition and response. ...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
12

ChatTTS_colab

One-click deployment (including offline integration package)

ChatTTS_colab is a wrapper project around the ChatTTS model that focuses on “one-click” deployment, especially in Google Colab. It provides an integrated offline bundle and scripts for Windows and macOS so users can run ChatTTS locally without wrestling with complex environment setup. The repository includes Colab notebooks that launch a Gradio-based web UI and expose streaming TTS, making it possible to listen to generated audio as it is produced. A distinctive feature is the “voice gacha” system, which batch-generates many distinct voice timbres and allows users to save the ones they like into a curated voice library. ...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
13

PNotes

PNotes is light-weight, flexible, skinnable manager of virtual notes on your desktop. It supports multiple languages, individual note's settings, transparency and scheduling. Absolutely portable as well - no traces in registry. PNotes.NET edition requires .NET framework 4 Client Profile

77 Reviews

Downloads: 255 This Week

Last Update: 2026-05-03
See Project
14

QChartist

Free and Open Source Technical Analysis Charting Software

QChartist is a free and open source technical analysis charting software. Its purpose is to provide a complete set of tools to perform technical analysis on charts and data. It helps to make forecasts mainly for markets but can also be used for weather or any quantifiable data. The program is flexible and its functionalities can be easily extended. You can draw geometrical shapes on your charts or plot programmable indicators from your data. It is also possible to filter or merge data. I got...

1 Review

Downloads: 19 This Week

Last Update: 3 days ago
See Project
15

StyleTTS 2

Towards Human-Level Text-to-Speech through Style Diffusion

...StyleTTS2 supports both single-speaker and multi-speaker configurations, with the ability to sample or transfer styles from reference audio, making it powerful for expressive TTS and character voices. The repository includes training scripts, configuration files, and pre-trained auxiliary modules such as a text aligner, pitch extractor, and PL-BERT-based linguistic encoder.

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
16

VALL-E X

Open source implementation of Microsoft's VALL-E X zero-shot TTS model

...VALL-E-X supports zero-shot cross-lingual synthesis, meaning a monolingual speaker’s voice can be used to speak other languages without additional training. It also preserves aspects of the acoustic environment, such as background noise or reverb, making the generated audio feel more like it came from the same setting as the prompt. The repository includes Python APIs, sample scripts, ready-to-use voice presets, and demos hosted on Hugging Face Spaces and Google Colab so users can try it.

Downloads: 1 This Week

Last Update: 2025-11-28
See Project
17

ekho

Chinese text-to-speech engine

...The code structure implies that Ekho may support hooking into audio input/output streams, perhaps for tasks like audio capture, playback, transformation, or simple voice-based operations. It might serve as a lightweight base or utility for building custom audio-related workflows, such as streaming, playback orchestration, or combining audio modules. Given the limited explicit features, Ekho would be best suited for developers or hobbyists who want a flexible foundation to add their own logic for TTS.

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
18

Audio Webui

A webui for different audio related Neural Networks

...Installation is streamlined through automatic installers and platform-specific scripts that create a virtual environment, install dependencies, and launch the web app with minimal manual setup. For more advanced users, it exposes a rich set of command-line flags to control behavior such as skipping installation, disabling venv, changing model cache directories, sharing Gradio links, setting passwords, and specifying themes or ports.

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
19

Mocking Bird

Clone a voice in 5 seconds to generate arbitrary speech in real-time

...It builds on deep-learning based TTS / voice-cloning technology (in the lineage of projects such as Real-Time-Voice-Cloning), but extends it with support for Mandarin Chinese and multiple Chinese speech datasets — broadening its applicability beyond English. The codebase is implemented in Python (with PyTorch) and includes modules for encoder, synthesizer, vocoder, preprocessing, and inference, as well as demo scripts and a web-server interface for easier experimentation or deployment. MockingBird supports both using pretrained models and training your own synthesizer (with custom datasets), giving flexibility for voice-cloning or custom-voice synthesis depending on your needs.

1 Review

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
20

Speect

Speect is a multilingual TTS system. It offers a full text-to-speech system with various API's, as well as an environment for research and development of TTS systems and voices. It is written in ANSI C and uses a plug-in mechanism for extensions. Speect also includes an extensive set of Python bindings for quick implementation of new ideas, these bindings are derived from SWIG interface files and can easily be extended for other languages supported by SWIG.

Downloads: 0 This Week

Last Update: 2013-05-30
See Project
21

Epos TTS System

Epos is a language independent rule-driven Text-to-Speech (TTS) system

Epos is a language independent rule-driven Text-to-Speech (TTS) system primarily designed to serve as a research tool. Epos is (or tries to be) independent of the language processed, linguistic description method, and computing environment.

1 Review

Downloads: 2 This Week

Last Update: 2015-03-31
See Project
22

Romanian Modular TTS

Modular Text-to-Speech system with a Matlab backbone. Your modules can be attached to this backbone via executable files (independent of the programming language used) respecting the XML interface requirements.

Downloads: 0 This Week

Last Update: 2013-04-11
See Project
23

FacialDAS

This project aims to distribute a facial animation system with speech, developed to brazilian portuguese case. This system is composed by many modules: movement extraction, facial animation and speech, through a text-to-speech system.

Downloads: 1 This Week

Last Update: 2015-09-22
See Project
24

ComTalk

ComTalk uses MSAgent Technology, created by Microsoft, to make an easy to use interface with Speech Recognition and Text-To-Speech technologies. The MSAgent system is the same used to produce the assistants in Microsoft Word and other Microsoft Programs.

Downloads: 0 This Week

Last Update: 2014-07-11
See Project
25

Uplink Desktop

Uplink Desktop is a unique Windows shell replacement based on the game Uplink from Introversion. It has many features including a cd player, webbrowser, favorites list, filebrowser, text-to-speech, calculator, and system info subs.

Downloads: 0 This Week

Last Update: 2013-03-20
See Project

Previous
You're on page 1
Next

Related Searches

ai

ekho

whisper-windows-x64.exe

voice cloning

demucs

offline ai

ai pro free

ai offline

ai chatbot offline

nvidia

Related Categories

Artificial Intelligence

Desktop Environment

Multimedia

Business

Scientific/Engineering

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise