cloud mini project free download

Fish Speech

SOTA Open Source TTS

Fish Speech is a state-of-the-art open-source text-to-speech project that has evolved into the OpenAudio series of advanced TTS models. The repository hosts the code and tooling for training, fine-tuning, and serving high-quality TTS, while the current flagship models (OpenAudio-S1 and S1-mini) are distributed via Fish Audio’s playground and Hugging Face. The models are evaluated with Seed TTS metrics and achieve exceptionally low word and character error rates, indicating strong intelligibility and alignment between text and audio. ...

Downloads: 27 This Week

Last Update: 2026-03-10

Auto Synced & Translated Dubs

Automatically translates the text of a video based on a subtitle file

Auto-Synced-Translated-Dubs is a toolchain that automatically translates and re-dubs videos using AI voices while keeping the new speech aligned to the original timing via subtitle files. It assumes you have a human-made SRT (or similar) subtitle file; the script then uses translation services such as Google Cloud or DeepL to generate translated subtitle tracks in one or more target languages. Using the timestamps of each subtitle line, it computes the required duration of each spoken...
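The timing step described above can be illustrated with a minimal sketch. It assumes standard SRT timing lines of the form `HH:MM:SS,mmm --> HH:MM:SS,mmm`; the function name `cue_durations` and the sample subtitles are hypothetical and are not taken from the project's code.

```python
import re
from datetime import timedelta

# SRT timing lines look like "00:00:01,000 --> 00:00:03,500".
TIMING = re.compile(
    r"(\d{2}):(\d{2}):(\d{2}),(\d{3})\s*-->\s*(\d{2}):(\d{2}):(\d{2}),(\d{3})"
)

def cue_durations(srt_text):
    """Return the duration in seconds of every cue found in an SRT string."""
    durations = []
    for match in TIMING.finditer(srt_text):
        h1, m1, s1, ms1, h2, m2, s2, ms2 = (int(g) for g in match.groups())
        start = timedelta(hours=h1, minutes=m1, seconds=s1, milliseconds=ms1)
        end = timedelta(hours=h2, minutes=m2, seconds=s2, milliseconds=ms2)
        durations.append((end - start).total_seconds())
    return durations

# Hypothetical two-cue subtitle file used only for illustration.
sample = """1
00:00:01,000 --> 00:00:03,500
Hello there.

2
00:00:04,000 --> 00:00:06,250
Welcome to the video.
"""

print(cue_durations(sample))
```

Each duration is the time budget that a translated, synthesized line would need to fit into to stay aligned with the original cue.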

Downloads: 3 This Week

Last Update: 2025-11-28

pyttsx3

Offline text-to-speech synthesis for Python

pyttsx3 is an offline text-to-speech library for Python that wraps native speech engines instead of calling cloud APIs. It is designed to work entirely without an internet connection, making it suitable for local automation, kiosks, accessibility tools, and embedded applications. On Windows it uses SAPI5, on Linux it typically uses eSpeak or eSpeak-NG, and on macOS it can use NSSpeechSynthesizer or AVSpeechSynthesizer, giving it broad cross-platform compatibility. The library exposes a...
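As a rough sketch of the platform mapping described above, the snippet below picks a pyttsx3 driver name from `sys.platform` and speaks a string. The helper names `driver_for` and `speak` are hypothetical. Calling `pyttsx3.init()` with no arguments normally selects a suitable driver by itself, so the explicit `driverName` is only there to make the mapping visible.

```python
import sys

def driver_for(platform=sys.platform):
    """Map a sys.platform value to the pyttsx3 driver name for that OS."""
    if platform.startswith("win"):
        return "sapi5"   # Windows: SAPI5
    if platform == "darwin":
        return "nsss"    # macOS: NSSpeechSynthesizer
    return "espeak"      # Linux and others: eSpeak / eSpeak-NG

def speak(text, rate=150):
    """Speak text through the local engine (requires pyttsx3 to be installed)."""
    import pyttsx3  # imported lazily so driver_for() works without the package
    engine = pyttsx3.init(driverName=driver_for())
    engine.setProperty("rate", rate)  # speaking rate in words per minute
    engine.say(text)
    engine.runAndWait()  # blocks until the queued speech has finished

print(driver_for())
```

With pyttsx3 and a native speech engine installed, `speak("Hello from an offline engine.")` would read the sentence aloud without any network access.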

Downloads: 23 This Week

Last Update: 2025-11-28

EPUB to Audiobook Converter

EPUB to audiobook converter, optimized for Audiobookshelf

...It reads each chapter from an EPUB file, generates audio using a chosen text-to-speech backend, and outputs separate MP3 files with chapter titles preserved as metadata to make navigation easier. The project supports multiple TTS providers, including Microsoft Azure TTS, EdgeTTS, OpenAI TTS, local Piper, and Kokoro via an OpenAI-compatible endpoint, allowing users to choose between cloud and self-hosted voices. A recent addition is a Gradio-based WebUI, which wraps all configuration options in a graphical interface for users who prefer not to work with the command line. ...

Downloads: 14 This Week

Last Update: 2026-02-02

comfyui-mixlab-nodes

Workflow and speech recognition app

...On the AI side, it integrates multiple LLM providers (cloud and local), supports OpenAI-compatible endpoints and Siliconflow models, and includes prompt-focused utilities for random prompt generation, Chinese prompts, and CLIP interrogation.

Downloads: 6 This Week

Last Update: 2025-11-28

FastKoko

Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model

FastKoko is a self-hosted text-to-speech server built around the Kokoro-82M model and exposed through a FastAPI backend. It is designed to be easy to deploy via Docker, with separate CPU and GPU images so that users can choose between pure CPU inference and NVIDIA GPU acceleration. The project exposes an OpenAI-compatible speech endpoint, which means existing code that talks to the OpenAI audio API can often be pointed at a Kokoro-FastAPI instance with minimal changes. It supports multiple...
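To show what "OpenAI-compatible" can mean in practice, here is a minimal sketch that builds a request in the shape of the OpenAI `/v1/audio/speech` call using only the standard library. The base URL `http://localhost:8880`, the model name `kokoro`, and the voice name `af_bella` are assumptions for illustration; check the server's own documentation for the values it actually accepts.

```python
import json
import urllib.request

def build_speech_request(text, base_url="http://localhost:8880",
                         model="kokoro", voice="af_bella"):
    """Build a POST request shaped like the OpenAI speech endpoint.

    base_url, model, and voice are placeholder assumptions, not confirmed
    defaults of any particular server.
    """
    payload = {
        "model": model,
        "input": text,
        "voice": voice,
        "response_format": "mp3",
    }
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

def synthesize(text, out_path="speech.mp3", **kwargs):
    """Send the request and write the returned audio bytes to out_path."""
    request = build_speech_request(text, **kwargs)
    with urllib.request.urlopen(request) as response, open(out_path, "wb") as f:
        f.write(response.read())
    return out_path

# Building the request does not touch the network; synthesize() does.
req = build_speech_request("Hello from a self-hosted server.")
print(req.get_method(), req.full_url)
```

Because only the base URL differs from a hosted API call, switching an existing client between a cloud provider and a local server can come down to changing one setting.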

Downloads: 2 This Week

Last Update: 2026-06-06

SoniTranslate

Synchronized Translation for Videos

...Under the hood, it uses advanced speech and diarization models to separate speakers, align audio with timecodes, and respect subtitle timing, which lets the generated dub track stay in sync with the original video structure. The project supports a wide range of languages for translation, spanning major world languages (English, Spanish, French, German, Chinese, Arabic, etc.) and many regional or less widely spoken languages, making it suitable for broad internationalization. It offers multiple usage modes, including a Colab notebook for cloud-based experimentation, a Hugging Face Space demo for quick trials, and instructions.

Downloads: 24 This Week

Last Update: 2025-11-28

TTS Server

Android system TTS application with Microsoft demo interface

tts-server-android is an Android system TTS application that acts both as a powerful local text-to-speech engine and as a flexible TTS “server” for other apps via HTTP. It includes a built-in Microsoft TTS demo interface and lets users configure custom HTTP requests, making it possible to route TTS through various cloud providers or local servers. The app can import other local TTS engines, giving Android devices a unified interface to multiple voices and providers, and it features simple narration/dialogue detection based on Chinese quotation marks so it can read text with different styles for narration and dialogue. It is built with Kotlin and Jetpack Compose, and the project is structured into multiple libraries (lib-tts, lib-server, lib-compose, lib-database, etc.) to separate UI, server logic, and TTS handling. ...
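Quotation-mark-based narration/dialogue detection could work roughly as in the sketch below. This is a hypothetical illustration, not the app's actual logic (the app itself is written in Kotlin): it assumes dialogue is any text enclosed in the full-width style quotes “ and ”, and treats everything else as narration.

```python
import re

# Text between “ and ” is treated as dialogue; everything else is narration.
QUOTED = re.compile(r"“([^”]*)”")

def split_narration_dialogue(text):
    """Split text into ordered (kind, chunk) pairs based on “…” quotes."""
    segments = []
    pos = 0
    for match in QUOTED.finditer(text):
        if match.start() > pos:
            segments.append(("narration", text[pos:match.start()]))
        segments.append(("dialogue", match.group(1)))
        pos = match.end()
    if pos < len(text):
        segments.append(("narration", text[pos:]))
    return segments

# Hypothetical sample sentence using the same quote characters.
sample = "He smiled and said: “Hello.” Then he left."
for kind, chunk in split_narration_dialogue(sample):
    print(kind, repr(chunk))
```

Each segment could then be routed to a different voice or speaking style, one for narration and one for dialogue.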

Downloads: 35 This Week

Last Update: 2025-11-28

Search Results for "cloud mini project"

Showing 8 open source projects for "cloud mini project"
