ffmpeg-release-essentials free download

ChatTTS webUI & API

A simple native web interface that uses ChatTTS to synthesize text

...For convenience, there is a prepackaged Windows build: you download a release archive, extract it, and double-click app.exe to start the web UI, which opens on localhost:9966.
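A minimal sketch for confirming the web UI came up after launching app.exe, assuming it listens on localhost:9966 as stated above. The helper name `is_listening` is hypothetical and is not part of the project.

```python
import socket


def is_listening(host="localhost", port=9966, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection resolves the host and attempts a TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and name-resolution failures.
        return False


if __name__ == "__main__":
    if is_listening():
        print("Web UI appears to be reachable at http://localhost:9966")
    else:
        print("Nothing is listening on localhost:9966 yet")
```

This only checks that something accepts connections on the port; it does not verify that the listener is the ChatTTS web UI.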

Downloads: 13 This Week

Last Update: 2025-11-28

SoniTranslate is a video translation and dubbing system that produces synchronized target-language audio tracks for existing video content. It provides a web UI built with Gradio, allowing users to upload a video, choose source and target languages, and then run a pipeline that handles transcription, translation and re-synthesis of speech. Under the hood, it uses advanced speech and diarization models to separate speakers, align audio with timecodes and respect subtitle timing, which lets...

Downloads: 37 This Week

Last Update: 2025-11-28

ChatTTS

A generative speech model for daily dialogue

ChatTTS is an open-source conversational text-to-speech model optimized for dialogue, developed by 2Noise. Trained on 100,000+ hours of English and Chinese conversation data, it excels at generating expressive prosody—pauses, interjections, laughter—for more natural-sounding speech synthesis in assistant and chatbot applications.

Downloads: 1 This Week

Last Update: 2025-06-26

CSM (Conversational Speech Model)

A Conversational Speech Generation Model

The CSM (Conversational Speech Model) is a speech generation model developed by Sesame AI that creates RVQ audio codes from text and audio inputs. It uses a Llama backbone and a smaller audio decoder to produce audio codes for realistic speech synthesis. The model has been fine-tuned for interactive voice demos and is hosted on platforms like Hugging Face for testing. CSM offers a flexible setup and is compatible with CUDA-enabled GPUs for efficient execution.

Downloads: 6 This Week

Last Update: 2025-03-19

JAVT - Just Another Voice Transformer

Just Another Speech Recognition and Text to Speech software.

JAVT, or Just Another Voice Transformer (formerly Just Another Video Transcriber), is speech recognition software that also supports text-to-speech and simple media conversion. JAVT converts video files to WAV audio using ffmpeg, then transcribes the audio to text using either Microsoft SAPI or CMU Sphinx. You can also open a text file and have JAVT read it aloud through text-to-speech conversion.
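The video-to-WAV step can be sketched roughly as follows. This is not JAVT's own code: the function name `build_wav_command` is hypothetical, and the 16 kHz mono 16-bit PCM settings are an assumption (a common input format for speech recognizers), not settings confirmed by the project.

```python
import shlex
from pathlib import Path


def build_wav_command(video_path, sample_rate=16000):
    """Build an ffmpeg command that extracts the audio track as a WAV file."""
    video = Path(video_path)
    wav = video.with_suffix(".wav")  # same name, .wav extension
    return [
        "ffmpeg",
        "-y",                    # overwrite the output file if it exists
        "-i", str(video),        # input video
        "-vn",                   # drop the video stream
        "-acodec", "pcm_s16le",  # 16-bit little-endian PCM (assumed)
        "-ar", str(sample_rate), # sample rate in Hz (assumed 16 kHz)
        "-ac", "1",              # mono (assumed)
        str(wav),
    ]


if __name__ == "__main__":
    cmd = build_wav_command("talk.mp4")
    print(shlex.join(cmd))
    # To actually run the conversion (requires ffmpeg on PATH):
    #   import subprocess; subprocess.run(cmd, check=True)
```

The command is built as a list rather than a single string so that file names containing spaces are passed to ffmpeg unchanged.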

Downloads: 7 This Week

Last Update: 2020-08-19

Search Results for "ffmpeg-release-essentials"

Showing 5 open source projects for "ffmpeg-release-essentials"

ChatTTS webUI & API

SoniTranslate

ChatTTS

CSM (Conversational Speech Model)

JAVT - Just Another Voice Transformer
