video free download - SourceForge

SoniTranslate

Synchronized Translation for Videos

SoniTranslate is a video translation and dubbing system that produces synchronized target-language audio tracks for existing video content. It provides a web UI built with Gradio, allowing users to upload a video, choose source and target languages, and then run a pipeline that handles transcription, translation and re-synthesis of speech. Under the hood, it uses advanced speech and diarization models to separate speakers, align audio with timecodes and respect subtitle timing, which lets the generated dub track stay in sync with the original video structure. ...

Downloads: 40 This Week

Last Update: 2025-11-28

See Project

KrillinAI

Video translation and dubbing tool powered by LLMs

...The tool offers “one-click” workflows and desktop versions, lowering the barrier for users who may not be familiar with video editing or audio processing pipelines.

Downloads: 3 This Week

Last Update: 2025-11-28

See Project

Open Vision Agents by Stream

Build Vision Agents quickly with any model or video provider

Open Vision Agents by Stream is an open source framework from Stream for building real time, multimodal AI agents that watch, listen, and respond to live video streams. It focuses on combining video understanding models, such as YOLO and Roboflow based detectors, with real time large language models like OpenAI Realtime and Gemini Live to create interactive experiences. The framework uses Stream’s ultra low latency edge network so agents can join sessions quickly and maintain very low audio and video latency while processing frames and generating responses. ...

Downloads: 0 This Week

Last Update: 2025-12-16

See Project

MiniMax-MCP

Official MiniMax Model Context Protocol (MCP) server

...It acts as a bridge between tools like Claude Desktop, Cursor, Windsurf, OpenAI Agents, and the MiniMax platform, exposing capabilities such as text-to-speech, voice cloning, image generation, text-to-image, video generation, image-to-video, text-to-video, and music generation. The server is written in Python and distributed under the MIT license, with a pyproject.toml and uv-based workflow that makes installation and execution reproducible. Configuration is handled through JSON files that tell MCP clients how to launch the server (typically via uvx minimax-mcp) and which environment variables to use for the API key, host, and output directory. ...

Downloads: 0 This Week

Last Update: 2025-11-28

See Project

FastRTC

The python library for real-time communication

FastRTC is a Python library designed to simplify real-time communication (RTC), especially for audio and video streaming applications. It abstracts away much of the complexity that typically comes with implementing WebRTC by providing a simple interface — e.g. a Stream class — that can be mounted within a web backend (for example a FastAPI application). This makes it particularly well suited for building real-time voice (or video) interfaces for applications such as AI assistants, live chat, or collaborative audio/video tools. ...

Downloads: 0 This Week

Last Update: 2025-11-28

See Project

EasyVoice

Open source text-to-speech tool, supports extra-long text

...The system supports multi-role voice acting, letting users assign different neural voices to different characters or narrative roles and configure parameters such as rate, pitch, and volume per role. It offers streaming playback so audio starts almost immediately, even for very long inputs, and automatically generates subtitle files suitable for video production or translation workflows. Under the hood, easyVoice uses a modern stack with Vue 3 and Element Plus on the front end, Node.js and Express on the back end, and TTS engines such as Microsoft Azure TTS and OpenAI-compatible APIs, orchestrated through ffmpeg.

Downloads: 4 This Week

Last Update: 2025-11-28

See Project

Auto Synced & Translated Dubs

Automatically translates the text of a video based on a subtitle file

...The tool then time-stretches or compresses each TTS clip to match the original speech duration exactly, which preserves lip-sync and rhythm as closely as possible without manual editing. Finally, it combines all the clips into a single dubbed audio track that can be muxed with the original video, along with new translated subtitle files.

Downloads: 3 This Week

Last Update: 2025-11-28

See Project

comfyui-mixlab-nodes

Workflow and speech recognition app

...It introduces a “Workflow-to-APP” concept, where a ComfyUI graph can be transformed into a Web App through an AppInfo node, complete with categories, batch prompts, and editable configurations. The project also brings Real-time Design features like screen capture and floating video nodes, enabling creative pipelines that mix live screen content, generative models, and visual effects. For audio and speech, it provides nodes for SpeechRecognition and SpeechSynthesis, plus workflows that combine voice generation with real-time face swapping and other audio-visual effects. On the AI side, it integrates multiple LLM providers (cloud and local), supports OpenAI-compatible endpoints, Siliconflow models, and includes prompt-focused utilities for random prompt generation, Chinese prompts, clip interrogation.

Downloads: 1 This Week

Last Update: 2025-11-28

See Project

edge-tts

Use Microsoft Edge's online text-to-speech service from Python

...The tool lets you list available voices, specify locale and voice name, and generate audio files in common formats like MP3 or WAV. It also supports generating subtitle files (such as SRT or VTT) alongside the speech, which is handy for video narration, e-learning, or accessibility workflows. From the CLI you can adjust parameters such as speaking rate, volume, and pitch, giving you some control over prosody without diving into SSML. The library is asynchronous under the hood, which makes it efficient for batch jobs or web services that need to synthesize many utterances concurrently.

Downloads: 18 This Week

Last Update: 2025-12-12

See Project

Conversations

App in java for chatting to a generative A.I. (involving tts and stt)

...The application is prepared so that only one user occupies the server's resources, so if the server is busy, in theory it will not let you connect. There is a demo video that shows how it works: https://frojasg1.com:8443/resource_counter/resourceCounter?operation=countAndForward&url=https%3A%2F%2Ffrojasg1.com%2Fdemos%2Faplicaciones%2Fchat%2F20240815.Demo.Chat.mp4%3Forigin%3Dsourceforge&origin=web

Downloads: 2 This Week

Last Update: 2025-10-15

See Project

JAVT - Just Another Voice Transformer

Just Another Speech Recognition and Text to Speech software.

JAVT or Just Another Voice Transformer (formerly, it is called Just Another Video Transcriber) is a Speech Recognition software that also support text to Speech and simple media conversion. JAVT allows you to convert from video files to audio wav file using ffmpeg, and then transcribe the audio file to text using either Microsoft SAPI or CMU Sphinx. You can also open a text file and allow JAVT to read it out for you through text to speech conversion.

Downloads: 3 This Week

Last Update: 2020-08-19

See Project

Anonymous Animator

Make your own anonymous video using this application

This application originated from a simple text to speech java application. Now it is a text to video application with an anonymous theme. You can make your own anonymous video simply by typing in your text.

1 Review

Downloads: 0 This Week

Last Update: 2016-05-06

See Project

Text to Speech for Video

create wav files for video character speech by typing in dialogue

Choose from the "voices" available, and type in what you want the computer to say. A wave file called sounds.wav is stored to the output sub folder. Output is intended primarily for users who need speech for animated characters in videos.

Downloads: 0 This Week

Last Update: 2015-10-16

See Project

Search Results for "video"

Showing 13 open source projects for "video"

SoniTranslate

KrillinAI

Open Vision Agents by Stream

MiniMax-MCP

FastRTC

EasyVoice

Auto Synced & Translated Dubs

comfyui-mixlab-nodes

edge-tts

Conversations

JAVT - Just Another Voice Transformer

Anonymous Animator

Text to Speech for Video

Search Results for "video"

Showing 13 open source projects for "video"

SoniTranslate

KrillinAI

Open Vision Agents by Stream

MiniMax-MCP

FastRTC

EasyVoice

Auto Synced & Translated Dubs

comfyui-mixlab-nodes

edge-tts

Conversations

JAVT - Just Another Voice Transformer

Anonymous Animator

Text to Speech for Video

Related Searches

Related Categories