Page 2 | video making free download

BettaFish

Public opinion analysis system

...It also integrates multimodal processing, enabling it to parse images and video alongside text.

Downloads: 0 This Week

Last Update: 2026-02-17

Whisper-WebUI

A web UI for easy subtitle generation using Whisper models

Whisper WebUI is an open-source browser-based interface that simplifies the use of Whisper speech recognition models by providing an intuitive graphical environment for transcription, translation, and subtitle generation. Built with Gradio, it allows users to upload audio or video files, process them locally, and generate accurate text outputs without relying on command-line tools. The platform integrates optimized implementations such as faster-whisper, significantly improving transcription speed and reducing memory usage compared to standard models. It supports multiple input sources including local files, YouTube content, and microphone input, making it versatile for different workflows. ...

Downloads: 6 This Week

Last Update: 2026-03-18

LLM Colosseum

Benchmark LLMs by fighting in Street Fighter 3

LLM-Colosseum is an experimental benchmarking framework designed to evaluate the capabilities of large language models through gameplay interactions rather than traditional text-based benchmarks. The system places language models inside the environment of the classic video game Street Fighter III, where they must interpret the game state and decide which actions to perform during combat. This setup creates a dynamic environment that tests reasoning, situational awareness, and decision-making abilities in real time. Instead of relying purely on reward signals as in reinforcement learning agents, the models analyze contextual information and generate strategic actions based on the game environment. ...

Downloads: 0 This Week

Last Update: 2026-03-07

Lyra 2

Project Lyra: Open Generative 3D World Models

...The architecture is designed to handle both 3D and 4D scene generation, making it suitable for applications such as simulation, gaming, and virtual environments. By emphasizing open implementations, the project provides researchers and developers with access to cutting-edge generative modeling techniques.

Downloads: 6 This Week

Last Update: 2026-04-18

Godot RL Agents

An Open Source package that allows video game creators

godot_rl_agents is a reinforcement learning integration for the Godot game engine. It allows AI agents to learn how to interact with and play Godot-based games using RL algorithms. The toolkit bridges Godot with Python-based RL libraries like Stable-Baselines3, making it possible to create complex and visually rich RL environments natively in Godot.

Downloads: 0 This Week

Last Update: 2025-03-13

HunyuanOCR

OCR expert VLM powered by Hunyuan's native multimodal architecture

...HunyuanOCR handles complex documents: multi-column layouts, tables, mathematical formulas, mixed languages, handwritten or stylized fonts, receipts, tickets, and even video-frame subtitles. The project provides code, pretrained weights, and inference instructions, making it feasible to deploy locally or on a server, and to integrate with applications.

Downloads: 0 This Week

Last Update: 2026-04-08

Cradle framework

The Cradle framework is a first attempt at General Computer Control

Cradle is an open-source framework designed to enable AI agents to perform complex computer tasks by interacting with software environments in a way similar to human users. The system introduces the concept of General Computer Control, where AI agents receive screenshots as input and perform actions through simulated keyboard and mouse operations. This approach allows agents to interact with any software interface without relying on specialized APIs or predefined automation scripts. The...

Downloads: 0 This Week

Last Update: 2026-03-06

SAHI

A lightweight vision library for performing large-scale object detection

A lightweight vision library for performing large-scale object detection and instance segmentation. Object detection and instance segmentation are among the most important application areas in computer vision. However, detecting small objects and running inference on large images remain major issues in practice. SAHI helps developers overcome these real-world problems with a set of vision utilities. Detection of small objects and objects far away in the scene is a major...

Downloads: 0 This Week

Last Update: 2025-09-28

Jina

Build cross-modal and multimodal applications on the cloud

Jina is a framework for building cross-modal and multimodal applications on the cloud. It turns a proof of concept into a production-ready service and handles the infrastructure complexity, making advanced solution engineering and cloud-native technologies accessible to every developer. Build applications that deliver fresh insights from multiple data types such as text, image, audio, video, 3D mesh, and PDF with Jina AI’s DocArray. It provides a polyglot gateway that supports the gRPC, WebSockets, HTTP, and GraphQL protocols with TLS, and an intuitive design pattern for high-performance microservices. ...

Downloads: 0 This Week

Last Update: 2024-11-12

BWR Ai watermark remover

AI-powered tool to quickly remove watermarks from videos flawlessly

...Its intuitive interface features white and blue design elements for easy navigation, making it ideal for content creators, video editors, social media managers, and marketers. Blue Wave Remover enhances video visuals by removing unwanted logos and overlays, ensuring professional, clean footage for repurposing, presentations, and online sharing. Key functions include automatic watermark detection, AI-powered inpainting, background reconstruction, and seamless integration into existing workflows. ...

1 Review

Downloads: 22 This Week

Last Update: 2025-10-29

vocal-separate

An extremely simple tool for separating vocals and background music

...After processing, the tool outputs separate WAV files for each extracted stem, making it easy to export and use in audio editing or remix software.

Downloads: 2 This Week

Last Update: 2026-02-17

FrankMocap

A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator

...Outputs include textured meshes, joint locations, and model parameters that can be exported to common DCC tools and game engines. The codebase offers pretrained models, clear inference scripts, and utilities to visualize results, making single-camera motion capture approachable on commodity hardware. Researchers and creators use it for motion studies, AR/VR prototyping, character animation, and human-in-the-loop editing.

Downloads: 1 This Week

Last Update: 2025-10-07

AI Atelier

A Chinese and English version of AI art creation software based on Disco Diffusion

Based on Disco Diffusion, we have developed a Chinese and English version of the AI art creation software "AI Atelier". We offer both text-to-image models (Disco Diffusion and VQGAN+CLIP) and text-to-text models (GPT-J-6B and GPT-NEOX-20B) as options. The license requires making available the complete source code of licensed works and modifications, which include larger works using a licensed work, under the same license. Copyright and license notices must be preserved. When a modified version is used to provide a...

Downloads: 0 This Week

Last Update: 2023-03-23

Yukki Music Bot

Telegram Group Calls Streaming bot with some useful features

Yukki Music Bot is a Telegram music and video bot written in Python using Pyrogram and Py-Tgcalls. It lets you stream songs, videos, and even live streams from various sources in your group calls.

Downloads: 8 This Week

Last Update: 2024-09-19

Face Mask Detection

Face Mask Detection system based on computer vision and deep learning

...Our face mask detector doesn't use any morphed masked images dataset and the model is accurate. Owing to the use of MobileNetV2 architecture, it is computationally efficient, thus making it easier to deploy the model to embedded systems (Raspberry Pi, Google Coral, etc.).

1 Review

Downloads: 0 This Week

Last Update: 2022-05-26

Consistent Depth

We estimate dense, flicker-free, geometrically consistent depth

...This approach achieves improved geometric consistency and visual stability compared to prior monocular reconstruction methods. The project can process challenging hand-held video footage, including footage with moderate dynamic motion, making it practical for real-world usage.

Downloads: 2 This Week

Last Update: 3 days ago

vid2vid

PyTorch implementation of our method for high-resolution

...It uses generative adversarial networks combined with temporal modeling strategies to maintain coherence and reduce flickering artifacts. The framework is capable of producing high-resolution outputs and is widely used in research related to video synthesis, animation, and simulation. It also supports diverse input modalities, making it flexible for different types of video generation tasks.

Downloads: 0 This Week

Last Update: 2026-03-18

Python Computer Vision Framework

The Python Computer Vision Framework is an open project designed for anyone interested in computer vision. It aims to make computer vision easier, more structured, and MATLAB-free. It may also be used in other artistic and scientific areas.

1 Review

Downloads: 0 This Week

Last Update: 2013-04-10

Search Results for "video making" - Page 2

Showing 43 open source projects for "video making"
