Page 2 | video processing free download

Dolphin

Document Image Parsing via Heterogeneous Anchor Prompting”

...It is designed to integrate with other tools and libraries and provide stable playback or media-processing pipelines, while remaining open-source so that users can inspect, extend, and adapt it.

Downloads: 0 This Week

Last Update: 2025-12-17

See Project

edge-tts

Use Microsoft Edge's online text-to-speech service from Python

edge-tts is a Python module and command-line tool that gives you direct access to Microsoft Edge’s online text-to-speech service without needing the Edge browser, Windows, or any API key. It wraps the same cloud voices used by Edge, exposing them through a simple CLI (edge-tts, edge-playback) and a Python API, so you can script high-quality speech generation in your own applications. The tool lets you list available voices, specify locale and voice name, and generate audio files in common...

Downloads: 27 This Week

Last Update: 2025-12-12

See Project

GLM-4.5V

GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

GLM-4.5V is the preceding iteration in the GLM-V series that laid much of the groundwork for general multimodal reasoning and vision-language understanding. It embodies the design philosophy of mixing visual and textual modalities into a unified model capable of general-purpose reasoning, content understanding, and generation, while already supporting a wide variety of tasks: from image captioning and visual question answering to content recognition, GUI-based agents, video understanding,...

Downloads: 0 This Week

Last Update: 2025-12-18

See Project

Jina

Build cross-modal and multimodal applications on the cloud

Jina is a framework that empowers anyone to build cross-modal and multi-modal applications on the cloud. It uplifts a PoC into a production-ready service. Jina handles the infrastructure complexity, making advanced solution engineering and cloud-native technologies accessible to every developer. Build applications that deliver fresh insights from multiple data types such as text, image, audio, video, 3D mesh, PDF with Jina AI’s DocArray. Polyglot gateway that supports gRPC, Websockets, HTTP,...

Downloads: 0 This Week

Last Update: 2024-11-12

See Project

OpenCV

Open Source Computer Vision Library

The Open Source Computer Vision Library has >2500 algorithms, extensive documentation and sample code for real-time computer vision. It works on Windows, Linux, Mac OS X, Android, iOS in your browser through JavaScript. Languages: C++, Python, Julia, Javascript Homepage: https://opencv.org Q&A forum: https://forum.opencv.org/ Documentation: https://docs.opencv.org Source code: https://github.com/opencv Please pay special attention to our tutorials!...

124 Reviews

Downloads: 3,458 This Week

Last Update: 2025-12-31

See Project

BWR Ai watermark remover

AI-powered tool to quickly remove watermarks from videos flawlessly

Blue Wave Remover is an advanced AI-driven video watermark removal software designed to effortlessly eliminate logos, text, timestamps, and watermarks from video content. Utilizing cutting-edge computer vision and generative AI algorithms, it accurately detects and removes both static and moving watermarks while preserving the original video's quality, colors, and clarity. The program supports popular video formats and offers batch processing for fast and efficient removal on multiple files. ...

1 Review

Downloads: 11 This Week

Last Update: 2025-10-29

See Project

DPG for X (dpg4x)

DPG for X (dpg4x) is a program that was designed to allow the easy creation of DPG video files on Linux, but now it can also run on OS X and Windows. DPG is a special format of MPEG-1 video specifically made for playback on a Nintendo DS.

7 Reviews

Downloads: 138 This Week

Last Update: 2025-12-07

See Project

Warlock-Studio

Suite with Real-ESRGAN, BSRGAN , RealESRNet, IRCNN, GFPGAN & RIFE.

v5.1.1. Warlock-Studio is a Windows application that uses Real-ESRGAN, BSRGAN, IRCNN, GFPGAN, RealESRNet, RealESRAnime and RIFE Artificial Intelligence models to upscale, restore faces, interpolate frames and reduce noise in images and videos. the application supports GPU acceleration (including multi-GPU setups) and offers batch processing for large workloads. It includes drag-and-drop handling for single or multiple files, optional pre-resize functions, and an automatic tiling system...

Downloads: 19 This Week

Last Update: 2026-01-02

See Project

HunyuanVideo-I2V

A Customizable Image-to-Video Model based on HunyuanVideo

HunyuanVideo-I2V is a customizable image-to-video generation framework developed by Tencent, extending the capabilities of HunyuanVideo. It allows for high-quality video creation from still images, using PyTorch and providing pre-trained model weights, inference code, and customizable training options. The system includes a LoRA training code for adding special effects and enhancing video realism, aiming to offer versatile and scalable solutions for generating videos from static image inputs.

1 Review

Downloads: 7 This Week

Last Update: 2025-03-10

See Project

ESRM

Un "Screen Recorder" codificado 100% con AI, escrito en python.

🎥 Effect Screen Recorder Master (ESRM) ESRM es un programa para grabación de pantalla con efectos visuales en tiempo real, hecho 100% con chat GPT, está escrito en python y con interfaz gráfica en Custom Tkinter, se usa FFMPEG para realizar las grabaciones (en cualquier caso, debe instalar FFMPEG y añadirlo al PATH). Está escrito en python v3.12.8, pero está disponible desde las 3.11.6 y superiores. Usa CustomTkinter para una interfaz moderna y ffmpeg para realizar las grabaciones en...

Downloads: 0 This Week

Last Update: 2025-03-17

See Project

MLT Multimedia Framework

A multimedia authoring and processing framework and a video playout server for television broadcasting.

17 Reviews

Downloads: 2 This Week

Last Update: 2025-12-31

See Project

Internet DJ Console

A feature packed DJ console and internet radio client for Linux users

Conceived as an internet radio Shoutcast/Icecast client and DJ console IDJC has two main media players, a background track player, effects buttons, crossfader, webm, aac, ogg, and mp3 streaming, stream automation timers, aux input, voice and VoIP integration. Media file formats include: mp3, ogg, flac, wma, wav, m4a, m3u, xspf, pls, and cue sheet support, IRC track and station announcements, uses jack audio connection kit to provide a flexible audio chain. This list of features is by no...

32 Reviews

Downloads: 17 This Week

Last Update: 2026-01-10

See Project

PyExe:YT thumbnail downloader (b) [ISA]

PyExe: YouTube thumbnail downloader (type-b) [I.S.A]

PyExe: YouTube thumbnail downloader (type-b) [Improved.Simplified.Alternative] Download YouTube video thumbnails. Compatible only for windows OS.

Downloads: 0 This Week

Last Update: 2024-06-03

See Project

EvaDB

Database system for building simpler and faster AI-powered application

Over the last decade, AI models have radically changed the world of natural language processing and computer vision. They are accurate on various tasks ranging from question answering to object tracking in videos. To use an AI model, the user needs to program against multiple low-level libraries, like PyTorch, Hugging Face, Open AI, etc. This tedious process often leads to a complex AI app that glues together these libraries to accomplish the given task.

Downloads: 1 This Week

Last Update: 2023-11-19

See Project

MahaKurawa.My.ID MP4 VA Extract

MahaKurawa.My.ID MP4 VA Extract is a tool to extract mp4 file content

MahaKurawa.My.ID MP4 VA Extract is a tool to extract MP4 file video and audio content. It also have ability to extract MKV file and single SSA Subtitle file. This software will not convert any video and audio file from MP4 file. This software just extract them as it is. This tool is made for that specific purpose. This tool "MahaKurawa.My.ID MP4 VA Extract v.1.0.3.1" can be obtained for free on https://www.mahakurawa.my.id.

Downloads: 0 This Week

Last Update: 2023-12-14

See Project

FrankMocap

A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator

FrankMocap is a monocular 3D human capture system that estimates body, hand, and optionally face pose from a single RGB image or video. It regresses parametric human models (e.g., SMPL/SMPL-X) directly, producing temporally stable meshes and joint angles suitable for animation or analytics. The pipeline couples a robust 2D keypoint detector with 3D mesh regression networks and priors that keep results anatomically plausible. It can run frame-by-frame or with temporal smoothing, and includes demo apps for live webcam capture as well as batch processing. ...

Downloads: 0 This Week

Last Update: 2025-10-07

See Project

trimmer

Utility to concatanate, trim, and transcode video files.

This is a utility for simple post-production processing of video files. It will concatanate multiple files together, trim the beginning and end, and transcode. Uses VLC for the player and ffmpeg for transcoding. See the Wiki for more.

Downloads: 0 This Week

Last Update: 2023-04-19

See Project

Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms

Real-ESRGAN is a highly popular open-source project that provides practical algorithms for general image and video restoration using deep learning-based super-resolution techniques. It extends the original Enhanced Super-Resolution Generative Adversarial Network (ESRGAN) approach by training on synthetic degradations to make results more robust on real-world images, effectively enhancing resolution, reducing noise/artifacts, and reconstructing fine detail in low-quality imagery. The...

Downloads: 47 This Week

Last Update: 2025-12-11

See Project

VSGAN

VapourSynth Single Image Super-Resolution Generative Adversarial

Single Image Super-Resolution Generative Adversarial Network (GAN) which uses the VapourSynth processing framework to handle input and output image data. Transform, Filter, or Enhance your input video, or the VSGAN result with VapourSynth, a Script-based NLE. You can chain models or re-run the model twice-over (or more). Have low VRAM? Don’t worry! The Network will be applied in quadrants of the image to reduce up-front VRAM usage.

Downloads: 0 This Week

Last Update: 2023-03-29

See Project

LiVES

LiVES is a Video Editing System. It is designed to be simple to use, y

LiVES mixes realtime video performance and non-linear editing in one professional quality application. It is designed to be simple to use, yet powerful. It is small in size, yet it has many advanced features. Using LiVES, you can start editing and making video right away, without having to worry about formats, frame sizes, or framerates. It is a very flexible tool which is used by both professional VJ's and video editors - mix and switch clips from the keyboard, use dozens of realtime...

15 Reviews

Downloads: 8 This Week

Last Update: 2020-11-08

See Project

DeepLearning

Deep Learning (Flower Book) mathematical derivation

...At the same time, it also introduces deep learning techniques used by practitioners in the industry, including deep feedforward networks, regularization, optimization algorithms, convolutional networks, sequence modeling and practical methods, and investigates topics such as natural language processing, Applications in speech recognition, computer vision, online recommender systems, bioinformatics, and video games. Finally, the Deep Learning book provides research directions covering theoretical topics including linear factor models, autoencoders, representation learning, structured probabilistic models, etc.

Downloads: 1 This Week

Last Update: 2022-08-02

See Project

FastoCloud PRO

IPTV/NVR/CCTV/Video cloud https://fastocloud.com

IPTV/Video cloud Features: Cross-platform (Linux, MacOSX, FreeBSD, Raspbian/Armbian) GPU/CPU Encode/Decode/Post Processing Stream statistics CCTV Adaptive hls streams Load balancing Temporary urls HLS push EPG scanning Subtitles to text conversions AD insertion Logo overlay Video effects Relays Timeshifts Catchups Playlists Restream/Transcode from online streaming services like Youtube, Twitch Mozaic Many Outputs Physical Inputs Streaming Protocols File Formats Presets Vods/Series server-side support Pay per view channels Channels on demand HTTP Live Streaming (HLS) server-side support Public API, client server communication via JSON RPC Protocol gzip compression Deep learning video analysis Supported deep learning frameworks: Tensorflow NCSDK Caffe ML Hardware:

Downloads: 0 This Week

Last Update: 2020-06-20

See Project

PyTracking

Visual tracking library based on PyTorch

A general python framework for visual object tracking and video object segmentation, based on PyTorch. Official implementation of the RTS (ECCV 2022), ToMP (CVPR 2022), KeepTrack (ICCV 2021), LWL (ECCV 2020), KYS (ECCV 2020), PrDiMP (CVPR 2020), DiMP (ICCV 2019), and ATOM (CVPR 2019) trackers, including complete training code and trained models.

Downloads: 0 This Week

Last Update: 2023-08-14

See Project

PyTorch Natural Language Processing

Basic Utilities for PyTorch Natural Language Processing (NLP)

PyTorch-NLP is a library for Natural Language Processing (NLP) in Python. It’s built with the very latest research in mind, and was designed from day one to support rapid prototyping. PyTorch-NLP comes with pre-trained embeddings, samplers, dataset loaders, metrics, neural network modules and text encoders. It’s open-source software, released under the BSD3 license. With your batch in hand, you can use PyTorch to develop and train your model using gradient descent. For example, check out...

Downloads: 0 This Week

Last Update: 2022-08-09

See Project

pytivo

pyTivo is both an HMO and GoBack server. Similar to TiVo Desktop pyTivo loads many standard video compression codecs and outputs mpeg2 video to the TiVo. However, pyTivo is able to load MANY more file types than TiVo Desktop.

2 Reviews

Downloads: 5 This Week

Last Update: 2018-06-18

See Project

Search Results for "video processing" - Page 2

Showing 66 open source projects for "video processing"

Dolphin

edge-tts

GLM-4.5V

Jina

OpenCV

BWR Ai watermark remover

DPG for X (dpg4x)

Warlock-Studio

HunyuanVideo-I2V

ESRM

MLT Multimedia Framework

Internet DJ Console

PyExe:YT thumbnail downloader (b) [ISA]

EvaDB

MahaKurawa.My.ID MP4 VA Extract

FrankMocap

trimmer

Real-ESRGAN

VSGAN

LiVES

DeepLearning

FastoCloud PRO

PyTracking

PyTorch Natural Language Processing

pytivo

Search Results for "video processing" - Page 2

Showing 66 open source projects for "video processing"

Dolphin

edge-tts

GLM-4.5V

Jina

OpenCV

BWR Ai watermark remover

DPG for X (dpg4x)

Warlock-Studio

HunyuanVideo-I2V

ESRM

MLT Multimedia Framework

Internet DJ Console

PyExe:YT thumbnail downloader (b) [ISA]

EvaDB

MahaKurawa.My.ID MP4 VA Extract

FrankMocap

trimmer

Real-ESRGAN

VSGAN

LiVES

DeepLearning

FastoCloud PRO

PyTracking

PyTorch Natural Language Processing

pytivo

Related Searches

Related Categories