Document Image Parsing via Heterogeneous Anchor Prompting”
Use Microsoft Edge's online text-to-speech service from Python
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Build cross-modal and multimodal applications on the cloud
Open Source Computer Vision Library
AI-powered tool to quickly remove watermarks from videos flawlessly
Suite with Real-ESRGAN, BSRGAN , RealESRNet, IRCNN, GFPGAN & RIFE.
A Customizable Image-to-Video Model based on HunyuanVideo
Un "Screen Recorder" codificado 100% con AI, escrito en python.
A feature packed DJ console and internet radio client for Linux users
PyExe: YouTube thumbnail downloader (type-b) [I.S.A]
Database system for building simpler and faster AI-powered application
MahaKurawa.My.ID MP4 VA Extract is a tool to extract mp4 file content
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
Utility to concatanate, trim, and transcode video files.
Real-ESRGAN aims at developing Practical Algorithms
VapourSynth Single Image Super-Resolution Generative Adversarial
LiVES is a Video Editing System. It is designed to be simple to use, y
Deep Learning (Flower Book) mathematical derivation
IPTV/NVR/CCTV/Video cloud https://fastocloud.com
Visual tracking library based on PyTorch
Basic Utilities for PyTorch Natural Language Processing (NLP)