A GUI tool for extracting hard-coded subtitle (hardsub) from videos
The most powerful and modular diffusion model GUI, api and backend
Open-source MCP server that gives your coding agent
Uncommon Objects in 3D dataset
The data structure for multimodal data
Build AI-powered semantic search applications
The Triton Inference Server provides an optimized cloud
Build cross-modal and multimodal applications on the cloud
An extremely simple tool for separating vocals and background music
AI-powered tool to quickly remove watermarks from videos flawlessly
OpenFieldAI is an AI based Open Field Test Rodent Tracker
Chatbot daemon that connects to your favorite chat services
A computer vision framework to create and deploy apps in minutes
GFPGAN aims at developing Practical Algorithms
Based on the Disco Diffusion, version of the AI art creation software
Official implementation for UniVL video and language training models
Gluon CV Toolkit
Software tool that converts text to video for more engaging experience
We estimate dense, flicker-free, geometrically consistent depth
Easy-OCR solution and Tesseract trainer for GNU/Linux
The leading software for creating deepfakes
Deep Learning (Flower Book) mathematical derivation
Basic Utilities for PyTorch Natural Language Processing (NLP)
Identification codes
World's simplest facial recognition api for Python & the command line