Official Python inference and LoRA trainer package
"VideoRAG: Chat with Your Videos
AI-powered video clipping and highlight generation
[CVPR 2025 Best Paper Award] VGGT
Edit videos with Claude Code
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
ImageBind One Embedding Space to Bind Them All
A python tool that uses GPT-4, FFmpeg, and OpenCV
Automatically translates the text of a video based on a subtitle file
PyTorch code and models for VJEPA2 self-supervised learning from video
Large Multimodal Models for Video Understanding and Editing
MARS5 speech model (TTS) from CAMB.AI
Repo of Qwen2-Audio chat & pretrained large audio language model
Open-weight, large-scale hybrid-attention reasoning model
Audiocraft is a library for audio processing and generation
Real-time music generation using stable diffusion techniques AI
Twitch YouTube bot. Automatically make video compilations
A dataset of short, object-centric video clips