Chat & pretrained large vision language model
Open source clone of the Age of Empires II engine
A Python library for audio
A SOTA open-source image editing model
Multi-modal large language model designed for audio understanding
AI-powered tool to quickly remove watermarks from videos flawlessly
A Model Context Protocol server for searching and analyzing arXiv
GenAI Processors is a lightweight Python library
Dataset of GPT-2 outputs for research in detection, biases, and more
Clarity in the current fast-paced mess of Open Source innovation
Windows application to search multiple pdfs and chat with them
PowerPoint Generator: Your Gateway to Effortless Presentations
Overcoming Data Limitations for High-Quality Video Diffusion Models
Suite with Real-ESRGAN, BSRGAN , RealESRNet, IRCNN, GFPGAN & RIFE.
Video,audio&Files Downloader&Convert with built-in browser with AI.
Alfred workflow using ChatGPT, DALL·E 2 and other models for chatting
Let us control diffusion models
Open-source framework that gives you AI Agents
AI R&D Efficiency Improvement Research: Do-It-Yourself Training LoRA
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
A walk along memory lane
A collection of practical tips can be found at the bottom of this page
Code release for ConvNeXt V2 model
PyTorch implementation of MAE