Implementation of Make-A-Video, a new SOTA text-to-video generator
Implementation of Video Diffusion Models
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
AI Upscaler for Blender using Real-ESRGAN
Application that simplifies the installation of AI-related projects
A Gradio web UI for running Large Language Models like LLaMA
Image/video AI upscaler app (BSRGAN)
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
The most powerful and modular diffusion model GUI, API, and backend
InvokeAI is a leading creative engine for Stable Diffusion models
A deep learning toolkit for Text-to-Speech, battle-tested in research
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
NVR with realtime local object detection for IP cameras
Processing all kinds of unstructured data, for tasks such as reverse image search
Open-source personal AI assistant for Linux, Windows, and Mac
Implementation of Phenaki Video, which uses MaskGIT
Generates code using AI based on your text prompt
Label Studio is a multi-type data labeling and annotation tool
Toolkit for conversational AI
A framework to enable multimodal models to operate a computer
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion
PDF to Markdown with vision models
Convert AI papers to GUI
textgen: text generation models
AI-based photo editing website for changing image backgrounds