Implementation of Video Diffusion Models
Implementation of Make-A-Video, new SOTA text-to-video generator
A GUI tool for extracting hard-coded subtitles (hardsub) from videos
A Gradio web UI for running Large Language Models like LLaMA
Application that simplifies the installation of AI-related projects
AI Upscaler for Blender using Real-ESRGAN
A minimal implementation of diffusion models for text generation
A version of the AI art creation software based on Disco Diffusion
Image/video AI upscaler app (BSRGAN)
Training and implementation of chatbots leveraging a GPT-like architecture
The most powerful and modular diffusion model GUI, API and backend
InvokeAI is a leading creative engine for Stable Diffusion models
NVR with real-time local object detection for IP cameras
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
A deep learning toolkit for Text-to-Speech, battle-tested in research
Implementation of Phenaki Video, which uses MaskGIT
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Open source personal AI Assistant for Linux, Windows and Mac
Implementation of MusicLM music generation model in PyTorch
Label Studio is a multi-type data labeling and annotation tool
Stable Diffusion built-in to Blender
Generates code using AI based on your text prompt
A walk along memory lane
Implementation of NÜWA, an attention network for text-to-video synthesis
CLIP + FFT/DWT/RGB = text to image/video