Unified Multimodal Understanding and Generation Models
The official Python SDK for the xAI API
Open source multimodal creative AI assistant with infinite canvas tool
Generate Any 3D Scene in Seconds
Capable of understanding text, audio, vision, video
Official implementation of DreamCraft3D
A Systematic Framework for Interactive World Modeling
Movie metadata scraper and organizer for media libraries and NFO
AI assistant based on large models that can actively think and plan
Native InstantID support for ComfyUI
RGBD video generation model conditioned on camera input
A Universal Customization Method for Single and Multi Conditioning
Project Lyra: Open Generative 3D World Models
Open source personal AI Assistant for Linux, Windows and Mac
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
An extensive node suite that enables ComfyUI to process 3D inputs
LISA: Reasoning Segmentation via Large Language Model
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Simplest working implementation of Stylegan2
Jittor is a high-performance deep learning framework
Stable Diffusion with Core ML on Apple Silicon
Plug-n-play module turning text-to-image models into animation
A powerful, free and open-source tool for TextureAtlases/Spritesheets
AI Suite for upscaling, interpolating & restoring images/videos
RyuX Passgen is an open-source password generator