Stable Diffusion web UI
Improve human sleep through scientifically
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Implementation of Vision Transformer, a simple way to achieve SOTA
4M: Massively Multimodal Masked Modeling
A reactive notebook for Python
Spatiotemporal Signal Processing with Neural Machine Learning Models
Open Source Differentiable Computer Vision Library
High-Resolution Image Synthesis with Latent Diffusion Models
CLIP + FFT/DWT/RGB = text to image/video
Run the Stable Diffusion releases in a Docker container
Let us control diffusion models
Interactive deep learning book with multi-framework code
Easily build, customize and control your own LLMs
Meta-Transformer for Unified Multimodal Learning
High-Resolution 3D Human Digitization from A Single Image
Scaled-YOLOv4: Scaling Cross Stage Partial Network
We estimate dense, flicker-free, geometrically consistent depth
AI for GNU Image Manipulation Program
Implementation of EfficientNet model. Keras and TensorFlow Keras
DCVGAN: Depth Conditional Video Generation, ICIP 2019.
Learning infinite-resolution image processing with GAN and RL
A Realistic and Rich 3D Environment
Keras code and weights files for popular deep learning models
OpenAI’s compact 20B open model for fast, agentic, and local use