Official Python inference and LoRA trainer package
Awesome multilingual OCR toolkits based on PaddlePaddle
Official inference repo for FLUX.1 models
ChatGPT interface with better UI
Code for running inference and finetuning with SAM 3 model
A theoretical reconstruction of the Claude Mythos architecture
Qwen3-TTS is an open-source series of TTS models
Text and image to video generation: CogVideoX and CogVideo
Qwen3-Coder is the code version of Qwen3
Python inference and LoRA trainer package for the LTX-2 audio–video
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Easy Docker setup for Stable Diffusion with user-friendly UI
LTX-Video Support for ComfyUI
Models for object and human mesh reconstruction
High-Resolution Image Synthesis with Latent Diffusion Models
Reference PyTorch implementation and models for DINOv3
Tiny vision language model
PyTorch code and models for the DINOv2 self-supervised learning
Let's make video diffusion practical
Sharp Monocular Metric Depth in Less Than a Second
Provides convenient access to the Anthropic REST API from any Python 3
Generate Any 3D Scene in Seconds
AlphaFold 3 inference pipeline
DeepSeek Coder: Let the Code Write Itself
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion