Qwen3-TTS is an open-source series of TTS models
Official inference repo for FLUX.1 models
Text and image to video generation: CogVideoX and CogVideo
A Family of Open Sourced Music Foundation Models
A 0.1B Omni model trained from scratch
Project Lyra: Open Generative 3D World Models
A theoretical reconstruction of the Claude Mythos architecture
DeepSeek Coder: Let the Code Write Itself
ChatGLM-6B: An Open Bilingual Dialogue Language Model
High-Resolution Image Synthesis with Latent Diffusion Models
Fast-stable-diffusion + DreamBooth
Advancing Open-source World Models
Recovering the Visual Space from Any Views
Tiny vision language model
Easy Docker setup for Stable Diffusion with user-friendly UI
Repo for SeedVR2 & SeedVR
26m function call model that runs on incredibly small devices
Z80-μLM is a 2-bit quantized language model
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
ChatGPT interface with better UI
Revolutionizing Database Interactions with Private LLM Technology
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
A trainable PyTorch reproduction of AlphaFold 3
A Multi-Modal World Model for Reconstructing, Generating, Simulation
A Systematic Framework for Interactive World Modeling