Official inference repo for FLUX.1 models
GLM-5: From Vibe Coding to Agentic Engineering
Official Python inference and LoRA trainer package
The most powerful local music generation model
Contexts Optical Compression
Fast-stable-diffusion + DreamBooth
MiniMax M2.1, a SOTA model for real-world dev & agents.
Convert the Google Gemini web interface into an OpenAI-compatible API
Python SDK for Claude Agent
Open-source, high-performance AI model with advanced reasoning
AlphaFold 3 inference pipeline
Miso TTS is an 8-billion-parameter, highly emotive text-to-speech model
Official inference repo for FLUX.2 models
Genome modeling and design across all domains of life
Qwen3 is the large language model series developed by the Qwen team
HY-Motion model for 3D character animation generation
Diversity-driven optimization and large-model reasoning ability
Qwen3.6 is the large language model series developed by the Qwen team
GLM-4 series: Open Multilingual Multimodal Chat LMs
Fast, Sharp & Reliable Agentic Intelligence
MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation
Model export recipes, Python primitives, and Swift runtime utilities
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Qwen2.5-VL is the multimodal large language model series
Inference script for Oasis 500M