Research code artifacts for Code World Model (CWM)
A Customizable Image-to-Video Model based on HunyuanVideo
Inference code for scalable emulation of protein equilibrium ensembles
Generate Any 3D Scene in Seconds
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Miso TTS is an 8 billion, highly emotive text-to-speech model
tiktoken is a fast BPE tokeniser for use with OpenAI's models
State-of-the-art TTS model under 25MB
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Convert Google Gemini web into OpenAI-compatible API
Qwen3 is the large language model series developed by Qwen team
Tiny vision language model
Repo for SeedVR2 & SeedVR
Models for object and human mesh reconstruction
A Pragmatic VLA Foundation Model
Recovering the Visual Space from Any Views
Large Multimodal Models for Video Understanding and Editing
Industrial-level controllable zero-shot text-to-speech system
Unified Multimodal Understanding and Generation Models
DeepSeek Coder: Let the Code Write Itself
1B text generation model based on the HRM architecture
Foundation Models for Time Series
Hackable and optimized Transformers building blocks