A theoretical reconstruction of the Claude Mythos architecture
Qwen3 is the large language model series developed by Qwen team
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Diversity-driven optimization and large-model reasoning ability
Text and image to video generation: CogVideoX and CogVideo
Industrial-level controllable zero-shot text-to-speech system
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Provides convenient access to the Anthropic REST API from any Python 3
DeepSeek Coder: Let the Code Write Itself
The official repo of Qwen chat & pretrained large language model
Contexts Optical Compression
Advancing Open-source World Models
Open-source multi-speaker long-form text-to-speech model
Qwen2.5-VL is the multimodal large language model series
Project Lyra: Open Generative 3D World Models
26m function call model that runs on incredibly small devices
Recovering the Visual Space from Any Views
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Qwen-Image is a powerful image generation foundation model
Open-source image generative foundation model
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Video Object and Interaction Deletion
An experimental version of DeepSeek model
A series of math-specific large language models of our Qwen2 series