GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Open-source large language model family from Tencent Hunyuan
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Pushing the Limits of Mathematical Reasoning in Open Language Models
DeepSeek Coder: Let the Code Write Itself
Generating Immersive, Explorable, and Interactive 3D Worlds
VMZ: Model Zoo for Video Modeling
Official implementation of Watermark Anything with Localized Messages
Foundation Models for Time Series
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Uncommon Objects in 3D dataset
High-resolution models for human tasks
Chinese and English multimodal conversational language model
Repo of Qwen2-Audio chat & pretrained large audio language model
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Models for object and human mesh reconstruction
The official PyTorch implementation of Google's Gemma models
Inference code for scalable emulation of protein equilibrium ensembles
Towards Real-World Vision-Language Understanding
Chat & pretrained large vision language model
Designed for text embedding and ranking tasks
A series of math-specific large language models of our Qwen2 series
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project