FAIR Sequence Modeling Toolkit 2
A Production-ready Reinforcement Learning AI Agent Library
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Pushing the Limits of Mathematical Reasoning in Open Language Models
A state-of-the-art open visual language model
Chinese and English multimodal conversational language model
Tooling for the Common Objects In 3D dataset
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
A series of math-specific large language models of our Qwen2 series
Implementation of "MobileCLIP" CVPR 2024
Official implementation of Watermark Anything with Localized Messages
Chat & pretrained large vision language model
Tongyi Deep Research, the Leading Open-source Deep Research Agent
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Inference script for Oasis 500M
A PyTorch library for implementing flow matching algorithms
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Official implementation of DreamCraft3D
Open-source large language model family from Tencent Hunyuan
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Large Multimodal Models for Video Understanding and Editing
Chat & pretrained large audio language model proposed by Alibaba Cloud
DeepMind model for tracking arbitrary points across videos & robotics
State-of-the-art Image & Video CLIP, Multimodal Large Language Models