Open-Source Financial Large Language Models
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Generating Immersive, Explorable, and Interactive 3D Worlds
Official implementation of Watermark Anything with Localized Messages
Real-time behaviour synthesis with MuJoCo, using Predictive Control
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Generate Any 3D Scene in Seconds
CodeGeeX2: A More Powerful Multilingual Code Generation Model
State-of-the-art (SoTA) text-to-video pre-trained model
Qwen2.5-VL is the multimodal large language model series
The Clay Foundation Model - An open source AI model and interface
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Controllable & emotion-expressive zero-shot TTS
Unified Multimodal Understanding and Generation Models
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Inference framework for 1-bit LLMs
HY-Motion model for 3D character animation generation
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Collection of Gemma 3 variants that are trained for performance
Tool for exploring and debugging transformer model behaviors
Repo for SeedVR2 & SeedVR
AlphaFold 3 inference pipeline
Release for Improved Denoising Diffusion Probabilistic Models
OCR expert VLM powered by Hunyuan's native multimodal architecture