Contexts Optical Compression
NetEase Youdao's open-source embedding and reranker models
Python SDK for Claude Agent
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Robust Speech Recognition Across Languages, Dialects
A Powerful Native Multimodal Model for Image Generation
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Recovering the Visual Space from Any Views
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Generating Immersive, Explorable, and Interactive 3D Worlds
HY-Motion model for 3D character animation generation
The Clay Foundation Model - An open-source AI model and interface
A SOTA open-source image editing model
Achieving 3+ generation speedup on reasoning tasks
Accurate × Fast × Comprehensive
GPT-4V-level open-source multimodal model based on Llama3-8B
Hunyuan Translation Model Version 1.5
Multimodal embedding and reranking models built on Qwen3-VL
VMZ: Model Zoo for Video Modeling
Open-source framework for intelligent speech interaction
Generate Any 3D Scene in Seconds
FAIR Sequence Modeling Toolkit 2
A PyTorch library for implementing flow matching algorithms
Hackable and optimized Transformers building blocks
Official implementation of DreamCraft3D