Powerful AI language model (MoE) optimized for efficiency/performance
Text and image to video generation: CogVideoX and CogVideo
Open-source, high-performance AI model with advanced reasoning
State-of-the-art TTS model under 25MB
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
FAIR Sequence Modeling Toolkit 2
CodeGeeX2: A More Powerful Multilingual Code Generation Model
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
A state-of-the-art open visual language model
Chinese and English multimodal conversational language model
DeepMind model for tracking arbitrary points across videos & robotics
Renderer for the harmony response format to be used with gpt-oss
VMZ: Model Zoo for Video Modeling
code for Mesh R-CNN, ICCV 2019
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
Code for the paper "Improved Techniques for Training GANs"