text and image to video generation: CogVideoX (2024) and CogVideo
Renderer for the harmony response format to be used with gpt-oss
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
The leading agent orchestration platform for Claude
21 Lessons, Get Started Building with Generative AI
MobileLLM Optimizing Sub-billion Parameter Language Models
DeepMind model for tracking arbitrary points across videos & robotics
FAIR Sequence Modeling Toolkit 2
VMZ: Model Zoo for Video Modeling
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences
Code for Cicero, an AI agent that plays the game of Diplomacy
ChatGLM2-6B: An Open Bilingual Chat LLM
ChatGLM3 series: Open Bilingual Chat LLMs | Open Source Bilingual Chat
code for Mesh R-CNN, ICCV 2019
Training Large Language Model to Reason in a Continuous Latent Space
CodeGeeX2: A More Powerful Multilingual Code Generation Model
A state-of-the-art open visual language model
Chinese and English multimodal conversational language model
kaldi-asr/kaldi is the official location of the Kaldi project
Text-to-Image generation. The repo for NeurIPS 2021 paper
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
The Pocket Datalab
We estimate dense, flicker-free, geometrically consistent depth