The Pocket Datalab
text and image to video generation: CogVideoX (2024) and CogVideo
Fast backend for long-term AI user memory via structured profiles
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
ChatGLM3 series: Open Bilingual Chat LLMs | Open Source Bilingual Chat
ChatGLM2-6B: An Open Bilingual Chat LLM
Context data platform for building observable, self-learning AI agents
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences
21 Lessons, Get Started Building with Generative AI
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Training Large Language Model to Reason in a Continuous Latent Space
MobileLLM Optimizing Sub-billion Parameter Language Models
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
FAIR Sequence Modeling Toolkit 2
The leading agent orchestration platform for Claude
A state-of-the-art open visual language model
Chinese and English multimodal conversational language model
Code for Cicero, an AI agent that plays the game of Diplomacy
DeepMind model for tracking arbitrary points across videos & robotics
code for Mesh R-CNN, ICCV 2019
Renderer for the harmony response format to be used with gpt-oss
kaldi-asr/kaldi is the official location of the Kaldi project
VMZ: Model Zoo for Video Modeling
Meta-Datenbank-Anwendung für die Audio- und TV-Sendungen des CC2.TV
Text-to-Image generation. The repo for NeurIPS 2021 paper