Official implementation of Watermark Anything with Localized Messages
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Implementation of the Surya Foundation Model for Heliophysics
Release for Improved Denoising Diffusion Probabilistic Models
Industrial-level controllable zero-shot text-to-speech system
A Pragmatic VLA Foundation Model
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Tool for exploring and debugging transformer model behaviors
Unified Multimodal Understanding and Generation Models
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Foundation Models for Time Series
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
Recovering the Visual Space from Any Views
Block Diffusion for Ultra-Fast Speculative Decoding
FAIR Sequence Modeling Toolkit 2
VMZ: Model Zoo for Video Modeling
A Production-ready Reinforcement Learning AI Agent Library
Video understanding codebase from FAIR for reproducing video models
Towards Real-World Vision-Language Understanding
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Generating Immersive, Explorable, and Interactive 3D Worlds
Accurate × Fast × Comprehensive
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Audio foundation model excelling in audio understanding