Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Text and image to video generation: CogVideoX and CogVideo
Diversity-driven optimization and large-model reasoning ability
Netease Youdao's open-source embedding and reranker models
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
The official PyTorch implementation of Google's Gemma models
State-of-the-art (SoTA) text-to-video pre-trained model
A Production-ready Reinforcement Learning AI Agent Library
PyTorch code and models for the DINOv2 self-supervised learning
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
FAIR Sequence Modeling Toolkit 2
AI Suite for upscaling, interpolating & restoring images/videos
Official DeiT repository
Real-time behaviour synthesis with MuJoCo, using Predictive Control
A method to increase the speed and lower the memory footprint
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)