Production-grade platform for building agentic IM bots
Motion-controllable Video Generation via Latent Trajectory Guidance
Autonomous research from idea to paper. Chat an Idea. Get a Paper 🦞
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
PyTorch code and models for the DINOv2 self-supervised learning
Practice made claude perfect
Cosmos-RL is a flexible and scalable Reinforcement Learning framework
Unified framework for building enterprise RAG pipelines
CoreNet: A library for training deep neural networks
Universal LLM Deployment Engine with ML Compilation
Generating Immersive, Explorable, and Interactive 3D Worlds
Bring the notion of Model-as-a-Service to life
Communicate with an LLM provider using a single interface
Open-source industrial-grade ASR models
Ultimate meta-skill for generating best-in-class Claude Code skills
Block Diffusion for Ultra-Fast Speculative Decoding
Official implementation of Watermark Anything with Localized Messages
Video understanding codebase from FAIR for reproducing video models
The open source post-building layer for agents
Streamlines and simplifies prompt design for both developers
Towards Efficient Self-Evolving Agent System
Learning to Reason with Search for LLMs via Reinforcement Learning
Build multimodal language agents for fast prototype and production
LightLLM is a Python-based LLM (Large Language Model) inference
Open-source framework for intelligent speech interaction