Definitions for AI/ML tasks like dataset creation
Long-form streaming TTS system for multi-speaker dialogue generation
Open-source industrial-grade ASR models
End-to-end pipeline converting generative videos
Motion-controllable Video Generation via Latent Trajectory Guidance
Multimodal embedding and reranking models built on Qwen3-VL
Implementation of "MobileCLIP" (CVPR 2024)
VMZ: Model Zoo for Video Modeling
Official implementation of Watermark Anything with Localized Messages
Video understanding codebase from FAIR for reproducing video models
Open-source deep-learning framework for building and training
Lemonade helps users run local LLMs with the highest performance
Mentat - The AI Coding Assistant
A state-of-the-art open visual language model
Chinese and English multimodal conversational language model
Inference code for CodeLlama models
A Python package for extending the official PyTorch
Productive, portable, and performant GPU programming in Python
Kubernetes observability and automation
Official Python implementation of UTCP. UTCP is an open standard
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
High-Fidelity and Controllable Generation of Textured 3D Assets
Python IDE
Convert codebases into structured prompts optimized for LLM analysis
Controllable & emotion-expressive zero-shot TTS