Unified Model Serving Framework
Conversational voice AI agents
Structured outputs for llms
We write your reusable computer vision tools
The Multi-Agent Framework
A nearly-live implementation of OpenAI's Whisper
Towards Human-Level Text-to-Speech through Style Diffusion
Unified Multimodal Understanding and Generation Models
Sharp Monocular Metric Depth in Less Than a Second
DeepSeek Coder: Let the Code Write Itself
High-Resolution Image Synthesis with Latent Diffusion Models
Parse files for optimal RAG
Full stack AI software engineer
State-of-the-art diffusion models for image and audio generation
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Composable building blocks to build Llama Apps
Clone a voice in 5 seconds to generate arbitrary speech in real-time
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Programmatic access to the AlphaGenome model
Collection of Gemma 3 variants that are trained for performance
Official repository for LTX-Video
OCR expert VLM powered by Hunyuan's native multimodal architecture
HexStrike AI MCP Agents is an advanced MCP server
VMZ: Model Zoo for Video Modeling
HunyuanVideo: A Systematic Framework For Large Video Generation Model