Unified Model Serving Framework
Conversational voice AI agents
Structured outputs for LLMs
We write your reusable computer vision tools
The Multi-Agent Framework
A nearly-live implementation of OpenAI's Whisper
Towards Human-Level Text-to-Speech through Style Diffusion
Unified Multimodal Understanding and Generation Models
Sharp Monocular Metric Depth in Less Than a Second
DeepSeek Coder: Let the Code Write Itself
High-Resolution Image Synthesis with Latent Diffusion Models
Parse files for optimal RAG
Full stack AI software engineer
State-of-the-art diffusion models for image and audio generation
State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX
Clone a voice in 5 seconds to generate arbitrary speech in real-time
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Composable building blocks to build Llama Apps
Programmatic access to the AlphaGenome model
OCR expert VLM powered by Hunyuan's native multimodal architecture
Collection of Gemma 3 variants that are trained for performance
Official repository for LTX-Video
HexStrike AI MCP Agents is an advanced MCP server
VMZ: Model Zoo for Video Modeling
HunyuanVideo: A Systematic Framework For Large Video Generation Model