Collection of Gemma 3 variants that are trained for performance
Unified Model Serving Framework
State-of-the-art (SoTA) text-to-video pre-trained model
OCR expert VLM powered by Hunyuan's native multimodal architecture
The official Python library for the OpenAI API
Stable Diffusion built-in to Blender
We write your reusable computer vision tools
Structured outputs for llms
Build multi-modal Agents with memory, knowledge, tools and reasoning
Parse files for optimal RAG
A Pythonic framework to simplify AI service building
Open-source multi-speaker long-form text-to-speech model
A nearly-live implementation of OpenAI's Whisper
High-Resolution Image Synthesis with Latent Diffusion Models
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Composable building blocks to build Llama Apps
Programmatic access to the AlphaGenome model
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
VMZ: Model Zoo for Video Modeling
HunyuanVideo: A Systematic Framework For Large Video Generation Model
A framework to enable multimodal models to operate a computer
Virtual AI anchor that combines state-of-the-art technology
Reverse-engineered Python API for Google Gemini web app