ERNIE 4.5 MoE model in FP8 for efficient high-performance inference
Code generation model trained on 80+ languages with FIM support
Python Computer Vision & Video Analytics Framework With Batteries Incl
Speaker segmentation model for 10s audio chunks with powerset labels
Open-weight, large-scale hybrid-attention reasoning model
State-of-the-art image-to-markdown OCR model
Powerful 12B parameter model for top-tier text-to-image creation
Text-to-image diffusion model for high-quality image generation
Advanced base model for high-quality text-to-image generation
Efficient text-to-image model with enhanced quality and typography
Lightweight, fast, and high-quality open TTS model with 82M params
High-accuracy multilingual speech recognition and translation model
Dialogue-optimized 7B language model for safe and helpful chatting
7B-parameter foundational LLM by Meta for text generation tasks
Multilingual 8B-parameter chat-optimized LLM fine-tuned by Meta
Instruction-tuned 8B LLM by Meta for helpful, safe English dialogue
12B-parameter image generator using fast rectified flow transformers
Latent diffusion model for high-quality text-to-image generation
Extension for Stable Diffusion using edge, depth, pose, and more
Small, high-performing language model for QA, chat, and code tasks
Generates high-quality short videos from a single still image input
Advanced MMDiT text-to-image model for high-quality visual generation
Bilingual 6.2B parameter chatbot optimized for Chinese and English
Instruction-tuned 7B model for chat and task-oriented text generation