Compact 360M text model with high efficiency and fine-tuning support
State-of-the-art image-to-markdown OCR model
Powerful 12B parameter model for top-tier text-to-image creation
Text-to-image diffusion model for high-quality image generation
Advanced base model for high-quality text-to-image generation
Efficient text-to-image model with enhanced quality and typography
Lightweight, fast, and high-quality open TTS model with 82M params
High-accuracy multilingual speech recognition and translation model
Dialogue-optimized 7B language model for safe and helpful chatting
7B-parameter foundational LLM by Meta for text generation tasks
Multilingual 8B-parameter chat-optimized LLM fine-tuned by Meta
Instruction-tuned 8B LLM by Meta for helpful, safe English dialogue
12B-parameter image generator using fast rectified flow transformers
Latent diffusion model for high-quality text-to-image generation
Extension for Stable Diffusion using edge, depth, pose, and more
Small, high-performing language model for QA, chat, and code tasks
Generates high-quality short videos from a single still image input
Advanced MMDiT text-to-image model for high-quality visual generation
Bilingual 6.2B parameter chatbot optimized for Chinese and English
Instruction-tuned 7B model for chat and task-oriented text generation
Multilingual voice cloning TTS model with 6-second sample support
GPT-2 is a 124M parameter English language model for text generation
Whisper-large-v3-turbo delivers fast, multilingual speech recognition
Llama-3.3-70B-Instruct is a multilingual AI optimized for helpful chat
Llama-2-70B-Chat is Meta’s largest fine-tuned open-source chat LLM