High-accuracy multilingual speech recognition and translation model
Dialogue-optimized 7B language model for safe and helpful chatting
7B-parameter foundational LLM by Meta for text generation tasks
Multilingual 8B-parameter chat-optimized LLM fine-tuned by Meta
Instruction-tuned 8B LLM by Meta for helpful, safe English dialogue
12B-parameter image generator using fast rectified flow transformers
Latent diffusion model for high-quality text-to-image generation
Extension for Stable Diffusion using edge, depth, pose, and more
Small, high-performing language model for QA, chat, and code tasks
Generates high-quality short videos from a single still image input
Advanced MMDiT text-to-image model for high-quality visual generation
Bilingual 6.2B parameter chatbot optimized for Chinese and English
Instruction-tuned 7B model for chat and task-oriented text generation
Multilingual voice cloning TTS model with 6-second sample support
GPT-2 is a 124M parameter English language model for text generation
Whisper-large-v3-turbo delivers fast, multilingual speech recognition
Llama-3.3-70B-Instruct is a multilingual AI optimized for helpful chat
Llama-2-70B-Chat is Meta’s largest fine-tuned open-source chat LLM
BGE-M3 is a multilingual embedding model
Llama-2-7B is a 7B-parameter transformer model for text generation
Detects speech activity in audio using pyannote.audio 2.1 pipeline
Time series forecasting model using T5 architecture with 46M params
Multimodal ERNIE 4.5 MoE model for image-text reasoning and chat