Small, high-performing language model for QA, chat, and code tasks
Generates high-quality short videos from a single still image input
Advanced MMDiT text-to-image model for high-quality visual generation
Bilingual 6.2B parameter chatbot optimized for Chinese and English
Instruction-tuned 7B model for chat and task-oriented text generation
Multilingual voice cloning TTS model with 6-second sample support
GPT-2 is a 124M parameter English language model for text generation
Whisper-large-v3-turbo delivers fast, multilingual speech recognition
Llama-3.3-70B-Instruct is a multilingual AI optimized for helpful chat
Llama-2-70B-Chat is Meta’s largest fine-tuned open-source chat LLM
BGE-M3 is a multilingual embedding model
Llama-2-7B is a 7B-parameter transformer model for text generation
Detects speech activity in audio using pyannote.audio 2.1 pipeline
Time series forecasting model using T5 architecture with 46M params
Multimodal ERNIE 4.5 MoE model for image-text reasoning and chat