Dia-1.6B generates lifelike English dialogue and vocal expressions
converts csv files into one (or more if splitted) xls file(s)
Very lightweight and minimalistic python browser with webkit
Compact 360M text model with high efficiency and fine-tuning support
Dialogue-optimized 7B language model for safe and helpful chatting
Latent diffusion model for high-quality text-to-image generation
Advanced MMDiT text-to-image model for high-quality visual generation
ERNIE 4.5 MoE model in FP8 for efficient high-performance inference
CTC-based forced aligner for audio-text in 158 languages
State-of-the-art image-to-markdown OCR model
Powerful 12B parameter model for top-tier text-to-image creation
12B-parameter image generator using fast rectified flow transformers
Generates high-quality short videos from a single still image input
BGE-M3 is a multilingual embedding model
Multimodal ERNIE 4.5 MoE model for image-text reasoning and chat
Lightweight, fast, and high-quality open TTS model with 82M params
Instruction-tuned 7B model for chat and task-oriented text generation
Multilingual voice cloning TTS model with 6-second sample support
Llama-3.3-70B-Instruct is a multilingual AI optimized for helpful chat
Llama-2-70B-Chat is Meta’s largest fine-tuned open-source chat LLM
Vision-language-action model for robot control via images and text