Dia-1.6B generates lifelike English dialogue and vocal expressions
Grok-1 is a 314B-parameter open-weight language model by xAI
ERNIE 4.5 MoE model in FP8 for efficient high-performance inference
Code generation model trained on 80+ languages with FIM support
State-of-the-art RL-trained coding agent for complex SWE tasks
CTC-based forced aligner for audio-text in 158 languages
Mirror of Ultralytics YOLO-World model weights for object detection
Speaker segmentation model for 10s audio chunks with powerset labels
Open-weight, large-scale hybrid-attention reasoning model
Compact 360M text model with high efficiency and fine-tuning support
State-of-the-art image-to-markdown OCR model
Powerful 12B parameter model for top-tier text-to-image creation
Text-to-image diffusion model for high-quality image generation