Bilingual 6.2B parameter chatbot optimized for Chinese and English
CTC-based forced aligner for audio-text in 158 languages
Mirror of Ultralytics YOLO-World model weights for object detection
Compact 360M text model with high efficiency and fine-tuning support
Whisper-large-v3-turbo delivers fast, multilingual speech recognition
Detects speech activity in audio using pyannote.audio 2.1 pipeline