State-of-the-art TTS model under 25MB
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Powerful AI language model (MoE) optimized for efficiency/performance
Open-source, high-performance AI model with advanced reasoning
Open-source multi-speaker long-form text-to-speech model
Blazeface is a lightweight model that detects faces in images
A CNN model that predicts human joints from RGB images of a person
Locally run an Instruction-Tuned Chat-Style LLM
A collection of high-quality models for the MuJoCo physics engine