Official repository for LTX-Video
Implementation of "MobileCLIP" (CVPR 2024)
Qwen3-Omni is a natively end-to-end, omni-modal LLM
DeepMind model for tracking arbitrary points across videos, with applications in robotics
A fast, local neural text-to-speech system
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
A library for Multilingual Unsupervised or Supervised word Embeddings
Compact 8B multimodal instruct model optimized for edge deployment
Ultra-efficient 3B multimodal instruct model built for edge deployment
Efficient 14B multimodal instruct model with edge deployment and FP8