CLI MCP package manager & registry for all platforms and all clients
Dshell is a network forensic analysis framework
Omnilingual ASR Open-Source Multilingual SpeechRecognition
PyTorch code and models for V-JEPA self-supervised learning from video
[CVPR 2025 Best Paper Award] VGGT
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Library for OCR-related tasks powered by Deep Learning
Capable of understanding text, audio, vision, video
Controllable and fast Text-to-Speech for over 7000 languages
LLM powered fuzzing via OSS-Fuzz
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
kaldi-asr/kaldi is the official location of the Kaldi project
Curl cryptocurrencies exchange rates
Official implementation of Watermark Anything with Localized Messages
A Python toolbox for performing gradient-free optimization
Multilingual Automatic Speech Recognition with word-level timestamps
Build GenAI application quick and easy
Efficient binary-decimal & decimal-binary conversion routines for IEEE
Your open-source LLM evaluation toolkit
Persistent HTTP cache for python requests
A Unified Framework for Image Customization
Tensor search for humans
Stanford NLP Python library for many human languages
UI-TARS-desktop version that can operate on your local personal device
LLM-based Reinforcement Learning audio edit model