Build AI-powered semantic search applications
Unified KV Cache Compression Methods for Auto-Regressive Models
Open-source deep-learning framework
Miso TTS is an 8 billion, highly emotive text-to-speech model
Stanford NLP Python library for many human languages
Real time face swap and one-click video deepfake
Multilingual Document Layout Parsing in a Single Vision-Language Model
Shared repository for open-sourced projects from the Google AI Lang
Pluggable SOTA multi-object tracking modules for segmentation
Open-source industrial-grade ASR models
Ultimate meta-skill for generating best-in-class Claude Code skills
Streaming Real-time Audio-Driven Avatar Generation
Multimodal embedding and reranking models built on Qwen3-VL
Video understanding codebase from FAIR for reproducing video models
A new Minecraft world editor and converter
DeepMind model for tracking arbitrary points across videos & robotics
A series of math-specific large language models of our Qwen2 series
Turn WiFi signals into real-time human pose estimation and detection
Vision utilities for web interaction agents
ShredOS Disk Eraser 64 bit for all Intel 64 bit processors
Small python-gtk application, to merge or split PDFs
Build your chatbot within minutes on your favorite device
Multi-Platform Live Stream Automatic Recording Tool
Designed for text embedding and ranking tasks
Python IDE for beginners