Real time face swap and one-click video deepfake
Open source AI Agents hosted on the oTTomator Live Agent Studio
A robust, efficient, low-latency speech-to-text library
A nearly-live implementation of OpenAI's Whisper
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Build Vision Agents quickly with any model or video provider
The open-source data curation platform for LLMs
NVR with realtime local object detection for IP cameras
Virtual AI anchor that combines state-of-the-art technology
Self-learning data agent that grounds its answers in layers of content
Python & JS/TS SDK for running AI-generated code/code
Document Image Parsing via Heterogeneous Anchor Prompting”
Open-Source Financial Large Language Models
Qwen3-ASR is an open-source series of ASR models
Python chatbot framework with Natural Language Understanding
A text-to-speech, speech-to-text and speech-to-speech library
An Open-Source AI Agent Platform for Financial Analysis using LLMs
Code to accompany "A Method for Animating Children's Drawings"
DeepMind model for tracking arbitrary points across videos & robotics
Data science on data without acquiring a copy
Analyzing Hacker News discussions from a decade ago in hindsight
Anthropic's educational courses
An MCP server that autonomously evaluates web applications
Open source framework for deep learning satellite and aerial imagery
Powerful open source image generation model