Real-time face swap and one-click video deepfake
Open source AI Agents hosted on the oTTomator Live Agent Studio
A robust, efficient, low-latency speech-to-text library
A nearly-live implementation of OpenAI's Whisper
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
The open-source data curation platform for LLMs
NVR with realtime local object detection for IP cameras
Build Vision Agents quickly with any model or video provider
Virtual AI anchor that combines state-of-the-art technology
EPUB to audiobook converter, optimized for Audiobookshelf
Document Image Parsing via Heterogeneous Anchor Prompting
Python & JS/TS SDK for running AI-generated code
Code to accompany "A Method for Animating Children's Drawings"
Python chatbot framework with Natural Language Understanding
A text-to-speech, speech-to-text and speech-to-speech library
Open source framework for deep learning satellite and aerial imagery
Open-Source Financial Large Language Models
An Open-Source AI Agent Platform for Financial Analysis using LLMs
Anthropic's educational courses
DeepMind model for tracking arbitrary points across videos & robotics
Data science on data without acquiring a copy
An MCP server that autonomously evaluates web applications
OpenFieldAI is an AI-based Open Field Test Rodent Tracker
Powerful open source image generation model