Open Source OCR Engine
Real-time face swap and one-click video deepfake
State-of-the-art 2D and 3D Face Analysis Project
LLM Frontend for Power Users
157 models, 30 providers, one command to find what runs on your hardware
From Vibe Coding to Agentic Engineering
A Family of Open Sourced Music Foundation Models
Instant voice cloning by MIT and MyShell. Audio foundation model
Code for running inference and finetuning with SAM 3 model
Client-side indecent content checking powered by TensorFlow.js
A simple, high-quality voice conversion tool focused on ease of use
AI agent stdlib that works with any LLM and TypeScript AI SDK
Captcha solver extension for humans
Vald. A Highly Scalable Distributed Vector Search Engine
The all-in-one Desktop & Docker AI application with full RAG and AI
Open-source vector similarity search for Postgres
Awesome multilingual OCR toolkits based on PaddlePaddle
Synchronized Translation for Videos
A high-performance ML model serving framework that offers dynamic batching
The open-source voice synthesis studio powered by Qwen3-TTS
Java wrapper for the popular chat & VOIP service
Telegram Drive
Use Microsoft Edge's online text-to-speech service from Python
Deep learning library
A RWKV management and startup tool, full automation, only 8MB