Open-source, code-first Python toolkit for building, evaluating, etc.
Free and source-available fair-code licensed workflow automation tool
A lightweight audio-to-MIDI converter with pitch bend detection
Offline Text To Speech synthesis for python
Personal AI, On Personal Devices
Deep Research framework, combining language models with tools
OBLITERATE THE CHAINS THAT BIND YOU
Unofficial Python API and agentic skill for Google NotebookLM
Public repository for Agent Skills
Fully automatic censorship removal for language models
OCRmyPDF adds an OCR text layer to scanned PDF files
Robust Speech Recognition via Large-Scale Weak Supervision
Fast and memory-efficient exact attention
A simple, high-quality voice conversion tool focused on ease of use
Google Gen AI Python SDK provides an interface for developers
World's first open-source, agentic video production system
Comprehensive Gradio WebUI for audio processing
Reverse-engineered Python API for Google Gemini web app
Stable Diffusion web UI
Python-based neural networks API
The agent that grows with you
Create UIs for your machine learning model in Python in 3 minutes
From Images to High-Fidelity 3D Assets
Image polygonal annotation with Python
Improve your Baduk skills by training with KataGo