Image polygonal annotation with Python
Free and source-available fair-code licensed workflow automation tool
Unofficial Python API and agentic skill for Google NotebookLM
Offline Text To Speech synthesis for Python
OBLITERATE THE CHAINS THAT BIND YOU
OCRmyPDF adds an OCR text layer to scanned PDF files
Public repository for Agent Skills
Open-source, code-first Python toolkit for building, evaluating, etc.
A lightweight audio-to-MIDI converter with pitch bend detection
The most powerful and modular diffusion model GUI, API, and backend
AI agent harness for AI coding agents
Oobabooga - The definitive Web UI for local AI, with powerful features
The agent that grows with you
Robust Speech Recognition via Large-Scale Weak Supervision
Google Gen AI Python SDK provides an interface for developers
Stable Diffusion web UI
Speech recognition module for Python
Reverse-engineered Python API for Google Gemini web app
A simple, high-quality voice conversion tool focused on ease of use
Unified web UI for training and running open models locally
From Images to High-Fidelity 3D Assets
Python & JS/TS SDK for running AI-generated code
Code for running inference and finetuning with SAM 3 model
Comprehensive Gradio WebUI for audio processing
Open source healthcare AI