A lightweight audio-to-MIDI converter with pitch bend detection
Free and source-available fair-code licensed workflow automation tool
Offline Text To Speech synthesis for python
Open source machine learning framework
Image polygonal annotation with Python
Port of Facebook's LLaMA model in C/C++
Reverse-engineered Python API for Google Gemini web app
Public repository for Agent Skills
OCRmyPDF adds an OCR text layer to scanned PDF files
OBLITERATE THE CHAINS THAT BIND YOU
AI agent harness for AI coding agents
The most powerful and modular diffusion model GUI, api and backend
Oobabooga - The definitive Web UI for local AI, with powerful features
From Images to High-Fidelity 3D Assets
Robust Speech Recognition via Large-Scale Weak Supervision
The agent that grows with you
Python & JS/TS SDK for running AI-generated code/code
AI video generator optimized for low VRAM and older GPUs use
Single-cell analysis in Python
Speech recognition module for Python
Stable Diffusion web UI
World's first open-source, agentic video production system
A simple, high-quality voice conversion tool focused on ease of use
Unified web UI for training and running open models locally
Agentic, Reasoning, and Coding (ARC) foundation models