The behavior guidance framework for customer-facing LLM agents
PDF to Markdown with vision models
Open source self-hosted web archiving
Enhances Tesseract OCR output using LLMs (local or API)
AI-powered code assistant for Vim: an OpenAI and ChatGPT plugin
Translate a video from one language to another and embed the dubbed audio
Claude Code skill implementing Manus-style persistent planning
CLIP: predict the most relevant text snippet given an image
Industrial-level controllable zero-shot text-to-speech system
Tilf (Tiny Elf) is a free, simple, yet powerful pixel art editor
Clone a voice in 5 seconds to generate arbitrary speech in real-time
AI Image Upscaler & Enhancer
Lightweight Markdown-only skills for autonomous ML research
A community-supported supercharged version of paperless
Investment research for everyone, anywhere
The official Python SDK for the ElevenLabs API
The Ren'Py Visual Novel Engine
Persian NLP Toolkit
A nearly live implementation of OpenAI's Whisper
Collection of Gemma 3 variants that are trained for performance
Management of Yandex Station and other smart home devices
A fast TTS architecture with conditional flow matching
A minimalist command line knowledge base manager
A lightweight text-to-speech model with zero-shot voice cloning
Python & command-line tool to gather text on the Web