A high-quality tool for converting PDF to Markdown and JSON
Chat with it via text and voice
Unofficial Python API and agentic skill for Google NotebookLM
Official Python implementation of UTCP. UTCP is an open standard
An opinionated CLI to transcribe audio files with Whisper on-device
Arcade Tool Development Kit (TDK), Worker, Evals, and CLI
GUI/CLI tool for downloading Xiaohongshu
The most powerful and modular diffusion model GUI, API, and backend
Reverse engineering Gemini's SynthID detection
Fast Stable Diffusion on CPU and AI PC
Open-source infrastructure for Computer-Use Agents. Sandboxes
Private chat with local GPT with documents, images, video, etc.
The official gpt4free repository
EPUB to audiobook converter, optimized for Audiobookshelf
LangChain-powered shell command generator and runner CLI
Run a full local LLM stack with one command using Docker
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Contexts Optical Compression
A state-of-the-art open visual language model
Enterprise multi-agent orchestration framework for scalable AI apps
End-to-end pipeline converting generative videos
Gorilla: An API store for LLMs
A GUI tool for extracting hard-coded subtitles (hardsub) from videos
Generate audiobooks from e-books, voice cloning & 1107+ languages
Framework and no-code GUI for fine-tuning LLMs