Google Gen AI Python SDK provides an interface for developers
Label Studio is a multi-type data labeling and annotation tool
Awesome multilingual OCR toolkits based on PaddlePaddle
Generate audiobooks from e-books, voice cloning & 1107+ languages
Stable Diffusion WebUI optimized for AMD GPUs with editing tools
The most powerful and modular diffusion model GUI, api and backend
Self-host the powerful Chatterbox TTS model
A simple tool for reading in poorly redacted documents
EPUB to audiobook converter, optimized for Audiobookshelf
LLM abstractions that aren't obstructions
Small python-gtk application, to merge or split PDFs
A Web UI for easy subtitle using whisper model
Central interface to connect your LLM's with external data
Username OSINT tool for discovering accounts across many websites
The agent that grows with you
A fast TTS architecture with conditional flow matching
A minimalist command line knowledge base manager
Open-Sora: Democratizing Efficient Video Production for All
A better UI for your package managers
Open source no-code system for text annotation and building of text
Edit PDF files with Nano Banana
Multi-tool for semantic search
TUI for Ollama and other LLM providers
Easily compute clip embeddings and build a clip retrieval system
RAG-Anything: All-in-One RAG Framework