EPUB to audiobook converter, optimized for Audiobookshelf
OCR software, free and offline
Effortless data labeling with AI support from Segment Anything
Turns data and AI algorithms into production-ready web applications
Python inference and LoRA trainer package for the LTX-2 audio–video
Image polygonal annotation with Python
DoWhy is a Python library for causal inference
A Web UI for easy subtitle generation using the Whisper model
The terminal client for Ollama
Open Source Generative Process Automation
Sunfish: a Python Chess Engine in 111 lines of code
AI tool for automating desktop tasks via natural language input
A single-file tkinter-based Ollama GUI project
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Visual tool for building, testing, and deploying AI agent workflows
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
The AI toolkit for the AI developer
Weaving the Digital Agent Galaxy
Agent S: an open agentic framework that uses computers like a human
UI-TARS-desktop version that can operate on your local personal device
Synthetic Data Generation for tabular, relational and time series data
GUI Exploration Lab. One of the best GUI agent solutions
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
Real-World Centric Foundation GUI Agents
Open source AI pair programmer for coding, debugging, and automation