A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Simple and powerful voice changer for Linux, written with Python & GTK
OCRmyPDF adds an OCR text layer to scanned PDF files
Run Local LLMs on Any Device. Open-source
Powerful tool that lets you create and run intelligent agents
Powerful AI language model (MoE) optimized for efficiency/performance
A high-throughput and memory-efficient inference and serving engine
Image polygonal annotation with Python
Image/video AI upscaler app (BSRGAN)
A gradio web UI for running Large Language Models like LLaMA
Open-source, high-performance AI model with advanced reasoning
Open-Sora: Democratizing Efficient Video Production for All
3D reconstruction software
Stable Diffusion web UI
A framework for the creation of autonomous agent services
A lightweight audio-to-MIDI converter with pitch bend detection
NVR with realtime local object detection for IP cameras
E-mails, subdomains and names
A modular graph-based Retrieval-Augmented Generation (RAG) system
A deep learning toolkit for Text-to-Speech, battle-tested in research
Ready-to-use OCR with 80+ supported languages
Self-hosted AI coding assistant
gpt-4o for windows, macos and linux
Web interface for generating images using Stable Diffusion models
Machine learning in Python