OCRmyPDF adds an OCR text layer to scanned PDF files
A gradio web UI for running Large Language Models like LLaMA
3D reconstruction software
Visualizer for neural network, deep learning, machine learning models
Stable Diffusion web UI
Open-source, high-performance AI model with advanced reasoning
Chemcrow
Powerful AI language model (MoE) optimized for efficiency/performance
Telegram Drive
Open-Sora: Democratizing Efficient Video Production for All
One-click face swap
A retargetable MLIR-based machine learning compiler runtime toolkit
Image polygonal annotation with Python
State-of-the-art TTS model under 25MB
Generate short videos with one click using AI LLM
Low-code app builder for RAG and multi-agent AI applications
A lightweight audio-to-MIDI converter with pitch bend detection
Vector Database for the next generation of AI applications
The AI-powered coding wizard
Simple and powerful voice changer for Linux, written with Python & GTK
NVR with realtime local object detection for IP cameras
Speech-to-text, text-to-speech, and speaker recognition
A high-throughput and memory-efficient inference and serving engine
A Lightweight Face Recognition and Facial Attribute Analysis
Open Source Computer Vision Library