OCRmyPDF adds an OCR text layer to scanned PDF files
Vector Database for the next generation of AI applications
Visualizer for neural network, deep learning, and machine learning models
It's possible for machines to become self-aware.
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Agentic, Reasoning, and Coding (ARC) foundation models
Image inpainting tool powered by a SOTA AI model
The all-in-one Desktop & Docker AI application with full RAG and AI
ONNX Runtime: cross-platform, high-performance ML inferencing
Python-based neural networks API
Open source AI agent CLI tool to bring Gemini into your terminal
A Gradio web UI for running Large Language Models like LLaMA
LLM Frontend for Power Users
RGBD video generation model conditioned on camera input
Open-Sora: Democratizing Efficient Video Production for All
Powerful AI language model (MoE) optimized for efficiency/performance
Comprehensive Gradio WebUI for audio processing
Source code of PyGAD, a Python 3 library for building genetic algorithms
Qwen3 is the large language model series developed by the Qwen team
A deep learning toolkit for Text-to-Speech, battle-tested in research
Structure-from-Motion and Multi-View Stereo
Awesome multilingual OCR toolkits based on PaddlePaddle
A high-throughput and memory-efficient inference and serving engine
Stable Diffusion web UI
A retargetable MLIR-based machine learning compiler runtime toolkit