Agentic, Reasoning, and Coding (ARC) foundation models
OCR software, free and offline
Image inpainting tool powered by SOTA AI Model
Official inference repo for FLUX.2 models
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Robust Speech Recognition via Large-Scale Weak Supervision
1 min voice data can also be used to train a good TTS model
Awesome multilingual OCR toolkits based on PaddlePaddle
A Lightweight Face Recognition and Facial Attribute Analysis
A high-throughput and memory-efficient inference and serving engine
NVR with realtime local object detection for IP cameras
Synchronized Translation for Videos
Web interface for generating images using Stable Diffusion models
YOLOv5 is the world's most loved vision AI
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Machine learning in Python
Open source personal AI Assistant for Linux, Windows and Mac
Open source annotation tool for machine learning practitioners
A Python wrapper you can't refuse
Generate short videos with one click using AI LLM
TensorFlow is an open source library for machine learning
A community-supported supercharged version of paperless
A gradio web UI for running Large Language Models like LLaMA
A simple, high-quality voice conversion tool focused on ease of use
No fortress, purely open ground. OpenManus is Coming