Open-source, high-performance AI model with advanced reasoning
OCR software, free and offline
Agentic, Reasoning, and Coding (ARC) foundation models
Official inference repo for FLUX.2 models
Robust Speech Recognition via Large-Scale Weak Supervision
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Image inpainting tool powered by SOTA AI Model
1 min voice data can also be used to train a good TTS model
Awesome multilingual OCR toolkits based on PaddlePaddle
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
A high-throughput and memory-efficient inference and serving engine
Synchronized Translation for Videos
Web interface for generating images using Stable Diffusion models
YOLOv5 is the world's most loved vision AI
NVR with realtime local object detection for IP cameras
Open source personal AI Assistant for Linux, Windows and Mac
A Lightweight Face Recognition and Facial Attribute Analysis
Machine learning in Python
Generate short videos with one click using AI LLM
Open source annotation tool for machine learning practitioners
A Python wrapper you can't refuse
TensorFlow is an open source library for machine learning
A gradio web UI for running Large Language Models like LLaMA
A simple, high-quality voice conversion tool focused on ease of use
Real-World Centric Foundation GUI Agents