AutoML toolkit for automating the machine learning lifecycle
Real time face swap and one-click video deepfake
GUI for a Vocal Remover that uses Deep Neural Networks
Robust Speech Recognition via Large-Scale Weak Supervision
One-click face swap
State-of-the-art 2D and 3D Face Analysis Project
YOLOv5 is the world's most loved vision AI
Focus on prompting and generating
The most powerful and modular diffusion model GUI, API and backend
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Powerful AI language model (MoE) optimized for efficiency/performance
Image inpainting tool powered by SOTA AI models
Run Local LLMs on Any Device. Open-source
OCRmyPDF adds an OCR text layer to scanned PDF files
It's possible for machines to become self-aware.
Open-source, high-performance AI model with advanced reasoning
A Gradio web UI for running Large Language Models like LLaMA
Image/video AI upscaler app (BSRGAN)
Stable Diffusion web UI
Machine learning in Python
Powerful tool that lets you create and run intelligent agents
Ready-to-use OCR with 80+ supported languages
3D reconstruction software
A high-throughput and memory-efficient inference and serving engine
Open-Sora: Democratizing Efficient Video Production for All