Open Source OCR Engine
GUI for a Vocal Remover that uses Deep Neural Networks
Free and source-available fair-code licensed workflow automation tool
Real time face swap and one-click video deepfake
Port of OpenAI's Whisper model in C/C++
Free and Open Source AI Image Upscaler for Linux, MacOS and Windows
Get up and running with Llama 2 and other large language models
State-of-the-art 2D and 3D Face Analysis Project
The most powerful and modular diffusion model GUI, api and backend
Stable Diffusion web UI
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Focus on prompting and generating
Build your own AI friend
LLM Frontend for Power Users
3D reconstruction software
Speech-to-text, text-to-speech, and speaker recognition
Port of Facebook's LLaMA model in C/C++
Advanced language and coding AI model
Open source machine learning framework
Image generation model with single-stream diffusion transformer
OCRmyPDF adds an OCR text layer to scanned PDF files
Agentic, Reasoning, and Coding (ARC) foundation models
Code for running inference and finetuning with SAM 3 model
Run Local LLMs on Any Device. Open-source
GLM-4.5: Open-source LLM for intelligent agents by Z.ai