GUI for a Vocal Remover that uses Deep Neural Networks
Real time face swap and one-click video deepfake
Agentic, Reasoning, and Coding (ARC) foundation models
State-of-the-art 2D and 3D Face Analysis Project
The most powerful and modular diffusion model GUI, api and backend
Run Local LLMs on Any Device. Open-source
Focus on prompting and generating
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
OCRmyPDF adds an OCR text layer to scanned PDF files
3D reconstruction software
The Pocket Datalab
Stable Diffusion web UI
Wan2.2: Open and Advanced Large-Scale Video Generative Model
RGBD video generation model conditioned on camera input
Robust Speech Recognition via Large-Scale Weak Supervision
Qwen3 is the large language model series developed by Qwen team
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Awesome multilingual OCR toolkits based on PaddlePaddle
Open-source, high-performance AI model with advanced reasoning
Powerful AI language model (MoE) optimized for efficiency/performance
Open-Sora: Democratizing Efficient Video Production for All
Diversity-driven optimization and large-model reasoning ability
InvokeAI is a leading creative engine for Stable Diffusion models
Contexts Optical Compression
gpt-4o for windows, macos and linux