The most powerful and modular diffusion model GUI, API, and backend
Fast Stable Diffusion on CPU and AI PC
Generate audiobooks from e-books
Framework and no-code GUI for fine-tuning LLMs
Image polygonal annotation with Python
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Agent S: an open agentic framework that uses computers like a human
A graphical manager for Ollama that can manage your LLMs
A fractal neural network
AI Suite for upscaling, interpolating & restoring images/videos
Meta-database application for the audio and TV broadcasts of CC2.TV
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Official Code for DragGAN (SIGGRAPH 2023)
Face-recognition-based Attendance System for school, college...
Easy-OCR solution and Tesseract trainer for GNU/Linux