OCR model for complex documents with layout-aware structured outputs
Foundation model for image generation
Comprehensive Markdown plugin built for Django
Speakr is a personal, self-hosted web application
Image inpainting tool powered by SOTA AI Model
Chat with it via text and voice
The Ren'Py Visual Novel Engine
Synchronized Translation for Videos
Toolkit for conversational AI
Easily compute clip embeddings and build a clip retrieval system
Collection of Gemma 3 variants that are trained for performance
Python library and CLI tool to interface with Google Translate
Python module for parsing semi-structured text into python tables
High accuracy RAG for answering questions from scientific documents
Easy-to-use and powerful NLP library with Awesome model zoo
A nearly-live implementation of OpenAI's Whisper
A very simple framework for state-of-the-art NLP
Qwen2.5-VL is the multimodal large language model series
A python parametric CAD scripting framework based on OCCT
Get free HTTPS certificates forever from Let's Encrypt
A sound cloning tool with a web interface, using your voice
Stable Diffusion web UI
StreamSpeech is a seamless model for offline speech recognition
Unlock the fullest potential of your device
Speech-AI-Forge is a project developed around TTS generation model