High-performance inference server for text embeddings models API layer
Module for automatic summarization of text documents and HTML pages
Large Language Model Text Generation Inference
Hypernetworks that adapt LLMs for specific benchmark tasks
First class Sublime Text AI assistant with gpt-5, Opus 4.6, Gemini 3
AI tool that removes hardcoded subtitles and text from videos locally
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Awesome multilingual OCR toolkits based on PaddlePaddle
TTS with kokoro and onnx runtime
Wan2.1: Open and Advanced Large-Scale Video Generative Model
FastAPI framework, high performance, easy to learn, fast to code
A simple, high-quality voice conversion tool focused on ease of use
Offline inference engine for art, real-time voice conversations
Official MiniMax Model Context Protocol (MCP) server
EPUB to audiobook converter, optimized for Audiobookshelf
A robust, efficient, low-latency speech-to-text library
Open-Source Python3 tool for recognizing layouts, tables, and math
Mozc - a Japanese Input Method Editor designed for multi-platform
Speech-AI-Forge is a project developed around TTS generation model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Windows GUI Automation with Python (based on text properties)
Official inference repo for FLUX.1 models
High-Quality Voice Cloning TTS for 600+ Languages
Tokenizer-Free TTS for Multilingual Speech Generation
A simple native web interface that uses ChatTTS to synthesize text