OCR software, free and offline
Contexts Optical Compression
Python tool for converting files and office documents to Markdown
High-Quality Voice Cloning TTS for 600+ Languages
Open source semantic search and text analytics for large document sets
Offline text-to-speech synthesis for Python
Official inference repo for FLUX.2 models
AI bridge enabling assistants to control and automate Unity Editor
A simple, high-quality voice conversion tool focused on ease of use
Text and image to video generation: CogVideoX and CogVideo
Use Microsoft Edge's online text-to-speech service from Python
Automatic Speech Recognition with Word-level Timestamps
An easy 1-click way to create beautiful artwork on your PC using AI
Coding agent for DeepSeek models that runs in your terminal
A generative speech model for daily dialogue
Official MiniMax Model Context Protocol (MCP) server
Stanford CoreNLP, a Java suite of core NLP tools
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Subtitle Creation Assistant
Lightning-fast, on-device TTS, running natively via ONNX
Generate audiobooks from e-books, voice cloning & 1107+ languages
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
A Powerful Native Multimodal Model for Image Generation
Video-based AI memory library. Store millions of text chunks in MP4