Converts text to speech in realtime
Open Source AI Automation
State-of-the-art TTS model under 25MB
Text mining using tidy tools
The pluggable natural language linter for text and markdown
Industrial-level controllable zero-shot text-to-speech system
CLIP, Predict the most relevant text snippet given an image
Qwen3-omni is a natively end-to-end, omni-modal LLM
A persistent, network resilient, full text search library
Faster Whisper transcription with CTranslate2
Speech recognition module for Python
Streaming markdown renderer for AI apps with smooth updates
Unifying 3D Mesh Generation with Language Models
Python library and CLI tool to interface with Google Translate
State-of-the-art (SoTA) text-to-video pre-trained model
Generating Immersive, Explorable, and Interactive 3D Worlds
Qwen-Image is a powerful image generation foundation model
Gp.nvim (GPT prompt) Neovim AI plugin
AI video generator optimized for low VRAM and older GPUs use
Video translation and dubbing tool powered by LLMs
The Open Source AI-Powered Code Editor. A fork of VSCode and Continue
A fast TTS architecture with conditional flow matching
AI-powered tool for generating, optimizing, and translating subtitles
Discourse Network Analyzer (DNA)
AI-powered open source platform for building intelligent wiki bases