State-of-the-art TTS model under 25MB
A lightweight text-to-speech model with zero-shot voice cloning
Industrial-level controllable zero-shot text-to-speech system
Converts text to speech in real time
Faster Whisper transcription with CTranslate2
AI video generator optimized for low-VRAM and older GPUs
Qwen3-Omni is a natively end-to-end, omni-modal LLM
The Open Source AI-Powered Code Editor. A fork of VSCode and Continue
Speech recognition module for Python
CLIP: predicts the most relevant text snippet for a given image
Text mining using tidy tools
A persistent, network-resilient, full-text search library
The pluggable natural language linter for text and Markdown
AI-powered open-source platform for building intelligent wiki bases
Generating Immersive, Explorable, and Interactive 3D Worlds
Qwen-Image is a powerful image generation foundation model
State-of-the-art (SoTA) text-to-video pre-trained model
Streaming markdown renderer for AI apps with smooth updates
Video translation and dubbing tool powered by LLMs
Python library and CLI tool to interface with Google Translate
Unifying 3D Mesh Generation with Language Models
AI-powered tool for generating, optimizing, and translating subtitles
A fast TTS architecture with conditional flow matching
Gp.nvim (GPT prompt) Neovim AI plugin
Discourse Network Analyzer (DNA)