General natural language facilities for Node.js
Vim plugin for LLM-assisted code/text completion
OCR model for complex documents with layout-aware structured outputs
Qwen3-Omni is a natively end-to-end, omni-modal LLM
A fast, helpful, and open-source document parser
Automatable GenAI Scripting
Unifying 3D Mesh Generation with Language Models
dLLM: Simple Diffusion Language Modeling
Workflow and speech recognition app
A persistent, network-resilient, full-text search library
AI-powered tool for generating, optimizing, and translating subtitles
Open-source text-to-speech tool that supports extra-long text
Generate blog articles from video or audio
Towards Human-Sounding Speech
Browser extension and cross-platform desktop app based on the ChatGPT API
An open-source text-to-speech system built by inverting Whisper
Spark-TTS Inference Code
Controllable & emotion-expressive zero-shot TTS
A minimal LLM chat app that runs entirely in your browser
TextWorld is a sandbox learning environment for the training
Instantly generate AI-powered subtitles on your device
Flowly is 100x faster than OpenClaw
Converts text to speech in real time
Collection of Gemma 3 variants that are trained for performance
Web-based tool that converts GitHub repository contents