Mice speech to text with MX Cinnamon OS ISO
Toolkit for conversational AI
Interface for OuteTTS models
Towards Human-Sounding Speech
AI Slack bot for reading, summarizing, and chatting with content
Open-source Video Translation Skill
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Multi-modal large language model designed for audio understanding
A TTS model capable of generating ultra-realistic dialogue
Qwen3-ASR is an open-source series of ASR models
Powerful Android AI agent with tools, automation, and Linux shell
LLM-based Reinforcement Learning audio edit model
Open source personal AI Assistant for Linux, Windows and Mac
Open source AI model for generating full songs from lyrics prompts
Pushing the Frontier of Long Audio-Visual Generation
Automatically translates the text of a video based on a subtitle file
AI framework for automated short video creation and editing tools
LLM Large Model of Selling Anchor
Virtual AI anchor that combines state-of-the-art technology
Chat & pretrained large audio language model proposed by Alibaba Cloud
A Conversational Speech Generation Model
Run GGUF models easily with a UI or API. One File. Zero Install.
Unlimited, private and free Speech-To-Text program
Two Integrated Text To Speech Engines uses MMS & Silero