Converts text to speech in real time
Long-form streaming TTS system for multi-speaker dialogue generation
Open Source Speech Language Model
Toolkit for conversational AI
Mice speech to text with MX Cinnamon OS ISO
Interface for OuteTTS models
Towards Human-Sounding Speech
AI Slack bot for reading, summarizing, and chatting with content
Open-source Video Translation Skill
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Extract audio and video content and organize it into a Markdown note
Multi-modal large language model designed for audio understanding
A TTS model capable of generating ultra-realistic dialogue
Qwen3-ASR is an open-source series of ASR models
Powerful Android AI agent with tools, automation, and Linux shell
LLM-based audio editing model trained with reinforcement learning
Open source AI model for generating full songs from lyrics prompts
Open-source personal AI assistant for Linux, Windows and Mac
Pushing the Frontier of Long Audio-Visual Generation
Automatically translates the text of a video based on a subtitle file
AI framework for automated short video creation and editing tools
Large language model for a livestream sales anchor
Virtual AI anchor that combines state-of-the-art technology
Chat & pretrained large audio language model proposed by Alibaba Cloud
A Conversational Speech Generation Model