Generate high-definition short story videos with one click using AI
Controllable & emotion-expressive zero-shot TTS
World's first open-source, agentic video production system
A specialized Claude Code workspace for creating long-form
Framework for building AI-powered interactive digital humans and agents
AI Slack bot for reading, summarizing, and chatting with content
Repo of Qwen2-Audio chat & pretrained large audio language model
Software that uses AI to perform real-time voice conversion
Open-source framework for intelligent speech interaction
Toolkit for conversational AI
Open source personal AI Assistant for Linux, Windows and Mac
Aider is AI pair programming in your terminal
AI framework for automated short video creation and editing tools
Open-source abilities for OpenHome agents
Fully Local Manus AI. No APIs, No $200 monthly bills
Qwen3-ASR is an open-source series of ASR models
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Run a full local LLM stack with one command using Docker
Open Source Speech Language Model
Open source AI model for generating full songs from lyrics prompts
Chat & pretrained large audio language model proposed by Alibaba Cloud
Open speech-to-speech models and pipelines by Hugging Face
Context-aware AI Sales Agent to automate sales outreach
SoTA open-source TTS
Multi-modal large language model designed for audio understanding