Framework for building AI-powered interactive digital humans and agent
AI Slack bot for reading, summarizing, and chatting with content
Repo of Qwen2-Audio chat & pretrained large audio language model
A specialized Claude Code workspace for creating long-form
Fully Local Manus AI. No APIs, No $200 monthly bills
Aider is AI pair programming in your terminal
Toolkit for conversational AI
Open-source abilities for OpenHome agents
Open source personal AI Assistant for Linux, Windows and Mac
Software that uses AI to perform real-time voice conversion
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Open Source Speech Language Model
AI framework for automated short video creation and editing tools
Qwen3-ASR is an open-source series of ASR models
Open source AI model for generating full songs from lyrics prompts
Chat & pretrained large audio language model proposed by Alibaba Cloud
Long-form streaming TTS system for multi-speaker dialogue generation
Run a full local LLM stack with one command using Docker
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
Multi-modal large language model designed for audio understanding
SoTA open-source TTS
A Python library for audio
Context-aware AI Sales Agent to automate sales outreach
LLM-based Reinforcement Learning audio edit model
LLM Large Model of Selling Anchor