Han Language Processing
Open-source AI VTuber platform with voice chat and Live2D avatars
Towards Human-Level Text-to-Speech through Style Diffusion
A Web UI for easy subtitle generation using the Whisper model
Conversational voice AI agents
Multi-modal large language model designed for audio understanding
Framework for building AI-powered interactive digital humans and agents
Virtual AI anchor that combines state-of-the-art technology
Large language model for a sales livestream anchor
Flowly is 100x faster than OpenClaw
Easy-to-use Speech Toolkit including Self-Supervised Learning model
A Conversational Speech Generation Model
Chat & pretrained large audio language model proposed by Alibaba Cloud
Build Vision Agents quickly with any model or video provider
Official Python inference and LoRA trainer package
Transforming Multimodal Content into Captivating Multilingual Audio
Generate blog articles from video or audio
One-click deployment (including offline integration package)
Towards Studio-Grade Character Animation via In-Context Learning of 3D
Powerful Android AI agent with tools, automation, and Linux shell
FAIR Sequence Modeling Toolkit 2
A very simple framework for state-of-the-art NLP
Pre-trained Deep Learning models and demos
Stanford NLP Python library for many human languages
Mice speech to text with MX Cinnamon OS ISO