Conversational voice AI agents
Chat with it via text and voice
Scalable generative AI framework built for researchers and developers
Han Language Processing
Framework for building AI-powered interactive digital humans and agents
Multi-modal large language model designed for audio understanding
High-quality multi-lingual text-to-speech library by MyShell.ai
Large language model for live-stream sales anchors
Build Vision Agents quickly with any model or video provider
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Flowly is 100x faster than OpenClaw
Powerful Android AI agent with tools, automation, and Linux shell
Official Python inference and LoRA trainer package
Towards Human-Level Text-to-Speech through Style Diffusion
Generate blog articles from video or audio
One-click deployment (including offline integration package)
Towards Studio-Grade Character Animation via In-Context Learning of 3D
A very simple framework for state-of-the-art NLP
A Conversational Speech Generation Model
Pre-trained Deep Learning models and demos
Transforming Multimodal Content into Captivating Multilingual Audio
Chat & pretrained large audio language model proposed by Alibaba Cloud
Virtual AI anchor that combines state-of-the-art technology
FAIR Sequence Modeling Toolkit 2
Mice speech to text with MX Cinnamon OS ISO