LLM Large Model of Selling Anchor
Open source AI VTuber platform with voice chat and Live2D avatars
NLP Cloud serves high performance pre-trained or custom models for NER
Han Language Processing
Capable of understanding text, audio, vision, video
Real-time voice interactive digital human
Qwen3-ASR is an open-source series of ASR models
Large Audio Language Model built for natural interactions
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Persian NLP Toolkit
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Conversational voice AI agents
AI-powered tool for generating, optimizing, and translating subtitles
Framework for building neural networks
Qwen3-omni is a natively end-to-end, omni-modal LLM
Open source AI wearable platform for recording and summarizing speech
A Web UI for easy subtitle using whisper model
Bailing is a voice dialogue robot similar to GPT-4o
Python Audio Analysis Library: Feature Extraction, Classification
Powerful Android AI agent with tools, automation, and Linux shell
Multi-modal large language model designed for audio understanding
Towards Studio-Grade Character Animation via In-Context Learning of 3D
Industrial-strength Natural Language Processing (NLP)
Framework for building AI-powered interactive digital humans and agent
Data manipulation and transformation for audio signal processing