PRML algorithms implemented in Python
Conversational voice AI agents
UI Automation Framework for Games and Apps
Data manipulation and transformation for audio signal processing
A Web UI for easy subtitle using whisper model
A library to generate LaTeX expression from Python code
Python Audio Analysis Library: Feature Extraction, Classification
Flock is a workflow-based low-code platform for building chatbots
A library for audio and music analysis, feature extraction
Models for the spaCy Natural Language Processing (NLP) library
Interactive Machine Learning experiments
AI-powered tool for generating, optimizing, and translating subtitles
Jittor is a high-performance deep learning framework
Qwen3-ASR is an open-source series of ASR models
Build voice-based LLM agents. Modular + open source
An on-premises, OCR-free unstructured data extraction
A very simple framework for state-of-the-art NLP
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
AI assistant based on large models that can actively think and plan
Pre-trained Deep Learning models and demos
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Capable of understanding text, audio, vision, video
The leading agent orchestration platform for Claude
Framework for building AI-powered interactive digital humans and agent