A simple tool for reading in poorly redacted documents
Build voice-based LLM agents. Modular + open source
Qwen3-Coder is the code version of Qwen3
The leading agent orchestration platform for Claude
Python Audio Analysis Library: Feature Extraction, Classification
VMZ: Model Zoo for Video Modeling
A very simple framework for state-of-the-art NLP
Data manipulation and transformation for audio signal processing
A ranked list of awesome machine learning Python libraries
Integrating LLMs into structured NLP pipelines
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Pycorrector is a toolkit for text error correction
PRML algorithms implemented in Python
Towards Studio-Grade Character Animation via In-Context Learning of 3D
A library to generate LaTeX expression from Python code
Industrial-strength Natural Language Processing (NLP)
End-to-end speech processing toolkit
Multilingual Document Layout Parsing in a Single Vision-Language Model
Qwen3-ASR is an open-source series of ASR models
Code release for Cut and Learn for Unsupervised Object Detection
Jittor is a high-performance deep learning framework
AI assistant based on large models that can actively think and plan
Chinese XLNet pre-trained model
Pre-trained Deep Learning models and demos