UI Automation Framework for Games and Apps
PRML algorithms implemented in Python
A Web UI for easy subtitle generation using the Whisper model
A library to generate LaTeX expressions from Python code
Python Audio Analysis Library: Feature Extraction, Classification
Flock is a workflow-based low-code platform for building chatbots
AI-powered tool for generating, optimizing, and translating subtitles
Interactive Machine Learning experiments
Qwen3-ASR is an open-source series of ASR models
An on-premises, OCR-free unstructured data extraction
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Pre-trained Deep Learning models and demos
Framework for building AI-powered interactive digital humans and agent
Towards Studio-Grade Character Animation via In-Context Learning of 3D
A Foundation Model for the Language of Financial Markets
Multilingual Document Layout Parsing in a Single Vision-Language Model
Code release for Cut and Learn for Unsupervised Object Detection
Bailing is a voice dialogue robot similar to GPT-4o
Chinese XLNet pre-trained model
Framework for building neural networks
Refer and Ground Anything Anywhere at Any Granularity
End-to-end speech processing toolkit
PyTorch code and models for VJEPA2 self-supervised learning from video
Language modeling in a sentence representation space