Models for the spaCy Natural Language Processing (NLP) library
Framework for building AI-powered interactive digital humans and agent
End-to-end speech processing toolkit
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Integrating LLMs into structured NLP pipelines
Chinese XLNet pre-trained model
Framework for building neural networks
Powerful Android AI agent with tools, automation, and Linux shell
Multilingual Document Layout Parsing in a Single Vision-Language Model
Pre-trained Deep Learning models and demos
Multi-modal large language model designed for audio understanding
Refer and Ground Anything Anywhere at Any Granularity
Language modeling in a sentence representation space
The standard data-centric AI package for data quality and ML
Mice speech to text with MX Cinnamon OS ISO
Run GGUF models easily with a UI or API. One File. Zero Install.
A Python application to add watermarks (text or image) to PDF files
mice stt tts
Obsei is a low code AI powered automation tool
Label, clean and enrich text datasets with LLMs
Convert an image to text to spot intelligible words.
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Resources, corpora, and tools for Chinese natural language processing