Image processing in Python
Image polygonal annotation with Python
Contexts Optical Compression
OCRmyPDF adds an OCR text layer to scanned PDF files
Chat & pretrained large vision language model
State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX
Ready-to-use OCR with 80+ supported languages
2D and 3D face alignment library built using PyTorch
NLP Cloud serves high-performance pre-trained or custom models for NER
CLI tool to extract (meta)data from PDF and manipulate PDF files
Accurate × Fast × Comprehensive
Capable of understanding text, audio, vision, video
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
A simple tool for reading in poorly redacted documents
AI assistant based on large models that can actively think and plan
Convert AI papers to GUI
Qwen3-Omni is a natively end-to-end, omni-modal LLM
The standard data-centric AI package for data quality and ML
NOTICE OF CONSOLIDATION & PARTNERSHIP PENDING As of April 2026, the 20
Framework for building neural networks
Refer and Ground Anything Anywhere at Any Granularity
Language modeling in a sentence representation space
Refactoring ChatBot+LLM, GPT-3.5-turbo, ChatGPT Bot/Voice Assistant