Image processing in Python
Awesome multilingual OCR toolkits based on PaddlePaddle
Image polygonal annotation with Python
Contexts Optical Compression
Chat & pretrained large vision language model
OCRmyPDF adds an OCR text layer to scanned PDF files
An open source object detection toolbox based on PyTorch
NLP Cloud serves high performance pre-trained or custom models for NER
Ready-to-use OCR with 80+ supported languages
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Capable of understanding text, audio, vision, video
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
The leading agent orchestration platform for Claude
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
This repository contains the complete code and data for studying primo
Stanford NLP Python library for many human languages
A fast, powerful, and simple hierarchical vision transformer
The standard data-centric AI package for data quality and ML
Qwen3-omni is a natively end-to-end, omni-modal LLM
Convert AI papers to GUI
Refractoring ChatBot+LLM, Gpt-3.5-turbo, ChatGPT Bot/Voice Assistant
Framework for building neural networks
Refer and Ground Anything Anywhere at Any Granularity
Language modeling in a sentence representation space
Jittor is a high-performance deep learning framework