Large Language Model Text Generation Inference
Document (PDF, Word, PPTX ...) extraction and parse API
Module for automatic summarization of text documents and HTML pages
High-performance inference server for text embeddings models API layer
AI tool that removes hardcoded subtitles and text from videos locally
Persian NLP Toolkit
Han Language Processing
Robust Speech Recognition via Large-Scale Weak Supervision
A full spaCy pipeline and models for scientific/biomedical documents
Underthesea - Vietnamese NLP Toolkit
Open source healthcare AI
OCR model for complex documents with layout-aware structured outputs
Contexts Optical Compression
OCRmyPDF adds an OCR text layer to scanned PDF files
Python & command-line tool to gather text on the Web
Toolkit for conversational AI
Comprehensive Gradio WebUI for audio processing
The most accurate natural language detection library for Python
A Repo For Document AI
Translate the video from one language to another and embed dubbing
Easy-to-use and powerful NLP library with Awesome model zoo
Agent harness to make your slop code well-engineered and beautiful
Generate audiobooks from EPUBs, PDFs and text with captions
A high-quality PDF to Markdown tool based on large language model
Easy-to-use and high-performance NLP and LLM framework