Document (PDF, Word, PPTX ...) extraction and parse API
Python module for parsing semi-structured text into python tables
A fast, helpful, and open-source document parser
JavaScript parser and stringifier for YAML
Parse files for optimal RAG
Java library for parsing and rendering CommonMark (Markdown)
A JavaScript library for parsing and formatting chords and chord sheet
Markdown parser, done right. 100% CommonMark support, extensions
A fast, powerful, CommonMark compliant, extensible Markdown processor
Parse text and tables from PDF files.
Ksoup is a lightweight Kotlin Multiplatform library for parsing HTML
Parser generator to read, process, or translate structured text
A machine learning software for extracting information
Zero-copy PDF text extraction library written in Zig
RAG-Anything: All-in-One RAG Framework
Multilingual Document Layout Parsing in a Single Vision-Language Model
An incremental parsing system for programming tools
A post-modern modal text editor
A python library that makes AMR parsing, generation and visualization
Chat with it via text and voice
Semantic search and document parsing tools for the command line
Fast and efficient unstructured data extraction
OCR software, free and offline
Tree-sitter bindings for Emacs Lisp
Persian NLP Toolkit