Document (PDF, Word, PPTX ...) extraction and parse API
Python module for parsing semi-structured text into Python tables
A fast, helpful, and open-source document parser
JavaScript parser and stringifier for YAML
Java library for parsing and rendering CommonMark (Markdown)
Parse files for optimal RAG
A JavaScript library for parsing and formatting chords and chord sheet
Markdown parser, done right. 100% CommonMark support, extensions
Parse text and tables from PDF files.
A fast, powerful, CommonMark compliant, extensible Markdown processor
Ksoup is a lightweight Kotlin Multiplatform library for parsing HTML
Parser generator to read, process, or translate structured text
Machine learning software for extracting information
Zero-copy PDF text extraction library written in Zig
RAG-Anything: All-in-One RAG Framework
Multilingual Document Layout Parsing in a Single Vision-Language Model
An incremental parsing system for programming tools
A post-modern modal text editor
A Python library that makes AMR parsing, generation and visualization
Semantic search and document parsing tools for the command line
Chat with it via text and voice
Fast and efficient unstructured data extraction
OCR software, free and offline
Tree-sitter bindings for Emacs Lisp
Persian NLP Toolkit