Document (PDF, Word, PPTX ...) extraction and parse API
Python module for parsing semi-structured text into python tables
A fast, helpful, and open-source document parser
A JavaScript library for parsing and formatting chords and chord sheet
JavaScript parser and stringifier for YAML
Parse files for optimal RAG
Java library for parsing and rendering CommonMark (Markdown)
Markdown parser, done right. 100% CommonMark support, extensions
Parse text and tables from PDF files.
A machine learning software for extracting information
Ksoup is a lightweight Kotlin Multiplatform library for parsing HTML
Parser generator to read, process, or translate structured text
A fast, powerful, CommonMark compliant, extensible Markdown processor
Zero-copy PDF text extraction library written in Zig
RAG-Anything: All-in-One RAG Framework
A python library that makes AMR parsing, generation and visualization
An incremental parsing system for programming tools
Chat with it via text and voice
Multilingual Document Layout Parsing in a Single Vision-Language Model
A post-modern modal text editor
Fast and efficient unstructured data extraction
Tree-sitter bindings for Emacs Lisp
OCR software, free and offline
Persian NLP Toolkit
Convert notion pages, block and list of blocks to markdown