Hackable CLI tool for converting Markdown files to PDF using Node.js
A simple and hackable CLI tool for converting markdown to pdf. It uses Marked to convert markdown to HTML and Puppeteer (headless Chromium) to further convert the HTML to PDF. It also uses highlight.js for code highlighting. The whole source code of this tool is only ~250 lines of JS ~500 lines of Typescript and ~100 lines of CSS, so it is easy to clone and customize.
A high-quality tool for convert PDF to Markdown and JSON
MinerU is an open-source, high-quality document extraction toolkit focused on converting PDFs (and other document formats) into structured Markdown and JSON. It leverages OCR and layout analysis to preserve semantic structure and metadata, ideal for research and data science workflows.