A high-quality tool for converting PDF to Markdown and JSON
MinerU is an open-source, high-quality document extraction toolkit focused on converting PDFs (and other document formats) into structured Markdown and JSON. It uses OCR and layout analysis to preserve semantic structure and metadata, which makes it well suited to research and data science workflows.
Convert files like docx, xlsx, pptx, html, and more to Markdown
Bridgex is an open‑source graphical interface for converting files to Markdown, built in Python on PySide6 (Qt for Python). It aims to simplify access to the MarkItDown library through a straightforward, modular visual experience.
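For context, the kind of MarkItDown call that a graphical front end like this wraps can be sketched as below. This is a minimal illustration, not Bridgex's actual code: it assumes the `markitdown` package is installed, and the helper names `markdown_path_for`, `convert_to_markdown`, and `convert_and_save` are hypothetical.

```python
from pathlib import Path


def markdown_path_for(source: Path) -> Path:
    """Return the sibling .md path for a source document (hypothetical helper)."""
    return source.with_suffix(".md")


def convert_to_markdown(source: Path) -> str:
    """Convert one document to Markdown text with the MarkItDown library."""
    # Imported lazily so this module still loads when markitdown is not installed.
    from markitdown import MarkItDown

    converter = MarkItDown()
    result = converter.convert(str(source))
    return result.text_content


def convert_and_save(source: Path) -> Path:
    """Convert a document and write the Markdown next to it (assumed workflow)."""
    target = markdown_path_for(source)
    target.write_text(convert_to_markdown(source), encoding="utf-8")
    return target
```

Calling `convert_and_save(Path("report.docx"))` would write `report.md` beside the input file. Keeping the conversion in a plain function, separate from any window code, is one way to get the modularity the project describes.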
Features ✨
- Cross‑platform graphical interface.
- Efficient file‑to‑Markdown conversion.
- Modularity: easy to adapt and extend.
- Support for multiple input formats.
- Lightweight editing prior to saving.
Supported Formats...