PyMuPDF is a high-performance, Python-centric library for reading, extracting, and manipulating PDFs with ease and precision. It enables developers to access text, images, fonts, annotations, metadata, and structural layout of PDF documents, and to perform tasks such as extracting content, editing objects, rendering pages, searching text, modifying page content, and manipulating PDF components like links and annotations. PyMuPDF also supports advanced operations like splitting, merging, inserting, or deleting pages; drawing and filling shapes; handling color spaces; and converting between formats. The library is lightweight but robust, optimized for speed and low memory overhead. On top of the base PyMuPDF, PyMuPDF Pro adds support for reading and writing Microsoft Office-format documents and enhanced functionality for integrating Large Language Model (LLM) pipelines and Retrieval Augmented Generation (RAG).