ofn-extract-objects.py free download

PyPDF

A pure-python PDF library capable of splitting, merging, cropping

pypdf is a pure Python library for working with PDF files, allowing developers to split, merge, rotate, encrypt, and extract content from PDFs. It’s an actively maintained fork of PyPDF2, improving performance, compatibility, and support for modern PDF standards. Suitable for both automation scripts and full-featured applications, pypdf handles PDFs without requiring external dependencies.

Downloads: 10 This Week

Last Update: 7 days ago

See Project

DocTR

Library for OCR-related tasks powered by Deep Learning

DocTR provides an easy and powerful way to extract valuable information from your documents. Seemlessly process documents for Natural Language Understanding tasks: we provide OCR predictors to parse textual information (localize and identify each word) from your documents. Robust 2-stage (detection + recognition) OCR predictors with pretrained parameters. User-friendly, 3 lines of code to load a document and extract text with a predictor.

Downloads: 12 This Week

Last Update: 2025-07-09

See Project

pikepdf

A Python library for reading and writing PDF, powered by QPDF

pikepdf is a Python library allowing the creation, manipulation, and repair of PDFs. It provides a Pythonic wrapper around the C++ PDF content transformation library, QPDF. Python + QPDF = “py” + “qpdf” = “pyqpdf”, which looks like a dyslexia test and is no fun to type. But say “pyqpdf” out loud, and it sounds like “pikepdf”. pikepdf is a library intended for developers who want to create, manipulate, parse, repair, and abuse the PDF format. It supports reading and write PDFs, including...

Downloads: 7 This Week

Last Update: 2025-11-10

See Project

LangExtract

A Python library for extracting structured information

LangExtract is a Python library developed by Google that leverages large language models (LLMs) to extract structured information from unstructured text—such as clinical notes, research papers, or literary works—based on user-defined instructions. It is designed to transform free-form text into reliable, schema-constrained data while maintaining traceability back to the source material. Each extracted entity is precisely grounded in its original context, allowing visual inspection and validation via automatically generated interactive HTML visualizations. ...

Downloads: 0 This Week

Last Update: 3 days ago

See Project

CNN for Image Retrieval

cnn-for-image-retrieval is a research-oriented project that demonstrates the use of convolutional neural networks (CNNs) for image retrieval tasks. The repository provides implementations of CNN-based methods to extract feature representations from images and use them for similarity-based retrieval. It focuses on applying deep learning techniques to improve upon traditional handcrafted descriptors by learning features directly from data. The code includes training and evaluation scripts that can be adapted for custom datasets, making it useful for experimenting with retrieval systems in computer vision. ...

Downloads: 0 This Week

Last Update: 3 days ago

See Project

BeaEngine 5

BeaEngine disasm project

BeaEngine is a C library designed to decode instructions from 16-bit, 32-bit and 64-bit intel architectures. It includes standard instructions set and instructions set from FPU, MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, VMX, CLMUL, AES, MPX, AVX, AVX2, AVX512 (VEX & EVEX prefixes), CET, BMI1, BMI2, SGX, UINTR, KL, TDX and AMX extensions. If you want to analyze malicious codes and more generally obfuscated codes, BeaEngine sends back a complex structure that describes precisely the...

Downloads: 2 This Week

Last Update: 2024-06-05

See Project

gditools

A Python program/library aimed at GD-ROM image files.

This Python program/library is designed to handle GD-ROM image (GDI) files. It can be used to list files, extract data, generate sorttxt file, extract bootstrap (IP.BIN) file and more. This project can be used in standalone mode, in interactive mode or as a library in another Python program (check the 'addons' folder to learn how). For your convenience, you can use the gditools.py GUI program supplied in the Files section (optional).

Downloads: 17 This Week

Last Update: 2020-05-06

See Project

Metabrain

The purpose of the Metabrain library is to give developers a way to extract this information from the Internet without resorting to natural language parsing or other complex techniques, using instead statistical methods and patterns/trends analysis.

Downloads: 0 This Week

Last Update: 2013-04-26

See Project

HtmlList

A python package to find repetitive format pattern in HTML pages and extract information from them using this pattern. The idea is that in pages that have some kind of a list, there will be a repetitive pattern for the human eye (the page format).

Downloads: 0 This Week

Last Update: 2013-04-23

See Project

pdftable

Python module and command line utility that analyzes XML output from the program pdftohtml in order to extract tables from PDF files. Outputs CSV.

1 Review

Downloads: 0 This Week

Last Update: 2013-04-22

See Project

Search Results for "ofn-extract-objects.py"

Showing 10 open source projects for "ofn-extract-objects.py"

PyPDF

DocTR

pikepdf

LangExtract

CNN for Image Retrieval

BeaEngine 5

gditools

Metabrain

HtmlList

pdftable

Search Results for "ofn-extract-objects.py"

Showing 10 open source projects for "ofn-extract-objects.py"

PyPDF

DocTR

pikepdf

LangExtract

CNN for Image Retrieval

BeaEngine 5

gditools

Metabrain

HtmlList

pdftable

Related Searches

Related Categories