Build gen AI apps with an all-in-one modern database: MongoDB Atlas
MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
Start Free
Deliver secure remote access with OpenVPN.
Trusted by nearly 20,000 customers worldwide, and all major cloud providers.
OpenVPN's products provide scalable, secure remote access — giving complete freedom to your employees to work outside the office while securely accessing SaaS, the internet, and company resources.
openPDF is based on a several open source software products, such as iText, JPedal, CryptoApplet among others. Allow users to view/modify PDF documents and forms, barcodes generation, data extraction and signature validation.
This library provides tools to make a PDF/A preflight on a PDF document. It is highly based on apache PDFBOX. Conformance to the ISO 19005 (PDF/A) norm is checked. The goal is to pass completely the isartor test.
Bookmark sync tool for ebook readers with xml bookmark system. Primarily focussed on devices based on 4FFF N618 ebook reader, wich uses Adobe Digital Editions for content sync but have no bookmark sync.
SplitPDF -SplitPDF.jar- is a ‘command-line driven’ Java-program, it splits a PDF-file by bookmarks into separated PDF’s. The bookmark is used as title for the newly created PDF. Extremely usefull and fast in a batch processing environment.
No-Nonsense Code-to-Cloud Security for Devs | Aikido
Connect your GitHub, GitLab, Bitbucket, or Azure DevOps account to start scanning your repos for free.
Aikido provides a unified security platform for developers, combining 12 powerful scans like SAST, DAST, and CSPM. AI-driven AutoFix and AutoTriage streamline vulnerability management, while runtime protection blocks attacks.
ElateXam is a complete toolsuite for electronic exams. It includes several task types (multiple choice, cloze texts, free texts, mapping, drawing, autotool), correction tools, analysis and export features. It's used at the university of Leipzig.
This project creates a command line java application that uses OpenOffice.org in a headless mode to convert a document to the pdf file format. The source document had to be in a file format that OpenOffice.org can open.
GribCS is a .NET library to read GRIB (GRid in Binary) files. GRIB is a gridded data standard from WMO (World Meteorological Organisation) and is used by many meteorological organisation. Based on "Java GRIB reader".
cobol2j reads or writes COBOL or RPG data files imported from mainframes, AS/400 or Baby/36 environment. Packed decimal, zoned or packed date fields decoding included. EBCDIC conversion. ETL ISAM data to any other platform. PC Cobol ( ASCII ) supported.
Trusted by 150 million+ creators and businesses globally
Unlock Picsart's full editing suite by embedding our Editor SDK directly into your platform. Offer your users the power of a full design suite without leaving your site.
ANts P2P realizes a third generation P2P net. It protects your privacy while you are connected and makes you not trackable, hiding your identity (ip) and crypting everything you are sending/receiving from others.
Java to PDF. For now, it only has a function to combine multiple image files (.jpg, .bmp or .png) into a single pdf file. Tools: NetBeans 6.9 and JDK 1.6
CNV Workshop is a web-enabled platform for analyzing genome variation such as copy number variation (CNV). Learn about CNV Workshop in our associated BMC Bioinformatics manuscript: http://www.biomedcentral.com/1471-2105/11/74
The COSMAT project provides a RESTful web service named COSMATService, that extracts data comming from a pdf file and translates the content to several languages. The returnes extractions and tranbslations are encoded in the tei format.
Note as of 2013-09-13: I'm moving this project over to github due to this:
http://www.gluster.org/2013/08/how-far-the-once-mighty-sourceforge-has-fallen/
Feel free to rejoin the more updated versions on
https://github.com/mnott/PDFOCRWrapper
Thanks.
Matthias
--
This is a wrapper written in Java that allows to recursively iterate a directory structure and call an OCR engine on each found PDF on the condition that it hat not yet been called for that PDF. It works well...
Farben is a Java-based universal binary file reader and writer. Easily view binary files hierarchically. Edit embedded file metadata in a variety of formats. Identify orphaned files. A wide variety of file formats will be included.
Booletin es un buscador de Boletines oficiales (BOE, BOCM, etc.), que incluye un sistema de alertas por correo electrónico. Utiliza Apache Lucene para indexar el contenido en pdf de los boletines oficiales de España.
The PDF Form Generator module currently works with properties files only, but additional formats (such as csv, xml, tab delimited etc) will soon be supported.
The JODConverterService is written as a WCF application and provides functionality to convert documents such as .eml, .doc(x), .xls(x), etc. to the PDF format by using the Java library JODConverter which uses a service instance of OpenOffice.org.
A Java library for rendering forms on PDF (may be extended for other formats), based on a Template File (PDF or other type), and an XML description of contents. This library uses the iText package (http://www.lowagie.com/iText/) for PDF manipulation.
PODR is a PHP mailmerging and converting library mostly designed to parse and convert ODT templates to DOC/PDF. Templating is based on Savant, Conversion uses a webservice of JODConverter. A filter is available to include runtime generated images.