MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
Start Free
Photo and Video Editing APIs and SDKs
Trusted by 150 million+ creators and businesses globally
Unlock Picsart's full editing suite by embedding our Editor SDK directly into your platform. Offer your users the power of a full design suite without leaving your site.
Note as of 2013-09-13: I'm moving this project over to github due to this:
http://www.gluster.org/2013/08/how-far-the-once-mighty-sourceforge-has-fallen/
Feel free to rejoin the more updated versions on
https://github.com/mnott/PDFOCRWrapper
Thanks.
Matthias
--
This is a wrapper written in Java that allows to recursively iterate a directory structure and call an OCR engine on each found PDF on the condition that it hat not yet been called for that PDF. It works well...
Booletin es un buscador de Boletines oficiales (BOE, BOCM, etc.), que incluye un sistema de alertas por correo electrónico. Utiliza Apache Lucene para indexar el contenido en pdf de los boletines oficiales de España.
Toolkit e-formulieren is een opensource toolkit voor het op een gebruikersvriendelijke manier kunnen maken en onderhouden van e-formulieren.
De Toolkit maakt gebruik van Orbeon, en ondersteunt XForms-compliant e-formulieren, evt. met voorinvulling.
Application to create PDF document on the fly from any source file format(PRN,HTML,TEXT,CSV) with complete mailing system and reports module. It's made over following library Pdf - IText Web Server- Simple Frame work Database- H2
Automatically embed Wikipedia topic information into PDF documents via pop up annotations. This relies on the Wikipedia Miner service that is also available on Sourceforge.
Events in Google calendars form the base for bills, to be completed with rates, text and other items. The program outputs bills in PDF format, which may be mailed. The program offers a simple debitors/creditors accounting.
SPASE Model is a collection of tools for working with the structured data model information. Tools can convert the relational version of the data model into various expressions, including XSD, XMI and PDF documentation.
Shared Questionnaire System(SQS) is a full-functional Optical Mark Reader(OMR) form processing system implemented in Java-Swing, XSL-FO and AJAX with straightforward GUIs. It is aimed at developing social platform to share knowledge about questionnaire.
Sales CRM and Pipeline Management Software | Pipedrive
The easy and effective CRM for closing deals
Pipedrive’s simple interface empowers salespeople to streamline workflows and unite sales tasks in one workspace. Unlock instant sales insights with Pipedrive’s visual sales pipeline and fine-tune your strategy with robust reporting features and a personalized AI Sales Assistant.
I looked for a Java Component to read a Interleaved 2 of 5 barcode and didn't find. So I did it. Eu procurei por um componente Java para a leitura de códigos de barra 2 int 5 e não encontrei. Então eu fiz. Contato: www.flavioschiavoni.blogspot.co
A simple application designed to help manage your family's medical prescriptions. It allows a family to track prescriptions per person and output them in PDF for dissemination. Hopefully it proves useful, and helps ease the stress of health care.
An image postprocessor for the DIY Book Scanner described on instructables.com and diybookscanner.org. Gets images ready for OCR or for PDF. Written in Java based on a partial port of the Leptonica image processing library.
Web Time Entry is a Web based project time entry and client billing system. It can track, approve, and manage employee time incurred for projects and clients you define. It can schedule projects to employees and track time incurred on project schedul
JODConverter automates conversions between office document formats using OpenOffice.org. Supported formats include OpenDocument, PDF, RTF, Word, Excel, PowerPoint, and Flash. It can be used as a Java library, a command line tool, or a Web application.
This project provides Lucene search to Mediawiki. It is designed to be easy to install, configure, and run. It provides real-time, multiple field indexing and searching as well as text indexing of standard attachment types (pdf, xls, doc, ppt, vsd).
JFlightLog is a FREE, graphical pilot log for Ms Windows and others(it's java). Saves as Excel XLS, PDF, CSV or TXT. Imports CSV from other logbook programs or spreadsheets. Prints entries directly. Auto-creates charts for visualizing flight times.
A graphical editor based on Visual Syntax of Semantics for Business Vocabulary and Rules(SBVR). SBVR VE is based on eclipse platform. For documentation, please refer to OPAALS deliverable http://files.opaals.org/OPAALS/Year_3_Deliverables/WP10/D10.14.pdf
Java package to study a clustering model described in the paper \"Novel Clustering Algorithm Based Upon Games on Evolving Network\" by Q. Li, Z. Chen, Y. He and J-P. Jiang (in arxiv: http://arxiv.org/pdf/0812.5064v1), generalizations and similar issues.
The PDF Forms Designer is a tool for manipulating PDF forms. It's a 100% Java, client-side application that allows users to graphically add, through a drag and drop mechanism, form fields, free text and images to new, and existing PDF documents.
DUPS can be used to present any set of images depicting a chronological sequence. It creates a set of HTML pages acting as a browser based picture/pdf viewer. The emphasis lies on displaying two pictures side by side enabling the user to compare them.
Web Based application to manage library materials and material borrowers. Includes a web search engine to find and reserve items by borrower ID. Painless installation, very extensible and has an intuitive U.I. Runs on MySQL. A great tool for any libr
YARFRAW is an open source Java API library for working with RSS/RDF/ATOM feeds. It offers an unified data model in builder style to help developers to build and to read feeds. It currently supports RSS 0.9x, RSS 1.0, RSS 2.0 and Atom 1.0/0.3
DataVision is a reporting tool similar to Crystal Reports. DV supports many data sources (JDBC, files) and many output formats (HTML, XML, PDF, LaTeX, Excel, delimited files, DocBook). DV includes a GUI editor. DV is embeddable. Reports are XML-based.
The aim of this project is to develop a Portable Document Format (PDF) importer for OpenOffice.org Writer based on XPDF. This project was inspired by the PDF importer within KWord.