Showing 75 open source projects for "pdf ocr windows"

View related business solutions
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • Atera - an All-in-one platform for IT management Icon
    Atera - an All-in-one platform for IT management

    Ideal for IT departments and MSPs (managed service providers)

    Your IT essentials, integrated & elevated. Take your IT management from automated to autonomous, download Atera's agent to start your free trial!
    Try Atera now
  • 1

    mcf2pdf

    mcf2pdf converts files of the "My CEWE Photobook" software to PDF

    mcf2pdf converts .mcf files of the "My CEWE Photobook" software (see http://www.cewe-photobook.co.uk/ or http://www.cewe-fotobuch.de (german)) to PDF files, so you can better preview the results and even send them to others by e-mail. This project has moved to GitHub. Please visit https://github.com/albrechtf/mcf2pdf/releases for download of latest version.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 2
    MathOCR

    MathOCR

    A scientific document recognition system

    MathOCR is a printed scientific document recognition system. MathOCR is still in the pre-alpha stage, recognition result may not be good enough for practical purposes. MathOCR is a printed scientific document recognition system written in pure Java. MathOCR has the functionality of image preprocessing, layout analysis and character recognition, especially the ability to recognize mathematical expression. MathOCR can work do not depends on other OCR software.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    S1000D Transformation Toolkit
    The S1000D Transformation Toolkit provides a reference implementation supporting the transformation, packaging and viewing of S1000D data into a SCORM 2004 3rd Edition Content Package, Mobile Web Application and PDF.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    JBerd

    JBerd

    Java tools for decoding and manipulating BER encoded ASN.1 Files

    ...A set of Java facilities for writing BER applications that require BER decoding Go to the "files" section (link at the top of this page) to download a pdf of detailed documentation. Andrew Forsyth
    Downloads: 0 This Week
    Last Update:
    See Project
  • Your monitoring isn't a stack. It's a pile. Fix that. Icon
    Your monitoring isn't a stack. It's a pile. Fix that.

    Errors, performance, logs, uptime. One install, one invoice, one UI.

    Replace Datadog, New Relic, and Sentry without adding three more dashboards.
    Free 30 days.
  • 5
    OpenSearchServer Extractor

    OpenSearchServer Extractor

    A RESTFul/JSON Web Service for text and metata extraction

    An open source RESTFul Web Service for text , meta-data extraction and analysis. oss-text-extractor supports various binary formats: Word processor (doc, docx, odt, rtf) Spreadsheet (xls, xlsx, ods) Presentation (ppt, pptx, odp) Publishing (pdf, pub) Web (rss, html/xhtml) Medias (audio, images) Others (vsd, text)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    The goal of this project is to provide a reusable library to transform common file formats to content objects and ContentProvider plugins to common file repositories like Filesystem, CMIS and others for iQser GIN Semantic Middleware (www.iqser.com).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Regain is a Java search engine based on Jakarta Lucene. It provides indexing and searching files for plenty of formats (HTML,XML,doc(x),xls(x),ppt(x),oo,PDF,RTF,mp3,mp4,Java). A TagLibrary eases integrating search results in your JSP based web page.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 8
    Texlipse is a plugin that adds Latex editing support for the popular Eclipse Java IDE. Key features include: syntax highlight, command completion, bibliography completion, outline navigation and automatic building.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9

    Detexter

    Detexter is an app designed to extract text from PDF files.

    Detexter lets you extract text from multiple PDF files. Detexter uses the PDFBox library for its text extraction.
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • 10
    jPod is a rich PDF manipulation and rendering framework. A complete rendering library based on jPod is available here at "jPodRenderer". To see jPod & jPodRenderer at work, have a look at www.cabaret-solutions.com
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    jPod Renderer is based on the jPod library, also hosted here at "jpodlib". This is the long awaited release for platform specific rendering code, both on AWT and SWT. To see jPod and jPod Renderer at work, have a look at www.cabaret-solutions.com
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    Wordpress PDF Blog Export

    Aplicación JAVA que genera un PDF a partir de un XML de Wordpress

    Pequeña aplicación desarrollada con JAVA que convierte un archivo XML generado desde Wordpress en un archivo PDF. Queria hacer un libro de mi blog, pero las herramientas que encontré no incluïan los comentarios en el documento generado. Entonces, a la par que aprendía a utilizar las librerias de java iText y jSoup desarrollé esta utilidad en un archivo jar ejecutable. Para utilizarlo simplemente necesitaremos tener java instalado en nuestro PC. El archivo pdf generado se puede utilizar...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    File Type Checker checks the file data to determine the actual file type. As of this writing filetypechecker supports doc, rtf, xls, pdf, jpg, jpeg, and gif. more file support will be added soon.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    stymaker

    stymaker

    Create your own LaTeX style.

    Stymaker is a GUI application assisting LaTeX users with creating their own style packages. After filling a simple form one can get a new package file corresponding to the chosen settings. This package may be included in preamble of LaTeX document by: \usepackage{mystyle} The new package, based on standard LaTeX packages, allows changing of the document layout, or appearance of standard environments like such as lists. While testing new settings one may instantly view actual changes in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    jPod is a PDF manipulation and rendering framework. This release contains the documented features, including reading, manipulating and writing. More features to be released as API matures. To see jPod at work, have a look at www.cabaret-solutions.com
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    eLML - eLesson Markup Language
    eLML (eLesson Markup Language) is an XML framework for creating structured eLessons based on a pedagogical model. eLML consists of an XMLSchema and XSLT files to create XHTML, PDF, LaTeX, IMS CP and SCORM versions, standards supported by most LMS.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Copperhead is a small and simple library providing a Swing user interface that allows one to automatically generate PDF documents from annotated objects using the iText PDF library. Copperhead is developed under GPLv3. Please download Copperhead 0.1b for iText 2 and 0.2b for iText5. Read more on http://byteality.ch/blog. Enjoy!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Kabeja is a java library for parsing DXF and converting to SVG (dxf2svg). The library supports the SAX-api and can integrated into other applications (Cocoon,Batik). Tools for converting svg to jpeg, tiff, png and pdf are included .
    Leader badge
    Downloads: 65 This Week
    Last Update:
    See Project
  • 19
    an images to pdf converter
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Calenco XML CMS
    Calenco is a Web collaborative platform that enable remote teams of writers, proofreader, graphic designers, translators, etc. to produce together XML documents like user guides, security procedures, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Foxon is a FO emitter/indenter to be used with Saxon. It can indent and prettify XSL-FO output, making it suitable for human inspection and editing, without introducing artefacts that change the layout of the PDF file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    xccdf2pdf renders XCCDF documents in PDF and other formats.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    This project provides a toolkit and framework based on PDFBox for document analysis of PDF files and performing custom conversion tasks and is published under the Apache licence. A GUI is also included, and is published using the GPL licence.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    This library provides tools to make a PDF/A preflight on a PDF document. It is highly based on apache PDFBOX. Conformance to the ISO 19005 (PDF/A) norm is checked. The goal is to pass completely the isartor test.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    iText is a free open source Java-PDF library released on SF under the MPL/LGPL; iText comes with a simple GUI: the iText toolbox. The original developers of iText want to publish this toolbox as a separate project under the more permissive MIT license.
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo