Showing 20 open source projects for "pdf to text"

View related business solutions
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1

    FOray

    Modular XSL-FO Implementation for Java.

    FOray is an open-source XSL-FO publishing system that is suitable for converting XML content into PDF and other document formats. Although not yet fully conformant with the XSL-FO standard, it is very useful for many applications.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    MyBox

    MyBox

    Easy Tools of PDF, Image, File, Network, Data, and Medias

    javafx-desktop-apps pdf image ocr icc barcode color-palette text bytes markdown html archive compress digest video audio editor converter media https://github.com/Mararsh/MyBox Self-contain packages need not java env nor installation. Jar packages need Java 16 or higher.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Provides optical character recognition (OCR) solutions for Vietnamese language.
    Leader badge
    Downloads: 124 This Week
    Last Update:
    See Project
  • 4
    jPicEdt

    jPicEdt

    Another drawing editor for LaTeX with PSTricks & TikZ

    jPicEdt is an extensible internationalized vector-based drawing editor for LaTeX and related packages (TikZ, PsTricks,...), written in Java. It is also a library of reusable high-level graphic primitives.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 5
    OCR Manga Reader for Android

    OCR Manga Reader for Android

    Android Manga reader with Japanese OCR and dictionary capabilities

    OCR Manga Reader is a free and open source Android app that allows you to quickly OCR and lookup Japanese words in real-time. It does not have ads or telemetry/spyware and does not require an Internet connection. Supports both EDICT and EPWING dictionaries. Requires Android 4.0 (Ice Cream Sandwich) or higher. See http://ocrmangareaderforandroid.sourceforge.net/ for details.
    Leader badge
    Downloads: 33 This Week
    Last Update:
    See Project
  • 6
    LaTeXDraw

    LaTeXDraw

    Vector drawing program for LaTeX using PSTricks

    LaTeXDraw is a graphical drawing editor for LaTeX. LaTeXDraw can be used to 1) generate PSTricks code; 2) directly create PDF or PS pictures.
    Leader badge
    Downloads: 55 This Week
    Last Update:
    See Project
  • 7
    BRIC

    BRIC

    BRIC is a powerful tool for batch image processing.

    Bric is a cross-platform batch image processor. You can convert, resize, rotate and add watermark to your images. Multiple file types are supported for input and output. The project started back in 2011 and was maintained for a couple of years. In 2020 BRIC is again in active development, so some of the features written below might be outdated. Please be patient, until everything is reviewed and rewritten.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    FidoCadJ

    FidoCadJ

    Simple and intuitive 2D vector drawing for electronics and not only.

    A multiplatform vector drawing program with a complete library of electronic symbols. Schematics and drawings are stored in a very compact text format. There is no netlist concept behind the drawings (so no simulation, and this is a choice) but this allows a great graphical flexibility and ease of use, making FidoCadJ the perfect tool for exchange sketches in forum and newsgroup discussions with a few clicks. Drawings can be exported in several graphic formats, such as pdf. ...
    Downloads: 41 This Week
    Last Update:
    See Project
  • 9
    PDF Clown

    PDF Clown

    General-Purpose PDF Library for Java and .NET

    PDF Clown is a general-purpose Java and .NET library for manipulating PDF files through multiple abstraction layers, rigorously adhering to PDF 1.7 specification (ISO 32000-1). This project aims to provide a universal access to PDF files (creation, reading, editing, rendering...) through an accurate and elegant object-oriented API. * Features: http://pdfclown.org/overview/features/ * Overview: http://pdfclown.org/overview/architecture/ * Website: http://pdfclown.org/ * Blog:...
    Downloads: 4 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    JBarcodeReader
    JBarcodeReader is a software to read files or images of barcodes, is a easy way to copy in text format a description of barcodes. Open and drag the JBarcodeReader above image or file, and click in 'Capture' button, and copy the description.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Armchair File Manager

    Armchair File Manager

    Remote control your home theater PC from across the room

    Use the Armchair File Manager to control your Windows home theater PC using its remote control. Perform light-duty computing tasks from across the room without a keyboard or mouse. Armchair works best with a PC connected to a widescreen television.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    PDF Presenter
    An easy to use presentation software with focus on PDF documents. External display / projector presentations: PDF, images, text and whiteboard with annotations; intuitive user interface, optimal mouse and pen input handling.
    Leader badge
    Downloads: 41 This Week
    Last Update:
    See Project
  • 13
    Shared Questionnaire System
    Shared Questionnaire System(SQS) is a full-functional Optical Mark Reader(OMR) form processing system implemented in Java-Swing, XSL-FO and AJAX with straightforward GUIs. It is aimed at developing social platform to share knowledge about questionnaire.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    PDF Annot is a piece of software that enables you to add audio and text annotation to a PDF. It uses JPedal SimpleViewer and iText library. Annotations are supported by Adobe'sofficial PDF Reader. Report any bug here: krakosia[at]gmail.com
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    This project aims to create a single easy to use GUI wrapper for ghostscript and tesseract to allow scanned pdf to plain text or HTML for scanned documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Early Access iText, a PDF generation library in Java
    Downloads: 7 This Week
    Last Update:
    See Project
  • 17
    The Writers Forge is a fiction authoring suite, an IDE for writers. The tool suite will provide integrated support for writing screenplays and prose, and developing plot and character. The backend will support many target formats, including XML and PDF.
    Leader badge
    Downloads: 6 This Week
    Last Update:
    See Project
  • 18
    Powerful cataloguing software for various types of files (audio, video, various text documents, software packages etc.) based on XML technologies and thus providing broad capabilities for data manipulation and reporting (text, HTML/XHTML/PDF, RTF, whateve
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    METAGRAF is an Application written in pure Java (platform independent) to create any kind of graphics, including text and images, and that outputs a MetaPost file that can be converted in PostScript, PDF ...etc., or can be included directly in any TeX/LaT
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    The Watermarks is a java library for images and text watermarking fingerprinting and tamper-proofing. The supported carrier formats for a watermark are JPEG and PDF. The project also aim to build a test environment to evaluate robustness of implemented a
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB