Showing 40 open source projects for "pdf to text"

View related business solutions
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    java-pdf-table-extractor-lib

    java-pdf-table-extractor-lib

    Java Pdf Table extraction library

    The command line application is an example of usage of the Java library. The library is based on pdfbox library and works by looking for the layout of each selected pdf page, and looking for table structure patterns. After calling the library (passing the pdf filename, and the page range), the result is a List<PdfTextElement>. PdfTextElement is an interface that has two implementations. * A basic text (outside the tables) * And PdfTextTabulaElement, for table structures. That implementation allows reading table dimensions and the text of each cell of the table. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    ant4docbook

    ant4docbook

    ANT4DOCBOOK is an ANT task for DOCBOOK

    ANT4DOCBOOK is an ANT task for DOCBOOK, a semantic markup language for technical documentation.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    OmegaT - multiplatform CAT tool

    OmegaT - multiplatform CAT tool

    The free computer aided translation (CAT) tool for professionals

    OmegaT is a free and open source multiplatform Computer Assisted Translation tool with fuzzy matching, translation memory, keyword search, glossaries, and translation leveraging into updated projects.
    Leader badge
    Downloads: 1,596 This Week
    Last Update:
    See Project
  • 4
    jPicEdt

    jPicEdt

    Another drawing editor for LaTeX with PSTricks & TikZ

    jPicEdt is an extensible internationalized vector-based drawing editor for LaTeX and related packages (TikZ, PsTricks,...), written in Java. It is also a library of reusable high-level graphic primitives.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 8 Monitoring Tools in One APM. Install in 5 Minutes. Icon
    8 Monitoring Tools in One APM. Install in 5 Minutes.

    Errors, performance, logs, uptime, hosts, anomalies, dashboards, and check-ins. One interface.

    AppSignal works out of the box for Ruby, Elixir, Node.js, Python, and more. 30-day free trial, no credit card required.
    Start Free
  • 5
    iSphere

    iSphere

    The iSphere Project for and RDi 9.5.1.3+

    The iSphere Source Code has been moved to GitHub (https://github.com/rdi-open-source/isphere-plugin) on January 3rd, 2024. Important: The update site and ticket management has been moved to GitHub as well. iSphere is an open source plug-in for IBM's Rational Developer for i 9.5.1.3+. It delivers high quality extensions to improve developer productivity. IBM's current Eclipse based Integrated Development Environment (IDE) is a huge step beyond SEU, but it still lacks features...
    Leader badge
    Downloads: 205 This Week
    Last Update:
    See Project
  • 6
    Delphi : VRCalc++ and more Binary Exec

    Delphi : VRCalc++ and more Binary Exec

    Delphi Java - VRCalc++ OOSL (Script) and + (Binary Exec Distro)

    Vincent Radio {Adrix.NT} Embarcadero : Delphi : Executable Binaries Delphi : VRCalc++ Object Oriented Scripting Language : Engine + Ext Libraries VRCalc++ OOSL Visual Stage Project : VCL & FMX (FireMonkey) VRCalc++ Script Executor: - VCL Console - Terminal Console - FMX Console + VRCalc++ OOSL : VR System Scripted Standard Runtime Library Delphi Applics - VR Multi Editor : Smart Text Editor - VR Lazy Code Editor : Smart RTF Multi Lang Code Text Editor - VR Astro Vision...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Delphi : VRCalc++ OOSL (Script) and more

    Delphi : VRCalc++ OOSL (Script) and more

    Delphi : VRCalc++ OOSL & + (Paged List, TextEditor, VRAstroVision ...)

    Vincent Radio {Adrix.NT} Sources Library & Applications : Delphi C++ Java VRCalc++ C# VRCalc++ Object Oriented Scripting Language - Engine Source Pascal Code - Delphi Packages Build Prjs - VRCalc++ Scripted System Std RT Library - Guides & Docs (CHM, PDF, DOCX) - VCL & FMX (FireMonkey) Support - Script Test Code (Lang RTL VCL FMX) - Visual Stage Project : VCL & FMX Paged Lists & Iterators : Delphi C++ Java C# Multi-Dim Arrays & Direct Graph Classes : Delphi C++ Java VRCalc++...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 8
    Common Resource Grep - crgrep

    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives and other difficult to access resources. A command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image meta-data, scanned documents, maven dependencies and web resources. CRGREP will search resources within resources of any arbitrary combination or depth, so text within a document within a zip archive, and so on. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    Next Generation Programming

    Next Generation Programming

    Compose Software Without Writing Any Programing Code

    "Next Generation Programming - Programming Without Coding Software" is a drag-drop wizard for creating simple or complex applications without writing any programming language code The Software is coded/designed with "Java Programming Language" for novice/expert programmers; Programmers can write softwares with visual tools : drag-drop components;visual editors... Programmers can use the software to compose of simple/complex applications : Database programs, circuit design, generate...
    Downloads: 0 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 10
    LaTeXDraw

    LaTeXDraw

    Vector drawing program for LaTeX using PSTricks

    LaTeXDraw is a graphical drawing editor for LaTeX. LaTeXDraw can be used to 1) generate PSTricks code; 2) directly create PDF or PS pictures.
    Leader badge
    Downloads: 101 This Week
    Last Update:
    See Project
  • 11
    A fast and reliable tool, written in Java 8, for generating professional UML sequence diagrams from text. Supports actors, con-/destructors, threads, broadcasts, notes, and fragments. Can serve as a Javadoc taglet. Exports PDF, (E)PS, SVG, PNG, ... The source code is hosted at http://github.com/sdedit/sdedit
    Downloads: 6 This Week
    Last Update:
    See Project
  • 12

    ConcatPDF

    PDF Concatenation Tool

    ConcatPDF is the tool to concatenate PDF files. It can concatenate, extract, encrypt, decrypt, configure PDF files, convert image files to PDF. GUI version and CUI version are both available. iText.NET is iText porting on .NET Framework by J#. This library allows you to generate PDF, (X)HTML, XML, RTF files on Microsoft.NET Framework including ASP.NET.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 13
    iText®, a JAVA PDF library

    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest...
    Leader badge
    Downloads: 185 This Week
    Last Update:
    See Project
  • 14

    JSentiWordNet

    A wrapper for the famous SentiWordNet, a resource for opinion mining

    ...As defined by the authors : SentiWordNet assigns to each synset of WordNet three sentiment scores: positivity, negativity, objectivity. You can find additional information about the creation of SentiWordnet here : http://nmis.isti.cnr.it/sebastiani/Publications/LREC06.pdf sentiWordnet (avilable here : https://drive.google.com/open?id=0B0ChLbwT19XcOVZFdm5wNXA5ODg) is a text file with a specific format that saves a synset on each line. In this project, we aim to provide a wrapper that allows developers to use SentiWordnet without the need to interact directly with the text file. The project allows simple interrogation using a simple GUI and a REST Web service.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    eXtensible Text Framework (XTF)

    Framework for search and display of heterogenous document collections.

    NOTICE: This code repository is deprecated. Please visit https://github.com/cdlib/xtf for the latest updates. Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    eLibrary

    Personalized Search Engine for Commonly Used Files

    ...It differs from general OS file search engines in that it personalizes the indexing setup so that users can choose which directories to index or remove from an existing index and it can also suggest queries just like Google's "Did you mean" feature. The customization of indexing and query suggestion greatly improves search speed and make user experience more comfortable. eLibrary can also extract text content from files of many wildly used file types such as pdf, doc, ppt, and mp3 to improve the index quality.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    DJVU++

    DJVU++

    The DjVu complete solution,with OCR Technology(Arabic ,English).

    DjVu++ is a user-friendly program that used to manipulate DjVu file formats such as eBooks with a penalty of editing features. The program introduce a free replacement for the property PDF format with similar resolution and smaller file size DjVu++ also support OCR to handle text in scanned books and images. The program shows good performance for English. In addition to the Arabic language to lead free and commercial software in this area. The main features of DjVu++ program are: o Manipulate DjVu files. o Support smaller size than PDF with the same performance. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 18

    PDF Comaprision JINI

    This project is forged to compare two PDFs

    This project is forged to compare two PDFs . IT uses following approach in compression 1 . Extract All text of both pdfs and compare them Page by Page 2. Extract all images from both PDF and save in folders and then compare them one by one and save difference in Difference Folder 3. Convert PDF 1 and 2 pages to JPG and compare them one by one
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Aspose for Spring.Java

    Aspose for Spring.Java

    Aspose for Spring.Java provides Aspose API samples for Spring.Java.

    Aspose for Spring.Java provides usage samples / sourcecodes for the demonstration of Aspose API for Java by extending famous Spring Java PetClinic Sample Web Application. This extension of "Spring Java PetClinic Sample Web Application" also shows the Integration of Aspose APIs with Spring MVC, JSP and Maven Project. Latest Release: v1.1
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Jedi

    Jedi

    Java Enhanced Data Interface - Italian Senate Project

    JEDI is a J2EE application that provides a centralized service aiming at significantly simplify the generation of data driven documents in a enterprise environment. The documents (hereafter called "JEDI documents") can have different format types: pdf, excel, rtf, plain text data streams and xml streams. A JEDI document is an instance of the so called "managed document" (i.e. the configuration data and a particular set of rules), configured by a developer into the JEDI configuration database. The provision of a JEDI document is based on a synchronous processing performed by the JEDI core engine triggered by an end-user request (through HTTP request or RMI call). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    JPDF Tools
    JPDF Tools is a GUI java program built on the JPDF Export library. Its main aim is to create pdf files by inserting texts, images or tables. Users can also merge PDF files, split PDF files, merge images into PDF files and soon convert from and to PDF files.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22

    Simple Use Case Documentation Tool

    A simple tool to document use cases.

    This is a simple tool with a user interface to key in use case text and spell out PDF documentation. Each use case will have the following captured. 1) Use Case Name 2) Page over view. 3) Pre conditions 4) Wireframe (image) 5) Business Rules 6) Main Flow 7) Alternate Flow 8) Post Condition 9) Error scenarios 10) Technical details Usability Features: 1) Open existing usecase for editing. 2) Save copy of a use case with new use case name. 3) Retain use case export order. 4) Information messages are shown in notification bar with 'fade' effect. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Task for ANT to produce documentation with (PDF)LaTeX using BibTeX, Makeindex and GlossTeX.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    JPDF Export
    JPDF Export is a java library built on the famous iText library. It provides simple functions that can be used to build complex pdf files. It also provides simple classes to merge, split and convert pdf files
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    Estimation Reporting
    Its a pure Java Reporting Library and Reportdesigner for viewing, printing, exporting (xls, pdf, html, rtf) of professional looking reports. Supports Text-, Image- and Chart-Fields, multiple Grouping, XML-Templates and different Datasources.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB