Showing 43 open source projects for "pdf to text"

  • 1
    PDFsam

    PDFsam, a desktop application to split, merge, mix, rotate PDF files

    PDFsam Basic is our free and open-source desktop application to split, merge, extract pages, rotate and mix PDF files. PDFsam Visual is a powerful tool to visually compose PDF files, reorder pages, delete pages, split, merge, rotate, encrypt, decrypt, extract text, convert to grayscale, and crop PDF files. PDFsam Basic is written using JavaFX. Since version 4 it is released as a self-contained application that bundles a jlinked JDK, while version 3 requires a Java Runtime Environment 8 with JavaFX installed in order to run.
    Downloads: 141 This Week
  • 2
    MyBox

    Easy tools for PDF, image, file, network, data, and media

    javafx-desktop-apps pdf image ocr icc barcode color-palette text bytes markdown html archive compress digest video audio editor converter media
    https://github.com/Mararsh/MyBox
    Self-contained packages need neither a Java environment nor installation. JAR packages need Java 16 or higher.
    Downloads: 3 This Week
  • 3
    Teedy

    Lightweight document management system

    ...As a content-oriented document management system, the user interface is not cluttered with buttons and menus and works both on desktop and mobile. Document searching has never been easier thanks to the powerful full-text search engine in Teedy. You can search in images (embedded OCR), DOCX, ODT, TXT, PDF, and more. Verify or validate your documents with people in your organization using workflows.
    Downloads: 1 This Week
  • 4
    PDFLayoutTextStripper

    Converts a PDF file into a text file while keeping the layout

    Converts a PDF file into a text file while keeping the layout of the original PDF. Useful for extracting the content of a table or a form in a PDF file. PDFLayoutTextStripper is a subclass of the PDFTextStripper class (from the Apache PDFBox library).
    Downloads: 1 This Week
  • 5
    MagicalPdfEditor

    This is a small PDF editor based on OpenPDF Core and AndroidPdfViewer

    This is a small PDF editor based on OpenPDF Core and AndroidPdfViewer. As there are not many easy-to-use open-source PDF editors and PDF wizards, I decided to create a simple directory to resolve my issues. Here I have worked on two separate cores, added some functionality to them, and combined them to achieve my target. I am working on this repo, and any help will be appreciated. Just clone the project and trace the source code; it's really easy and clear. All functionality in...
    Downloads: 0 This Week
  • 6

    jmarkdownviewer

    Markdown viewer for Java

    This is a Markdown viewer for Java. Primarily it tries to display GitHub-styled Markdown files. Download the release jar file and run java -jar jmdviewer-x.y.jar, where x.y is the version number. You can pass the Markdown filename on the command line: java -jar jmdviewer-x.y.jar [filename.md]
    Downloads: 0 This Week
  • 7
    BRIC

    BRIC is a powerful tool for batch image processing.

    BRIC is a cross-platform batch image processor. You can convert, resize, rotate, and add watermarks to your images. Multiple file types are supported for input and output. The project started back in 2011 and was maintained for a couple of years. As of 2020, BRIC is again in active development, so some of the features described below might be outdated. Please be patient until everything is reviewed and rewritten.
    Downloads: 2 This Week
  • 8
    mbFXWords

    Analyze text. Diagonal read subject, predicate, obj. Search other pdf.

    ...JavaFX application, runs with Oracle Java Runtime Environment version 8, which includes JavaFX. NLP extensions: - Divide sentences into subclauses: segmentation. - Divide plain text: subject, predicate, object. - Count words: stemming. - Search for similar content: PDFs. Outputs the subject, predicate, and object of sentences in PDF and plain-text files. Provides a comfortable GUI. Automatic language detection.
    Downloads: 0 This Week
  • 9
    JPDF Viewer

    Your cross-platform Java Swing PDF viewer/reader

    A simple PDF viewer that lets you view, print, and extract the contents of your PDF file in just a few clicks. You can export the contents of the PDF in SVG or TXT format. The viewer also has a handy utility panel with search functions, thumbnails, and annotations. Get your PDF reader for Android: https://play.google.com/store/apps/details?id=com.ulm.pdfreader Get your word processor in pure Java:...
    Downloads: 6 This Week
  • 10
    OpenSearchServer Search Engine

    An open source search engine with RESTful API and crawlers

    ...Using the web user interface, the crawlers (web, file, database, etc.) and the client libraries (REST API, Ruby, Rails, Node.js, PHP, Perl), you will be able to quickly and easily integrate advanced full-text search capabilities into your application: full-text with basic semantics, join queries, boolean queries, facets and filters, document (PDF, Office, etc.) indexing, web scraping, etc. OpenSearchServer runs on Windows and Linux/Unix/BSD.
    Downloads: 0 This Week
  • 11
    ParseDoc 1.0

    Find text in files (Open-/Libre-Office, PDF, MS Office, RTF, TXT)

    This is a GUI application that scans files in Open-/Libre-Office, MS Office, PDF, RTF, and TXT formats.
    Downloads: 1 This Week
  • 12
    DJVU++

    The complete DjVu solution, with OCR technology (Arabic, English).

    DjVu++ is a user-friendly program used to manipulate DjVu files, such as eBooks, with plenty of editing features. The program introduces a free replacement for the proprietary PDF format, with similar resolution and smaller file size. DjVu++ also supports OCR to handle text in scanned books and images. The program shows good performance for English, and also for Arabic, where it leads free and commercial software in this area. The main features of the DjVu++ program are: o Manipulate DjVu files. o Support smaller size than PDF with the same performance. ...
    Downloads: 5 This Week
  • 13
    Omega Base

    Web-based knowledge base template.

    A Knowledge Base and document management system (DMS). With strong user management, security, and file indexing for search.
    Downloads: 0 This Week
  • 14

    PDF Comaprision JINI

    This project is built to compare two PDFs

    This project is built to compare two PDFs. It uses the following approach for comparison:
    1. Extract all text from both PDFs and compare it page by page.
    2. Extract all images from both PDFs, save them in folders, compare them one by one, and save the differences in a Difference folder.
    3. Convert the pages of PDF 1 and PDF 2 to JPG and compare them one by one.
    Downloads: 1 This Week
  • 15
    OpenSearchServer Extractor

    A RESTful/JSON Web Service for text and metadata extraction

    An open source RESTful Web Service for text and metadata extraction and analysis. oss-text-extractor supports various binary formats: word processor (doc, docx, odt, rtf), spreadsheet (xls, xlsx, ods), presentation (ppt, pptx, odp), publishing (pdf, pub), web (rss, html/xhtml), media (audio, images), and others (vsd, text).
    Downloads: 0 This Week
  • 16
    Secretica CRM

    An absolutely free customer relationship management (CRM) application

    An absolutely free customer relationship management (CRM) application developed by Codeiron Ltd. Available under the Codeiron Public License 1.0 (a BSD-style license, but slightly less permissive; check the License folder of the project files).
    Downloads: 0 This Week
  • 17
    iGi Reporter

    Write reports, read multiple Notepad, PDF, and Word files, design logos and pictures

    First problem: many programs need an internet connection to convert Word to PDF. With this program you can convert Word to PDF without internet.
    Second problem: many students don't know how to write a CV. With this program you can create a CV in one minute.
    Third problem: many students don't know how to write a report. With this program you can write a report at high speed.
    Fourth problem: if you want to read PDFs, extract text from PDFs, and extract text from Word documents. ...
    Downloads: 0 This Week
  • 18
    Coulson Plot Generator

    Graphics generator for pie diagram matrix from spreadsheets

    For comparative plus/minus spreadsheet data, Coulson Plot Generator application creates a multiple pie diagram, for a rapid visual comparison of subunit occupancy across a range of entities. Numbers of pies, segments, labels and colours are customisable. Creates an editable PDF from a comma separated text file. Now with zoom capability and more memory, for larger datasets. Save images in several formats including SVG.
    Downloads: 0 This Week
  • 19

    QueLang

    QueLang is a design tool for questionnaire design.

    This is the first implementation of QueLang. QueLang is a language I designed for questionnaire design and implementation. This software can compile your code (written in .ql text files) into a special .qlc format (a kind of database). Then it can read those .qlc files to open them in a viewer and export them to PDF format. It can also be used for designing exams and tests! Tested on: Linux Ubuntu 12.04, Windows 7 64-bit. QueLang can be run by double-clicking the .jar (or .exe) file. NO INSTALLATION (or compilation) IS NEEDED! ...
    Downloads: 0 This Week
  • 20
    Jedi

    Java Enhanced Data Interface - Italian Senate Project

    JEDI is a J2EE application that provides a centralized service aimed at significantly simplifying the generation of data-driven documents in an enterprise environment. The documents (hereafter called "JEDI documents") can have different format types: PDF, Excel, RTF, plain text data streams, and XML streams. A JEDI document is an instance of a so-called "managed document" (i.e. the configuration data and a particular set of rules), configured by a developer in the JEDI configuration database. The provision of a JEDI document is based on synchronous processing performed by the JEDI core engine, triggered by an end-user request (through an HTTP request or RMI call). ...
    Downloads: 0 This Week
  • 21
    JPDF Tools
    JPDF Tools is a GUI Java program built on the JPDF Export library. Its main aim is to create PDF files by inserting text, images, or tables. Users can also merge PDF files, split PDF files, merge images into PDF files and, soon, convert to and from PDF files.
    Downloads: 1 This Week
  • 22
    LaTeX Edit Kit
    LaTeX EditKit is a text-editing tool that can convert the edited text to LaTeX
    Downloads: 0 This Week
  • 23
    JPDF Export
    JPDF Export is a Java library built on the well-known iText library. It provides simple functions that can be used to build complex PDF files. It also provides simple classes to merge, split, and convert PDF files.
    Downloads: 0 This Week
  • 24
    DocInfoRetriever is a web-based document full-text search engine based on Lucene. It allows you to search the contents and metadata of documents. Supported document formats include doc, xls, pdf, odt, jpg, etc., and torrent files.
    Downloads: 0 This Week
  • 25
    PDF Presenter
    An easy to use presentation software with focus on PDF documents. External display / projector presentations: PDF, images, text and whiteboard with annotations; intuitive user interface, optimal mouse and pen input handling.
    Downloads: 68 This Week