Showing 110 open source projects for "documents"

  • 1
    TeXtidote

    Spelling, grammar and style checking on LaTeX documents

    If so, you probably know that the process is far from simple. Since LaTeX documents contain special commands and keywords (the so-called "markup") that are not part of the "real" text, you cannot run a grammar checker directly on these files: it cannot tell the difference between markup and text. The other option is to remove all this markup, leaving only the "clear" text; however, when a grammar tool points to a problem at a specific line in this clear text, it becomes hard to retrace that location in the original LaTeX file. ...
    Downloads: 73 This Week
  • 2
    jsoup

    Java library for working with real-world HTML

    jsoup is a Java library for working with real-world HTML. It provides a very convenient API for fetching URLs and extracting and manipulating data, using the best of HTML5 DOM methods and CSS selectors. jsoup implements the WHATWG HTML5 specification, and parses HTML to the same DOM as modern browsers do. jsoup is designed to deal with all varieties of HTML found in the wild; from pristine and validating, to invalid tag-soup; jsoup will create a sensible parse tree. The parser will make...
    Downloads: 5 This Week
  • 3
    OpenDataLoader PDF

    PDF Parser for AI-ready data. Automate PDF accessibility

    ...The tool combines deterministic parsing methods with an optional hybrid AI-powered mode that improves extraction quality for difficult layouts such as multi-column documents, scanned files, and scientific papers. It includes built-in OCR capabilities supporting dozens of languages, making it suitable for digitizing low-quality or image-based PDFs. A key differentiator is its emphasis on accessibility automation, as it can generate tagged PDFs aligned with accessibility standards, significantly reducing manual remediation effort.
    Downloads: 2 This Week
  • 4

    xsd2pgschema

    Relational database replication tool based on XML Schema

    xsd2pgschema is a Java application suite that converts XML Schema 1.1 (hierarchical data model) to PostgreSQL DDL (relational data model) and supports XML data migration into PostgreSQL based on the XML Schema without loss of information content. It also supports full-text indexing via either Apache Lucene or Sphinx Search, using the relational data model. File conversion from XML to CSV, TSV, or JSON is also possible, as is mapping XML Schema to JSON Schema. Obtained PostgreSQL...
    Downloads: 9 This Week
  • 5

    EasyML

    EasyML serialization library, to and from XML, similar to Gson

    ...EasyML offers extensive support for JDK classes and also supports customization through user settings, user extensions, or through the Java Serialization API. EasyML provides support for:
    - reading and writing XML text and/or org.w3c.dom documents
    - Java Collections framework
    - Java Serialization framework
    - Multi-threading
    - Java Generics and functional APIs

    EasyML can be customized with user-defined serialization strategies. The low-level components, XMLWriter and XMLReader, can be used directly for finer control than the EasyML Facade offers. Security policies can also be defined, specifying black- or whitelists of types that are allowed at deserialization time. ...
    Downloads: 0 This Week
  • 6
    RTextDoc

    An editor for structured documents

    RTextDoc is an editor for structured text documents such as LaTeX, AsciiDoc, and DocBook. RTextDoc has proofreading capabilities: on-the-fly spelling, instant grammar checking, and built-in free dictionaries. It has syntax highlighting, bracket matching, folding, a document structure browser for sections and labels, bookmarks, a manager for LaTeX symbols, an editor for mathematical equations, an integrated BibTeX database manager, and several tools to convert LaTeX to HTML and back. ...
    Downloads: 1 This Week
  • 7
    Merge to PDF

    Simple portable Java app to merge single images and documents to PDF

    Portable Java app that merges individual PDF documents and JPG images into one PDF document. The standalone app includes its own Java. The application is in beta; please report bugs and issues on the Tickets page.
    Downloads: 0 This Week
  • 8
    Swagger2Markup

    Swagger to AsciiDoc or Markdown converter

    ...The output of Swagger2Markup can be used as an alternative to swagger-UI and can be served as static content. Swagger2Markup converts a Swagger JSON or YAML file into several AsciiDoc or GitHub Flavored Markdown documents which can be combined with hand-written documentation. The Swagger source file can be located locally or remotely via HTTP. Swagger2Markup supports the Swagger 1.2 and 2.0 specifications. Internally it uses the official swagger-parser and my markup-document-builder. You can use Swagger2Markup to convert your contract-first Swagger YAML file into a human-readable format.
    Downloads: 0 This Week
  • 9

    Json Node Tree Library

    Represents a JSON document as a node tree for transforms like XPath and XSLT.

    The Json Node Tree framework provides transformation operations for JSON documents, similar to XPath and XSLT for XML. The framework parses a textual JSON document into a tree of nodes, each representing a JSON entity: an object, an array, or a value of primitive type. The generated tree lets you modify node values and/or the structure of the JSON document. Representing a JSON object as a node tree makes it possible to implement operations such as path queries and transformations (like XPath and XSLT for XML).
    Downloads: 1 This Week
  • 10
    RedPen

    RedPen is an open source proofreading tool

    RedPen is an open source proofreading tool to check if your technical documents meet the writing standard. RedPen supports various markup text formats (Markdown, Textile, AsciiDoc, Re:VIEW, reStructuredText and LaTeX).
    Downloads: 0 This Week
  • 11
    DODDLE-OWL

    a Domain Ontology rapiD DeveLopment Environment – OWL extension

    DODDLE-OWL is a domain ontology development tool for the Semantic Web. DODDLE-OWL reuses existing ontologies and supports the semi-automatic construction of taxonomic and other relationships in domain ontologies from documents.
    Downloads: 0 This Week
  • 12
    MorganaXProc (Implements XProc 1.0)

    A Java implementation of XProc: An XML Pipeline Language

    This project is now abandoned in favour of MorganaXProc-III which implements XProc 3.0: https://sourceforge.net/projects/morganaxproc-iiise/ MorganaXProc is a full implementation of XProc: An XML Pipeline Language (W3C Recommendation 11 May 2010) written in Java, supporting all required and optional steps, the steps proposed on EXProc.org and also the document templating steps "p:in-scope-names" and "p:template". XPath 2.0 or XPath 3.0 is used as expression language. XSLT and XQuery...
    Downloads: 0 This Week
  • 13
    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest versions of iText build on the success of previous versions and feature an improved document engine, high and low-level programming capabilities, and a more efficient modular structure. iText represents the next level for developers looking to leverage PDF in document workflows. ...
    Downloads: 146 This Week
  • 14
    Blueprint Builder

    A markup language for designing programs

    Blueprint Builder is an integrated environment for developing blueprint documents. Blueprint is a technology for designing: 1. the structure of a program 2. the development process 3. business processes. The technology is compatible with the Rational Unified Process. Articles about Blueprint in English and Ukrainian cover the main aspects of using the Blueprint technology. Examples of its use include the blueprints of MultiText Editor, Blueprint Builder, and Neutrino (a Java library for text editing). The program costs $500. ...
    Downloads: 0 This Week
  • 15

    Digital Preservation Recorder

    NO LONGER MAINTAINED

    NO LONGER MAINTAINED, NO LONGER SUPPORTED. The Digital Preservation Recorder (DPR) was developed by the National Archives of Australia to manage a digital preservation workflow. It features antivirus integration and uses the Xena framework for preservation conversions of data objects.
    Downloads: 0 This Week
  • 16
    Zip64File is a Java library that implements the 64-bit extension of the ZIP standard from PKWARE (http://www.pkware.com/documents/casestudies/APPNOTE.TXT) using random access, allowing access to archives larger than 4 GB with more than 65 K files.
    Downloads: 4 This Week
  • 17
    Aperture is a Java framework for extracting and querying full-text content and metadata from various information systems (file systems, web sites, mail boxes, ...) and the file formats (documents, images, ...) occurring in these systems.
    Downloads: 0 This Week
  • 18
    XSLT stylesheets and related tools to convert documents from formatting objects (XSL FO) to ODF format. The resulting document can be opened in OpenOffice Writer.
    Downloads: 0 This Week
  • 19
    A Swiss Army Knife GUI application for PDF documents: combine, split, rotate, reorder (n-up, booklet), watermark, edit bookmarks/fileinfo/pagetransition, compress, encrypt, decrypt, sign, repair, edit attachments and more.
    Downloads: 87 This Week
  • 20
    Grotag
    Grotag views AmigaGuide documents or converts them to HTML and DocBook XML. It can also validate and pretty-print such documents.
    Downloads: 0 This Week
  • 21

    eXtensible Text Framework (XTF)

    Framework for search and display of heterogeneous document collections.

    ...Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
    Downloads: 1 This Week
  • 22
    The HandCoded Toolkit for FpML processing is a library of functions for manipulating FpML documents, implemented in both Java and C#.
    Downloads: 0 This Week
  • 23
    CosmoFile

    Convert your files, edit PDF files, edit images, download files

    ...CosmoFile is free software for converting your files to many different formats. CosmoFile is simple and fast and supports many formats: PDF, HTML, JPG, PNG, ICO, SVG, XLSX, PPTX... Edit PDF files with CosmoFile: Looking for free software to modify PDF documents? Sometimes you need to make minor changes to a PDF file. For instance, you may want to hide your personal phone number in a PDF file before uploading it online, or annotate a page with notes and freehand drawings. You can perform such edits easily with CosmoFile. Edit images with CosmoFile: CosmoFile offers easy and powerful tools for editing your photos. ...
    Downloads: 0 This Week
  • 24

    wikihtml

    Converts wikitext documents into HTML documents

    This project is an application that converts wikitext documents into HTML documents. Wiki markup or wikitext is a markup language to write documents in wiki-based systems, such as web sites powered by MediaWiki.
    Downloads: 0 This Week
  • 25
    NekoHTML is a simple HTML scanner and tag balancer that enables application programmers to parse HTML documents and access the information using standard XML interfaces.
    Downloads: 14 This Week