Showing 117 open source projects for "documents"

  • 1
    TeXtidote

    Spelling, grammar and style checking on LaTeX documents

    If so, you probably know that the process is far from simple. Since LaTeX documents contain special commands and keywords (the so-called "markup") that are not part of the "real" text, you cannot run a grammar checker directly on these files: it cannot tell the difference between markup and text. The other option is to remove all this markup, leaving only the "clear" text; however, when a grammar tool points to a problem at a specific line in this clear text, it becomes hard to retrace that location in the original LaTeX file. ...
    Downloads: 77 This Week
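The location problem described in the TeXtidote entry can be illustrated with a minimal sketch. This is not TeXtidote's implementation; `strip_markup`, the `MARKUP` pattern, and the sample input are hypothetical names chosen for illustration, and the pattern only recognizes comments, command names, and braces (it would mishandle an escaped `\%`, for example). The idea is to record, for every character kept in the clear text, where it came from in the original file.

```python
import re

# Hypothetical, deliberately tiny notion of "markup": % comments,
# command names such as \emph, and braces. Real LaTeX needs far more.
MARKUP = re.compile(r"%.*|\\[A-Za-z]+\*?|[{}]")


def strip_markup(source):
    """Return (clean_text, positions).

    positions[i] is the 1-based (line, column) in the original source
    of the character clean_text[i].
    """
    clean = []
    positions = []
    for line_no, line in enumerate(source.split("\n"), start=1):
        kept = [True] * len(line)
        for match in MARKUP.finditer(line):
            for i in range(match.start(), match.end()):
                kept[i] = False
        for i, ch in enumerate(line):
            if kept[i]:
                clean.append(ch)
                positions.append((line_no, i + 1))
        # Keep the line break so a checker still sees separate lines.
        clean.append("\n")
        positions.append((line_no, len(line) + 1))
    return "".join(clean), positions


source = "\\section{Intro}\nThis is \\emph{an} test."
clean, positions = strip_markup(source)
hit = clean.index("an test")   # where a checker might flag "an test"
print(positions[hit])          # (2, 15): line 2, column 15 of the LaTeX source
```

With such a map, a problem reported at an offset in the clear text can be translated back to a line and column in the LaTeX file.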
  • 2
    jsoup

    Java library for working with real-world HTML

    jsoup is a Java library for working with real-world HTML. It provides a very convenient API for fetching URLs and extracting and manipulating data, using the best of HTML5 DOM methods and CSS selectors. jsoup implements the WHATWG HTML5 specification, and parses HTML to the same DOM as modern browsers do. jsoup is designed to deal with all varieties of HTML found in the wild; from pristine and validating, to invalid tag-soup; jsoup will create a sensible parse tree. The parser will make...
    Downloads: 5 This Week
  • 3
    OpenDataLoader PDF

    PDF Parser for AI-ready data. Automate PDF accessibility

    ...The tool combines deterministic parsing methods with an optional hybrid AI-powered mode that improves extraction quality for difficult layouts such as multi-column documents, scanned files, and scientific papers. It includes built-in OCR capabilities supporting dozens of languages, making it suitable for digitizing low-quality or image-based PDFs. A key differentiator is its emphasis on accessibility automation, as it can generate tagged PDFs aligned with accessibility standards, significantly reducing manual remediation effort.
    Downloads: 2 This Week
  • 4

    xsd2pgschema

    Relational database replication tool based on XML Schema

    xsd2pgschema is a Java application suite, which converts XML Schema 1.1 (hierarchical data model) to PostgreSQL DDL (relational data model) and supports XML data migration into PostgreSQL based on the XML Schema without defects on information content. It also supports full-text indexing via either Apache Lucene or Sphinx Search utilizing the relational data model. File conversion from XML to CSV, TSV, or JSON is possible as well as mapping XML Schema to JSON Schema. Obtained PostgreSQL...
    Downloads: 8 This Week
  • 5

    EasyML

    EasyML serialization library, to and from XML, similar to Gson

    ...EasyML offers extensive support for JDK classes and also supports customization through user settings, user extensions, or through the Java Serialization API. EasyML provides support for: - reading and writing XML text and/or org.w3c.dom documents - Java Collections framework - Java Serialization framework - Multi-threading - Java Generics and functional APIs EasyML can be customized with user-defined serialization strategies. The low-level components, XMLWriter and XMLReader, can be used directly, for a higher control compared to the EasyML Facade. Security policies can also be defined, specifying black- or whitelists of types which are allowed at deserialization time. ...
    Downloads: 0 This Week
  • 6
    RTextDoc

    An editor for structured documents

    RTextDoc is an editor for structured text documents such as LaTeX, AsciiDoc, and DocBook. RTextDoc has proofreading capabilities: on-the-fly spelling, instant grammar checking, and built-in free dictionaries. RTextDoc has syntax highlighting, bracket matching, folding, a document structure browser for sections and labels, bookmarks, a manager for LaTeX symbols, an editor for mathematical equations, an integrated BibTeX database manager, and several tools to convert LaTeX to HTML and back. ...
    Downloads: 4 This Week
  • 7
    Merge to PDF

    Simple portable Java app to merge single images and documents to PDF

    Portable Java app to merge single PDF documents and JPG images into one PDF document. The standalone app includes its own Java. * The application is in beta; please report bugs and issues on the Tickets page. *
    Downloads: 0 This Week
  • 8
    Swagger2Markup

    Swagger to AsciiDoc or Markdown converter

    ...The output of Swagger2Markup can be used as an alternative to swagger-UI and can be served as static content. Swagger2Markup converts a Swagger JSON or YAML file into several AsciiDoc or GitHub Flavored Markdown documents which can be combined with hand-written documentation. The Swagger source file can be located locally or remotely via HTTP. Swagger2Markup supports the Swagger 1.2 and 2.0 specifications. Internally it uses the official swagger-parser and my markup-document-builder. You can use Swagger2Markup to convert your contract-first Swagger YAML file into a human-readable format.
    Downloads: 0 This Week
  • 9

    Json Node Tree Library

    Represents a JSON document as a node tree for transforms like XPath and XSLT.

    The Json Node Tree framework provides transformation operations for JSON documents, similar to XPath and XSLT for XML. The framework parses a textual JSON document into a tree of nodes, each representing a JSON entity: an object, an array, or a value of primitive type. The generated tree lets you modify node values and the structure of the JSON document. Representing a JSON object as a node tree makes it possible to implement operations such as path queries and transformations (like XPath and XSLT for XML).
    Downloads: 1 This Week
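The node-tree idea in the Json Node Tree entry can be sketched in a few lines. This is not the library's API; the `Node` class, its `select` and `to_python` methods, and the slash-separated path syntax are all hypothetical, chosen only to show how a tree of nodes supports a path query followed by an in-place change.

```python
import json


class Node:
    """One JSON entity: an object, an array, or a primitive value."""

    def __init__(self, value, parent=None, key=None):
        self.parent = parent
        self.key = key
        if isinstance(value, dict):
            self.kind = "object"
            self.children = {k: Node(v, self, k) for k, v in value.items()}
        elif isinstance(value, list):
            self.kind = "array"
            self.children = [Node(v, self, i) for i, v in enumerate(value)]
        else:
            self.kind = "value"
            self.value = value

    def select(self, path):
        """Follow a slash-separated path such as 'books/1/title' (assumed syntax)."""
        node = self
        for step in filter(None, path.split("/")):
            if node.kind == "array":
                node = node.children[int(step)]
            elif node.kind == "object":
                node = node.children[step]
            else:
                raise KeyError(step)  # cannot descend into a primitive
        return node

    def to_python(self):
        """Convert the (possibly modified) tree back to plain Python data."""
        if self.kind == "object":
            return {k: c.to_python() for k, c in self.children.items()}
        if self.kind == "array":
            return [c.to_python() for c in self.children]
        return self.value


root = Node(json.loads('{"books": [{"title": "A"}, {"title": "B"}]}'))
root.select("books/1/title").value = "C"   # path query, then modify the node
print(json.dumps(root.to_python()))
```

Keeping `parent` and `key` on each node is one possible design choice; it would let a transformation move or delete a node without re-walking the tree from the root.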
  • 10
    RedPen

    RedPen is an open source proofreading tool

    RedPen is an open source proofreading tool to check if your technical documents meet the writing standard. RedPen supports various markup text formats (Markdown, Textile, AsciiDoc, Re:VIEW, reStructuredText and LaTeX).
    Downloads: 0 This Week
  • 11
    DODDLE-OWL

    a Domain Ontology rapiD DeveLopment Environment – OWL extension

    DODDLE-OWL is a domain ontology development tool for the Semantic Web. DODDLE-OWL makes reuse of existing ontologies and supports the semi-automatic construction of taxonomic and other relationships in domain ontologies from documents.
    Downloads: 0 This Week
  • 12
    MorganaXProc (Implements XProc 1.0)

    A Java implementation of XProc: An XML Pipeline Language

    This project is now abandoned in favour of MorganaXProc-III which implements XProc 3.0: https://sourceforge.net/projects/morganaxproc-iiise/ MorganaXProc is a full implementation of XProc: An XML Pipeline Language (W3C Recommendation 11 May 2010) written in Java, supporting all required and optional steps, the steps proposed on EXProc.org and also the document templating steps "p:in-scope-names" and "p:template". XPath 2.0 or XPath 3.0 is used as expression language. XSLT and XQuery...
    Downloads: 0 This Week
  • 13
    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest versions of iText build on the success of previous versions and feature an improved document engine, high and low-level programming capabilities, and a more efficient modular structure. iText represents the next level for developers looking to leverage PDF in document workflows. ...
    Downloads: 138 This Week
  • 14
    Blueprint Builder

    A markup language for designing programs

    Blueprint Builder is an integrated environment for developing blueprint documents. Blueprint is a technology for designing: 1. the structure of a program 2. the development process 3. business processes. The technology is compatible with the Rational Unified Process. Articles about Blueprint in English and Ukrainian cover the main points of using the technology. Examples of its use include the blueprints of MultiText Editor, Blueprint Builder, and Neutrino (a Java library for text editing). The program costs $500. ...
    Downloads: 0 This Week
  • 15

    Digital Preservation Recorder

    NO LONGER MAINTAINED

    NO LONGER MAINTAINED, NO LONGER SUPPORTED The Digital Preservation Recorder (DPR) has been developed by the National Archives of Australia to manage a digital preservation workflow. It features antivirus integration and makes use of the Xena framework for preservation conversions of data objects
    Downloads: 0 This Week
  • 16
    NEXUSe2e Business Messaging Server
    NEXUSe2e server provides XML based messaging using e.g. ebXML to integrate business processes that span multiple companies. Its main use is for supply chain integration, e.g. for exchanging orders and other business documents in electronic form.
    Downloads: 0 This Week
  • 17
    Zip64File is a Java library which implements the 64-bit extension of the ZIP standard from PKWARE (http://www.pkware.com/documents/casestudies/APPNOTE.TXT) using random access, allowing access to archives larger than 4 GB with more than 65 K files.
    Downloads: 4 This Week
  • 18

    Service Grid - Language Grid Base System

    SOA infrastructure initially developed by NICT Language Grid Project

    ...Resources with complicated intellectual property issues are wrapped as Web services and shared on the Service Grid. If you release your software by using the software of this project, please include the following description in the documents or on the website. * This software uses the [SOFTWARE] by the Language Grid project (http://langrid.org/). [SOFTWARE] is one of: * Service Grid Server Software (http://langrid.org/oss-project/en/service_grid.html) * Language Service Development Libraries (http://langrid.org/oss-project/en/language_service.html) * Language Grid Toolbox (http://langrid.org/oss-project/en/toolbox.html) If you publish a paper by using the software of this project, please cite the following book...
    Downloads: 0 This Week
  • 19
    Aperture is a Java framework for extracting and querying full-text content and metadata from various information systems (file systems, web sites, mail boxes, ...) and the file formats (documents, images, ...) occurring in these systems.
    Downloads: 0 This Week
  • 20
    XSLT stylesheets and related tools to convert documents from formatting objects (XSL FO) to ODF format. The resulting document can be opened in OpenOffice Writer.
    Downloads: 0 This Week
  • 21
    A Swiss Army Knife GUI application for PDF documents: combine, split, rotate, reorder (n-up, booklet), watermark, edit bookmarks/fileinfo/pagetransition, compress, encrypt, decrypt, sign, repair, edit attachments and more.
    Downloads: 81 This Week
  • 22
    Grotag
    Grotag views Amigaguide documents or converts them to HTML and DocBook XML. Additionally it can validate and pretty print such documents.
    Downloads: 0 This Week
  • 23

    eXtensible Text Framework (XTF)

    Framework for search and display of heterogeneous document collections.

    ...Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
    Downloads: 1 This Week
  • 24
    PortaFIB

    Signature folder of the Balearic Islands

    == MOVED TO GITHUB: https://github.com/GovernIB/portafib == PortaFIB is a "digital" signature folder developed in the Electronic Administration department of Fundació Bit, commissioned by the Government of the Balearic Islands. Its main features are: • Signs PDF documents • Allows files attached to the main document • Support for job positions • Management of signature flows and signature flow templates • Signature requests via the web or web services, that is, through a person user or an application user • Multi-entity (a single server can serve several entities, administrations, companies, ...
    Downloads: 0 This Week
  • 25
    The HandCoded Toolkit for FpML processing is a library supporting functions for manipulating FpML documents implemented both in Java and C#.
    Downloads: 0 This Week