Showing 23 open source projects for "documents"

  • 1
    Nokogiri

    Tool to work with XML and HTML from Ruby

    Nokogiri (鋸) makes it easy and painless to work with XML and HTML from Ruby. It provides a sensible, easy-to-understand API for reading, writing, modifying, and querying documents. It is fast and standards-compliant by relying on native parsers like libxml2 (C) and xerces (Java). It aims to be secure by default by treating all documents as untrusted, and to be a thin-as-reasonable layer on top of the underlying parsers that does not attempt to fix behavioral differences between them. "Native gems" contain pre-compiled libraries for a specific machine architecture. ...
    Downloads: 2 This Week
  • 2
    Vanilla.PDF

    Cross-platform SDK for creating and modifying PDF documents

    Vanilla.PDF is a modern, high-performance, open-source C++17 SDK designed for creating, editing, signing, and analyzing PDF documents across multiple platforms. It requires no external runtime dependencies, making it lightweight and ideal for embedding into desktop applications, servers, or automation pipelines. The SDK offers full cross-platform support including Windows, Linux, macOS, and Android, with builds available for major compilers and architectures.
    Downloads: 0 This Week
  • 3
    Lexbor

    Lexbor is an open source HTML renderer library under development

    ...In our development process, we focus on the fastest parsing techniques for HTML, CSS, and fonts, the fastest data processing methods, and the fastest ways to serve content to end users. Whether you are building a backend that handles millions of HTML documents or a UI-heavy user app, your software’s response rate always matters to users and developers alike. Lexbor’s code is optimized for ease of access in end-user applications and across programming languages. You can effortlessly wrap all the capabilities of the library in, say, Python to power your end-user or backend applications in a new environment. ...
    Downloads: 1 This Week
  • 4

    vx - fast LaTeX editor

    Fast editor for TeX documents

    VX is a small, light, free text editor and yet another shell for the TeX programming language, based on Scintilla. "Visualizing" refers only to the document editing process, not to preview (as in WYSIWYG systems). The main feature of VX is support for accelerators (macros) for typing text. The usual text editing functions, such as autocompletion and block selection, are also supported.
    Downloads: 0 This Week
  • 5
    MultiMarkdown-6

    Lightweight markup processor to produce HTML, LaTeX, and more

    MultiMarkdown is a superset of the Markdown lightweight markup syntax with support for additional output formats and features. Markdown is a text-to-HTML conversion tool for web writers. Markdown allows you to write using an easy-to-read, easy-to-write plain text format, then convert it to structurally valid XHTML (or HTML).
    Downloads: 0 This Week
  • 6
    Tidy

    The granddaddy of HTML tools, with support for modern standards

    ...It can tidy your documents by itself, and developers can easily integrate its features into even more powerful tools. Thanks to the efforts of HTACG and prominent contributors, HTML Tidy has a whole new heartbeat and a whole new life.
    Downloads: 18 This Week
  • 7
    Tidy

    The granddaddy of HTML tools, with support for modern standards

    ...Supports modern standards. Thanks to the efforts of HTACG and prominent contributors, HTML Tidy has a whole new heartbeat and a whole new life. Tidy tidies HTML and XML. It can tidy your documents by itself, and developers can easily integrate its features into even more powerful tools. Tidy is a console application for macOS, Linux, Windows, UNIX, and more. It corrects and cleans up HTML and XML documents by fixing markup errors and upgrading legacy code to modern standards. libtidy is a C static and dynamic library that developers can integrate into their applications in order to bring all of Tidy’s power to your favorite tools. libtidy is used today in desktop applications, web servers, and more. ...
    Downloads: 1 This Week
  • 8

    jpeg2pdf

    Create PDF from JPEG scans and photos

    Cross-platform command-line tool for creating PDF documents from scans or photos of pages in JPEG (.jpg) format, plus a very lightweight ANSI C library for putting multiple JPEG files into one PDF file. You can add handwritten comments to PDF scans (over the original images) with xournal (http://xournal.sourceforge.net/), which supports graphics tablets and saves comments to PDFs as vector data.
    Downloads: 21 This Week
  • 9

    ftdetector

    File type detector library

    ...It uses hash tables to make the detection of a file type as fast as possible. The signature and MIME type lists are stored in simple, user-friendly files. This file type detector supports many formats (images, archives, text, documents, audio, video, fonts, and others). It also covers Microsoft OLE compound file types. The detector's algorithm has special features for detecting text file types (HTML, XML, JSON, etc.) and the Unicode charsets of text files by using their BOMs (byte order marks). Its small, simple interface can be used on the most popular platforms.
    Downloads: 0 This Week
  • 10

    Z Notation E-Mail Mark-up Tools

    Tools to convert Z mark-up to HTML or text.

    A small library and two command-line tools to parse and convert Z notation from the "e-mail" mark-up into HTML code, into UTF-8 text with box-drawing graphics, or into the Z Standard text format. See the project's Wiki home page for details.
    Downloads: 0 This Week
  • 11
    ...Right now, the Osxcart library includes a module for reading and writing property lists (".plist" files) in XML format, and a module for importing and exporting RTF documents to and from GtkTextBuffers.
    Downloads: 0 This Week
  • 12
    Files Hunter

    Analyze files to get their real format. Retrieve corrupted ones.

    Analyze files and guess their true content's format. Extract hidden files from corrupted ones. Easily extensible by adding new plug-ins for new formats. Handles documents, videos, images, music, executables...
    Downloads: 0 This Week
  • 13
    FleXML reads a DTD describing the format of XML documents; it may be specified as a URI to the DTD on the web. From this FleXML produces a validating XML processor with an interface to support XML applications.
    Downloads: 0 This Week
  • 14
    gPDFText is a text editor for GTK+ that opens PDF documents for ebook readers, converts the text contents into plain ASCII text, restores the original paragraphs and removes unwanted line breaks to allow easier zooming on the reader.
    Downloads: 0 This Week
  • 15

    HXPath

    XPath HTML parser

    HXPath is a command-line tool for extracting data from HTML documents. HXPath can select subtrees, like the standard xpath tool, but can also read contents and attributes and output them in a bash-friendly format. HTML Tidy and HTTP/HTTPS GET are built in too.
    Downloads: 0 This Week
  • 16
    XPN relies on a non-relational native XML database, where XML documents are stored in compressed form and indices enable fast access to structure and content, thus enabling fast evaluation of XQuery queries.
    Downloads: 0 This Week
  • 17
    GnoTeX is a framework for writing LaTeX documents. You can simply create new LaTeX documents and edit existing ones, with or without knowing LaTeX. It is based on gtk2 and some gnome2 libraries, and lets you edit either the source or the content only.
    Downloads: 0 This Week
  • 18
    PK7
    Utility to validate, extract, show and create digital documents included in files with "pk7" and "fp7" extensions such as electronic bills provided by some phone companies in Spain like Movistar and Telefonica.
    Downloads: 5 This Week
  • 19
    Excalibur is a TeX/LaTeX aware spelling checker. It will also spell check plain text documents.
    Downloads: 0 This Week
  • 20
    A tool for putting HTML/XHTML documents into Lotus Notes/Domino R5, R6, or R7 databases. The HTML code is aggregated in a defined field of the document, including the resource files. Currently only the Polish language is supported.
    Downloads: 0 This Week
  • 21
    Irudiko is a library written in C++ for generating Locality Sensitive Hashing sketches from any textual or web document. Designed mainly to work with HTML pages, it also has optimization support for English or Italian documents.
    Downloads: 0 This Week
  • 22
    PdfRipImage is a program to automatically extract images from PDF documents and convert them to a format of your choice (such as JPEG or TIFF). It runs on UNIX-like platforms and requires utilities from netpbm and xpdf.
    Downloads: 0 This Week
  • 23
    MyLib is a smart desktop assistant to manage PDF/PPT/PS documents. These types of documents are frequently used by academic & engineering communities.
    Downloads: 0 This Week