Data Formats Software

Browse free open source Data Formats software and projects below.

  • 1
    Skim

    A PDF Reader and Note-taker for OS X

    Skim is a PDF reader and note-taker for OS X. It is designed to help you read and annotate scientific papers in PDF, but is also great for viewing any PDF file. Skim requires OS X 10.10 or higher.
    Downloads: 4,552 This Week
  • 2
    BibDesk

    Bibliography manager for Mac OS X

    BibDesk is a graphical BibTeX bibliography manager for Mac OS X.
    Downloads: 3,099 This Week
  • 3
    Gwyddion

    Scanning probe microscopy data visualisation and analysis

    A data visualization and processing tool for scanning probe microscopy (SPM, i.e. AFM, STM, MFM, SNOM/NSOM, ...) and profilometry data, useful also for general image and 2D data analysis.
    Downloads: 1,632 This Week
  • 4
    Pandoc

    The universal markup converter

    Pandoc is a universal document converter that converts files from one markup format into another. It is a Swiss Army knife of a converter, able to convert practically any markup format into any other. Pandoc consists of a Haskell library for conversions and a command-line tool that uses this library. It can convert to and from a wide range of formats: lightweight markup formats, HTML formats, documentation formats, ebooks, TeX formats, word processor formats, and more. It understands several useful Markdown syntax extensions, such as document metadata, footnotes, and tables. If you want strict Markdown compatibility, these extensions can be turned off. Pandoc is powerful and customizable, but its intermediate representation of a document is less expressive than many of the formats it converts between, so conversions may not always be perfect.
    Downloads: 269 This Week
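
As a rough illustration of the command-line tool described above, the sketch below builds a Pandoc invocation from Python. The helper names `pandoc_command` and `convert` and the file names are hypothetical; `--from`, `--output`, and the `markdown_strict` reader are standard Pandoc options. It assumes a `pandoc` executable is available on the PATH.

```python
import shutil
import subprocess


def pandoc_command(source, target, strict_markdown=False):
    """Build the argument list for a Markdown conversion (hypothetical helper).

    With strict_markdown=True the 'markdown_strict' reader is used, which
    turns off Pandoc's Markdown extensions such as footnotes and tables.
    Pandoc infers the output format from the extension of `target`.
    """
    reader = "markdown_strict" if strict_markdown else "markdown"
    return ["pandoc", "--from", reader, "--output", target, source]


def convert(source, target, strict_markdown=False):
    """Run the conversion; assumes pandoc is installed and on the PATH."""
    if shutil.which("pandoc") is None:
        raise RuntimeError("pandoc executable not found on PATH")
    subprocess.run(pandoc_command(source, target, strict_markdown), check=True)
```

Calling `convert("notes.md", "notes.html")` would write an HTML file next to a hypothetical `notes.md`; passing `strict_markdown=True` requests strict Markdown parsing instead.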
  • 5
    QPDF

    PDF transformation/manipulation program + library

    QPDF is a C++ library and set of programs that inspect and manipulate the structure of PDF files. It can encrypt and linearize files, expose the internals of a PDF file, and do many other operations useful to end users and PDF developers.
    Downloads: 1,043 This Week
  • 6

    Zefania XML Bible Markup Language

    Zefania XML bible modules

    Zefania XML bible modules can be viewed with Simple Bible Reader (see files). Zefania XML language documentation can be found here: https://bgfdb.de/zefaniaxml/bml/
    Downloads: 820 This Week
  • 7
    NAPS2 - Not Another PDF Scanner

    Scan documents to PDF and other file types, as simply as possible.

    Visit NAPS2's home page at www.naps2.com. NAPS2 is a document scanning application with a focus on simplicity and ease of use. Scan your documents from WIA- and TWAIN-compatible scanners, organize the pages as you like, and save them as PDF, TIFF, JPEG, PNG, and other file formats. Available on Windows, Mac, and Linux. NAPS2 is currently available in over 40 different languages. Want to see NAPS2 in your preferred language? Help translate! See the wiki for more details.
    Downloads: 683 This Week
  • 8
    lxml

    The lxml XML toolkit for Python

    A Python library for efficient XML and HTML processing, known for speed and compatibility. The lxml XML toolkit is a Pythonic binding for the C libraries libxml2 and libxslt. It is unique in that it combines the speed and XML feature completeness of these libraries with the simplicity of a native Python API that is mostly compatible with, but superior to, the well-known ElementTree API. The latest release works with all CPython versions from 3.6 to 3.12. See the introduction for more information about the background and goals of the lxml project.
    Downloads: 160 This Week
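
The ElementTree compatibility mentioned above can be sketched as follows. This is a minimal example, assuming lxml may or may not be installed: it falls back to the standard library's ElementTree, and the same calls work with either module. The function name `titles` and the sample document are hypothetical.

```python
try:
    from lxml import etree  # used when lxml is installed
except ImportError:
    import xml.etree.ElementTree as etree  # standard-library fallback


def titles(xml_text):
    """Return the text of every <title> element, in document order."""
    root = etree.fromstring(xml_text)
    # fromstring() and iter() exist in both lxml.etree and ElementTree.
    return [element.text for element in root.iter("title")]


sample = "<lib><book><title>A</title></book><book><title>B</title></book></lib>"
print(titles(sample))
```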
  • 9
    XML Copy Editor
    XML Copy Editor is a fast, free, validating XML editor.
    Downloads: 889 This Week
  • 10
    Google Photos Takeout Helper

    Script that organizes the Google Takeout archive

    If you ever want to move from Google Photos to another platform/solution, your fastest choice to export all photos is Google Takeout. But when you download it, you will find yourself with zips with hundreds of little folders with weird .json files inside. What if you want to just have one folder with all photos, in chronological order? Good luck copying all of that.
    Downloads: 148 This Week
  • 11
    TinyTeX releases

    Windows/macOS/Linux binaries and installation methods of TinyTeX

    TinyTeX is a lightweight, cross-platform, portable, and easy-to-maintain LaTeX distribution based on TeX Live. It is small in size but functions well in most cases, especially for R users. If you run into the problem of missing LaTeX packages, it should be clear what you need to do (in fact, R users won’t need to do anything). You install only the LaTeX packages you actually need. Currently, TinyTeX works best for R users. Other users can use it too, but missing LaTeX packages won’t be installed automatically, so you need to install them manually. Alternatively, you can go to the extreme and install all packages (see FAQ 3 for how), but remember there are thousands of them. Installing or running TinyTeX does not require admin privileges, which means you no longer need sudo or your IT department. You can even run TinyTeX from a flash drive.
    Downloads: 129 This Week
  • 12

    Expat XML Parser

    Fast XML parser library in C

    PLEASE NOTE that we are in the process of moving to GitHub (https://github.com/libexpat/libexpat). This is James Clark's Expat XML parser library in C. It is a stream-oriented parser that requires setting handlers to deal with the structure that the parser discovers in the document.
    Downloads: 549 This Week
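
The handler-based, stream-oriented style described above can be tried from Python, whose standard library ships an Expat binding as `xml.parsers.expat`. The sketch below is only an illustration of the handler model, not the C API itself; the function name `collect_events` and the sample document are hypothetical.

```python
import xml.parsers.expat


def collect_events(xml_text):
    """Parse xml_text and record each structure the parser reports."""
    events = []

    def on_start(name, attrs):
        events.append(("start", name, attrs))

    def on_end(name):
        events.append(("end", name))

    def on_text(data):
        # Skip whitespace-only text between elements.
        if data.strip():
            events.append(("text", data))

    parser = xml.parsers.expat.ParserCreate()
    parser.buffer_text = True  # deliver contiguous text in one callback
    parser.StartElementHandler = on_start
    parser.EndElementHandler = on_end
    parser.CharacterDataHandler = on_text
    parser.Parse(xml_text, True)  # True marks the final chunk of input
    return events


for event in collect_events('<book id="1"><title>Expat</title></book>'):
    print(event)
```

The parser never builds a tree: the application sees the document only through the callbacks it registered, which is what keeps memory use low on large inputs.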
  • 13
    pdfcrack is a command-line password recovery tool for PDF files.
    Downloads: 597 This Week
  • 14
    OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files

    OCRmyPDF adds an optical character recognition (OCR) text layer to scanned PDF files, allowing them to be searched. PDF is the best format for storing and exchanging scanned documents. Unfortunately, PDFs can be difficult to modify. OCRmyPDF makes it easy to apply image processing and OCR (recognized, searchable text) to existing PDFs.
    Downloads: 116 This Week
  • 15
    Nativefier

    Make any web page a desktop application

    Tired of having to search through numerous open tabs on your browser just to view a web page that you regularly view? Make things easier and more convenient for yourself with Nativefier! Nativefier is a command-line tool that easily creates a desktop app for any web site with minimal configuration. It creates an Electron (which uses Chromium under the hood) based native wrapper in an OS executable (.app, .exe, etc) for use on Windows, macOS and Linux. Using Nativefier is absurdly simple. Just type in: nativefier "web site address" to create a native desktop app for your chosen web site, and Nativefier does the rest. It will automatically retrieve the app icon and name, which you can overwrite.
    Downloads: 113 This Week
  • 16
    V2rayU

    A tool to manage v2ray config json

    A multi-user management script for V2ray with wizard-style management (new, delete, modify) of transmission protocols. Quickly view server connection information and modify the general configuration. Freely change the transmission configuration. An upgrade command keeps the configuration file; if the upgrade fails, perform a complete reinstall. Calls the official v2ray API for traffic statistics. Supports multi-user, multi-port, and mixed transmission protocol management. A random port is generated during the first installation; the default configuration is mkcp with one random header camouflage, and the configuration information is displayed after the installation completes. Once the script has run, you can use it with confidence. The v2ray core is automatically upgraded and restarted at 3 am Beijing time every day to reduce the chance of v2ray being killed due to low memory.
    Downloads: 110 This Week
  • 17
    Dompdf

    HTML to PDF converter for PHP

    dompdf is an HTML to PDF converter. At its heart, dompdf is (mostly) a CSS 2.1 compliant HTML layout and rendering engine written in PHP. It is a style-driven renderer: it will download and read external stylesheets, inline style tags, and the style attributes of individual HTML elements. It also supports most presentational HTML attributes. PDF rendering is currently provided either by PDFLib or by a bundled version of the R&OS CPDF class written by Wayne Munro (some important changes have been made to the R&OS class, however). In order to use PDFLib with dompdf, the PDFLib PECL extension is required. Using PDFLib improves performance and somewhat reduces the memory requirements of dompdf, while the R&OS CPDF class, though slightly slower, eliminates any dependencies on external PDF libraries.
    Downloads: 102 This Week
  • 18
    gSOAP Toolkit

    Development toolkit for Web Services and XML data bindings for C & C++

    The gSOAP toolkit is an extensive suite of portable C and C++ software to develop XML Web services with powerful type-safe XML data bindings. Easy-to-use code-generator tools allow you to directly integrate XML data in C and C++ and to serialize native application data in XML. It includes WSDL/XSD schema binding and auto-coding tools, a stub/skeleton compiler, Web server integration with an Apache module and IIS extension, high-performance XML processing with schema validation, fast MIME/MTOM streaming, SOAP and REST Web API development, WS-* protocols (WS-Security, WS-Policy, WS-ReliableMessaging, etc.), XML-RPC, and JSON. Licensed under GPLv2. Visit the developer center with guides, tutorials, and examples at www.genivia.com/dev.html. The documentation of the toolkit, libraries, plugins, and more is available at www.genivia.com/docs.html. Commercial-use licenses and expert technical support services are available at competitive pricing levels; please visit www.genivia.com/products.html.
    Downloads: 451 This Week
  • 19
    TeXtidote

    Spelling, grammar and style checking on LaTeX documents

    Running a grammar checker on LaTeX files is far from simple. Since LaTeX documents contain special commands and keywords (the so-called "markup") that are not part of the "real" text, you cannot run a grammar checker directly on these files: it cannot tell the difference between markup and text. The other option is to remove all this markup, leaving only the "clear" text; however, when a grammar tool points to a problem at a specific line in this clear text, it becomes hard to retrace that location in the original LaTeX file. TeXtidote solves this problem: it can read your original LaTeX file and perform various sanity checks on it, for example making sure that every figure is referenced in the text or enforcing the correct capitalization of titles. In addition, TeXtidote can remove markup from the file and send it to the LanguageTool library, which verifies both spelling and grammar in a dozen languages.
    Downloads: 96 This Week
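
The retracing problem described above can be illustrated with a toy sketch. This is not TeXtidote's actual implementation: it strips a small, assumed subset of LaTeX markup with regular expressions and keeps a map from each line of clear text back to its line in the original file. The names `strip_markup`, `STRUCTURAL`, and `COMMAND` are hypothetical, and real LaTeX needs a far more careful parser.

```python
import re

# Assumption: these commands are dropped together with their argument.
STRUCTURAL = re.compile(
    r"\\(?:documentclass|usepackage|begin|end|label|ref|cite)"
    r"(?:\[[^\]]*\])?\{[^}]*\}"
)
# Any other command name (e.g. \textbf) is removed, keeping its argument text.
COMMAND = re.compile(r"\\[a-zA-Z]+\*?(?:\[[^\]]*\])?")


def strip_markup(latex_source):
    """Return (clear_lines, line_map).

    line_map[i] is the 1-based line number in the original source that
    produced clear_lines[i], so a finding in the clear text can be traced back.
    """
    clear_lines, line_map = [], []
    for number, line in enumerate(latex_source.splitlines(), start=1):
        text = line.split("%", 1)[0]  # drop comments (ignores escaped \%)
        text = STRUCTURAL.sub("", text)
        text = COMMAND.sub("", text)
        text = text.replace("{", "").replace("}", "").strip()
        if text:
            clear_lines.append(text)
            line_map.append(number)
    return clear_lines, line_map


source = "\n".join([
    r"\documentclass{article}",
    r"\begin{document}",
    "% a comment",
    r"This is \textbf{bold} text.",
    r"\end{document}",
])
clear, mapping = strip_markup(source)
print(clear, mapping)
```

A checker that flags line 1 of the clear text can then report the problem at `mapping[0]` in the original file.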
  • 20
    KOReader

    An ebook reader application supporting PDF, DjVu, EPUB, FB2, etc.

    KOReader is a document viewer for E Ink devices. Supported file formats include EPUB, PDF, DjVu, XPS, CBT, CBZ, FB2, PDB, TXT, HTML, RTF, CHM, DOC, MOBI and ZIP files. It runs on embedded devices (Cervantes, Kindle, Kobo, PocketBook, reMarkable), Android, and Linux computers. Developers can run a KOReader emulator on Linux and macOS. It offers a multi-lingual user interface with a highly customizable reader view and many typesetting options. You can set arbitrary page margins, override line spacing, and choose external fonts and styles. It has multi-lingual hyphenation dictionaries bundled into the application. The custom UI has no animation and provides paginated menus, adjustable text contrast, and easy zoom to fit content or page in paged media. Look up words with StarDict dictionaries or Wikipedia, add your own online OPDS catalogs and RSS feeds, and share ebooks with other KOReader devices wirelessly.
    Downloads: 95 This Week
  • 21
    wkhtmltopdf

    Convert HTML to PDF using Webkit (QtWebKit)

    wkhtmltopdf and wkhtmltoimage are open source (LGPLv3) command line tools to render HTML into PDF and various image formats using the Qt WebKit rendering engine. These run entirely "headless" and do not require a display or display service. There is also a C library, if you're into that kind of thing. The file pdf.h contains a fairly high-level and stable pure C binding to wkhtmltopdf. These bindings are well documented and do not depend on Qt. Using them is the recommended way of interfacing with the PDF portion of libwkhtmltox. The rest of the headers directly expose the C++ Qt-dependent classes used internally by wkhtmltopdf and wkhtmltoimage. wkhtmltopdf can put several objects into the output file; an object is either a single webpage, a cover webpage, or a table of contents. The objects are put into the output document in the order they are specified on the command line, and options can be specified on a per-object basis or in the global options area.
    Downloads: 93 This Week
  • 22
    TeXworks

    A simple interface for working with TeX documents

    TeXworks is a free and simple working environment for authoring TeX (LaTeX, ConTeXt and XeTeX) documents. Inspired by Dick Koch's award-winning TeXShop program for Mac OS X, it makes entry into the TeX world easier for those using desktop operating systems other than OS X. It provides an integrated, easy-to-use environment for users on other platforms, particularly GNU/Linux and Windows, and features a clean, simple interface accessible to casual and non-technical users.
    Downloads: 89 This Week
  • 23
    mPDF

    PHP library generating PDF files from UTF-8 encoded HTML

    mPDF is a PHP library that generates PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files ‘on-the-fly’ from his website, handling different languages. It is slower than the original scripts (e.g. HTML2FPDF) and produces larger files when using Unicode fonts, but support for CSS styles etc. has been much enhanced. It supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese-Japanese-Korean), and nested block-level elements (e.g. P, DIV) with margins, borders, padding, line height, background colors, etc. Right-to-left languages are supported with automatic detection of RTL characters within a document. mPDF transposes tables, lists, text justification, and table cell alignment, and fully reverses text for RTL characters. It automatically detects non-RTL characters and displays these in the original order.
    Downloads: 89 This Week
  • 24
    BentoPDF

    A Privacy First PDF Toolkit

    BentoPDF is a self-hosted, open-source PDF toolkit that provides a suite of local PDF manipulation features for users who want full control over their documents without relying on cloud PDF services. It offers functionality to merge, split, compress, rotate, and convert PDFs through an easy-to-deploy container or local installation, making it ideal for individuals and teams that handle large volumes of PDF files regularly. Because it runs entirely on your own system or server, BentoPDF helps ensure that sensitive documents never leave your environment — a critical consideration for privacy-conscious users or organizations with strict data policies. Its design aims to balance robust feature coverage with a simple setup process, typically involving a Docker image so you can start using the toolkit with minimal configuration. BentoPDF also supports automation via command line parameters or API calls, so it can be integrated into batch workflows or document pipelines.
    Downloads: 82 This Week
  • 25
    TeXnicCenter

    A feature-rich environment for writing LaTeX documents on Windows

    TeXnicCenter is a LaTeX editor on Windows. Navigating LaTeX documents is simple due to the automatically created document outline. Errors of the LaTeX compilation can be reviewed instantly. TXC features autocompletion and comes with LaTeX templates.
    Downloads: 360 This Week