Data Formats Software

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    Free editor for PDF documents. Complete editing of PDF documents is possible with PDFedit. You can change raw pdf objects (for advanced users) or use many gui functions. Functionality can be easily extended using a scripting language (ECMAScript).
    Leader badge
    Downloads: 171 This Week
    Last Update:
    See Project
  • 2
    Microsoft Works format import library
    libwps is a Microsoft Works file format import filter based on top of the librevenge (see https://sourceforge.net/p/libwpd/wiki/librevenge/ ). Currently, libwps can import all word processing Works formats since about 1995 with some success. It may also be able to import some basic database and spreadsheet files.
    Leader badge
    Downloads: 298 This Week
    Last Update:
    See Project
  • 3
    Cube

    Cube

    Universal semantic layer platform for AI, BI, spreadsheets

    Cube is the semantic layer for building data applications. It helps data engineers and application developers access data from modern data stores, organize it into consistent definitions, and deliver it to every application. Cube was designed to work with all SQL-enabled data sources, including cloud data warehouses like Snowflake or Google BigQuery, query engines like Presto or Amazon Athena, and application databases like Postgres. Cube has a built-in relational caching engine to provide sub-second latency and high concurrency for API requests.
    Downloads: 20 This Week
    Last Update:
    See Project
  • 4
    Nano PDF Editor

    Nano PDF Editor

    Edit PDF files with Nano Banana

    Nano PDF Editor is a minimalist, portable PDF viewer and toolkit that focuses on simplicity, speed, and ease of integration for applications that need basic PDF rendering without heavy dependencies. It provides core functionality such as page navigation, zooming, text selection, and rendering directly to native graphics surfaces, making it suitable for lightweight PDF viewing scenarios on desktop or embedded platforms. Designed to be easily embedded into larger software projects, Nano-PDF has a small code footprint and straightforward APIs that developers can call from common languages, helping it fit into text editors, document explorers, or custom user interfaces with minimal effort. The viewer strives to comply with standard PDF features, like annotations and linked bookmarks, while avoiding the complexity of full-featured document suites that prioritize editing or creation.
    Downloads: 20 This Week
    Last Update:
    See Project
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 5
    Unredact

    Unredact

    A simple tool for reading in poorly redacted documents

    Unredact is a specialized tool that attempts to reconstruct redacted or obscured text in images, PDFs, or screenshots using a combination of image processing and generative AI inference to suggest plausible completions of blurred, black-boxed, or jumbled content. Unlike traditional optical character recognition (OCR), which only reads visible text, Unredact focuses on inferring missing content where redaction has been applied by analyzing surrounding context, font characteristics, and linguistic patterns to produce candidate reconstructions. It accepts a variety of input formats, automatically identifies redacted regions, and then generates text suggestions that are presented alongside visual overlays so users can choose or refine outputs.
    Downloads: 20 This Week
    Last Update:
    See Project
  • 6
    Grassroots DICOM

    Grassroots DICOM

    Cross-platform DICOM implementation

    Grassroots DiCoM is a C++ library for DICOM medical files. It is accessible from Python, C#, Java and PHP. It supports RAW, JPEG, JPEG 2000, JPEG-LS, RLE and deflated transfer syntax. It comes with a super fast scanner implementation to quickly scan hundreds of DICOM files. It supports SCU network operations (C-ECHO, C-FIND, C-STORE, C-MOVE). PS 3.3 & 3.6 are distributed as XML files. It also provides PS 3.15 certificates and password based mecanism to anonymize and de-identify DICOM datasets.
    Leader badge
    Downloads: 101 This Week
    Last Update:
    See Project
  • 7

    QTester104

    IEC 60870-5-104 protocol tester.

    This software implements the IEC60870-5-104 protocol (client side) for substation data acquisition and control via tcp/ip network using the QT UI Framework. It can be compiled on Linux and Windows platforms. It's possible to poll and view data from the remote system (RTU) and also send commands. >>> Have a look also at my new SCADA project here: https://github.com/riclolsen/json-scada
    Leader badge
    Downloads: 108 This Week
    Last Update:
    See Project
  • 8

    Free VCF file to CSV or Excel converter

    Excel VBA Script to convert VCF file with multiple entries to CSV file

    This is an Excel based VBA script used to import bulk .VCF files that contain more than 1 Vcard and then convert them to a comma separated .CSV file or Excel xls file that can then be imported into Outlook, Google, or any other application that supports import of contacts using CSV files. This has been written to support VCF 2.0, 2.1, 3.0 and 4.0 formatted files including those with printable encoding (MIME) and has been tested with bulk VCF files from Backupify, Google Contacts (Gmail Contacts), Apple IOS, Samsung Android devices and other platforms. If this doesn't work for you, please email me the VCF file and I'll modify the code to support your specific VCF format. To use, simply download and open the Excel file (xlsm), Enable Macros if prompted, and then select the VCF file you wish to import. Once complete, Excel will create and save new file titled VCFExport.csv that can also be saved in Excel format.
    Downloads: 75 This Week
    Last Update:
    See Project
  • 9

    android-backup-processor

    Android adb backup extractor and packer

    Leader badge
    Downloads: 110 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    Kabeja is a java library for parsing DXF and converting to SVG (dxf2svg). The library supports the SAX-api and can integrated into other applications (Cocoon,Batik). Tools for converting svg to jpeg, tiff, png and pdf are included .
    Leader badge
    Downloads: 136 This Week
    Last Update:
    See Project
  • 11
    TikZ

    TikZ

    TikZ figures for concepts in physics/chemistry/ML

    Collection of 111 standalone TikZ figures for illustrating concepts in physics, chemistry, and machine learning. Check out janosh.github.io to search, sort, open in Overleaf, and download figures (PDF/SVG/PNG) from this collection.
    Downloads: 17 This Week
    Last Update:
    See Project
  • 12
    Twemoji

    Twemoji

    Simple library that provides standard Unicode emoji for all platforms

    Twitter’s open source emoji has you covered for all your project's emoji needs. With support for the latest Unicode emoji specification, featuring 3,245 emojis, and all for free. As an open source project, attribution is critical from a legal, practical and motivational perspective in our opinion. The graphics are licensed under the CC-BY 4.0 which has a pretty good guide on best practices for attribution. Although there are two kinds of parsing supported by this utility, we recommend you use DOM parsing, explained below. Each type of parsing accepts a callback to generate an image source or an options object with parsing info. The second kind of parsing is string parsing. This is unrecommended because this method does not sanitize the string or otherwise prevent malicious code from being executed; such sanitization is out of scope.
    Downloads: 17 This Week
    Last Update:
    See Project
  • 13
    pstoedit

    pstoedit

    converts PostScript or PDF files to other vector graphics formats

    pstoedit is a tool converting PostScript and PDF files into various other formats suported by different drawing editors. As a prerequesite it needs GhostScript to be installed (binary installation is sufficient).
    Downloads: 102 This Week
    Last Update:
    See Project
  • 14
    Beamer

    Beamer

    A LaTeX class for producing presentations and slides

    The beamer class is focused on producing (on-screen) presentations, along with support material such as handouts and speaker notes. Content is created in a frame environment, each of which can be divided up into a number of slides (actual output pages). A simple 'action' notation is used to specify which material appears on each slide within a frame. The class supports a range of environments to enable complex slide relationships to be constructed. The appearance of slides is controlled by a template system. Many of the elements of slides can be adjusted by setting simple key-based values in the preamble. More complex changes can be implemented by altering specific, targeted definitions without needing to rewrite entire blocks of code. This allows a variety of visual appearances to be selected for the same content source.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 15
    Memvid

    Memvid

    Video-based AI memory library. Store millions of text chunks in MP4

    Memvid encodes text chunks as QR codes within MP4 frames to build a portable “video memory” for AI systems. This innovative approach uses standard video containers and offers millisecond-level semantic search across large corpora with dramatically less storage than vector DBs. It's self-contained—no DB needed—and supports features like PDF indexing, chat integration, and cloud dashboards.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 16
    Pix2Text

    Pix2Text

    Open-Source Python3 tool for recognizing layouts, tables, and math

    An Open-Source Python3 tool for recognizing layouts, tables, math formulas, and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported. Pix2Text (P2T) aims to be a free and open-source Python alternative to Mathpix, and it can already accomplish Mathpix's core functionality. Pix2Text (P2T) can recognize layouts, tables, images, text, and mathematical formulas, and integrate all of these contents into Markdown format. P2T can also convert an entire PDF file (which can contain scanned images or any other format) into Markdown format.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 17
    Saw

    Saw

    Fast, multi-purpose tool for AWS CloudWatch Logs

    saw is a multi-purpose tool for AWS CloudWatch Logs.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 18
    Countries States Cities Database

    Countries States Cities Database

    Discover our global repository of countries, states, and cities

    Full Database of city state country available in JSON, SQL, XML, YAML & CSV format. All Countries, States and cities are Covered and populated with Different Combinations and versions. The API database is regularly updated to ensure the latest data is available to users. Discover our global repository of countries, states, and cities! Get comprehensive data in JSON, SQL, XML, YAML, and CSV formats. Access ISO2, ISO3 codes, country code, capital, native language, time zones (for countries), and more.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 19
    TeXShop

    TeXShop

    TeX previewer for Mac OS X

    TeXShop is a TeX previewer for Mac OS X, written in Cocoa. Since pdf is a native file format on OS X, TeXShop uses "pdftex" and "pdflatex" rather than "tex" and "latex" to typeset in its default configuration; these programs in the standard TeX Live distribution of TeX produce pdf output instead of dvi output. TeXShop uses TeX Live, a standard distribution of Tex programs maintained by the TeX Users Group (TUG) for Mac OS X, Windows, Linux, and various other Unix machines. The distribution includes tex, latex, dvips, tex fonts, cyrillic fonts, and virtually all other programs and supporting files commonly used in the TeX world. The most recent version of this distribution is maintained for the Mac by the MacTeX TeXnical Working Group of the TeX Users Group and available under the "Obtaining" tab.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 20
    Tidy

    Tidy

    The granddaddy of HTML tools, with support for modern standards

    Tidy is a console application for macOS, Linux, Windows, UNIX, and more. It corrects and cleans up HTML and XML documents by fixing markup errors and upgrading legacy code to modern standards. libtidy is a C static and dynamic library that developers can integrate into their applications in order to bring all of Tidy’s power to your favorite tools. libtidy is used today in desktop applications, web servers, and more. Tidy tidies HTML and XML. It can tidy your documents by itself, and developers can easily integrate its features into even more powerful tools. Thanks for the efforts of HTACG and prominent contributors, HTML Tidy has a whole new heartbeat and a whole new life.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 21
    World countries JSON

    World countries JSON

    Constantly updated lists of world countries

    Constantly updated lists of world countries and their associated alpha-2, alpha-3 and numeric country codes as defined by the ISO 3166 standard, available in CSV, JSON , PHP, SQL and XML formats, in multiple languages and with national flags included; also available are the ISO 3166-2 codes of provinces/ states associated with the countries. Also available are the ISO 3166-2 codes used for identifying the principal subdivisions (e.g., provinces or states) of all countries coded in ISO 3166-1.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 22
    FreeTAKServer

    FreeTAKServer

    Situational Awareness Server compatible with TAK clients

    FTS is a Python3 implementation of a TAK Server for devices like ATAK, WinTAK, and ITAK, it is cross-platform and runs from a multi-node installation on AWS down to the Android edition. It's free and open source (released under the Eclipse Public License. FTS allows you to connect ATAK clients to share geo-information, to chat with all the connected clients, exchange files and more. It intends to support all the major use cases of the original TAK server.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 23
    Miller

    Miller

    Miller is like awk, sed, cut, join, and sort for name-indexed data

    Miller is like awk, sed, cut, join, and sort for data formats such as CSV, TSV, JSON, JSON Lines, and positionally-indexed. With Miller, you get to use named fields without needing to count positional indices, using familiar formats such as CSV, TSV, JSON, JSON Lines, and positionally-indexed. Then, on the fly, you can add new fields which are functions of existing fields, drop fields, sort, aggregate statistically, pretty-print, and more. Miller operates on key-value-pair data while the familiar Unix tools operate on integer-indexed fields: if the natural data structure for the latter is the array, then Miller's natural data structure is the insertion-ordered hash map. Miller handles a variety of data formats, including but not limited to the familiar CSV, TSV, and JSON/JSON Lines. (Miller can handle positionally-indexed data too!)
    Downloads: 14 This Week
    Last Update:
    See Project
  • 24
    SingleFile

    SingleFile

    Web Extension for saving a copy of complete web page in a single file

    Web Extension for Firefox/Chrome/MS Edge and CLI tool to save a faithful copy of an entire web page in a single HTML file. SingleFile is a Web Extension (and a CLI tool) compatible with Chrome, Firefox (Desktop and Mobile), Microsoft Edge, Vivaldi, Brave, Waterfox, Yandex Browser, and Opera. It helps you to save a complete web page into a single HTML file. Wait until the page is fully loaded. Click on the SingleFile button in the extension toolbar to save the page. You can click again on the button to cancel the action when processing a page. Open the context menu by right-clicking the SingleFile button in the extension toolbar or on the webpage. It allows you to save the current tab, the selected content, the selected frame. With auto-save active, pages are automatically saved every time after being loaded (or before being unloaded if not). Right-click on the SingleFile button and select "Manage extension" (Firefox) / "Options" (Chrome) to open the options page.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 25
    dasel

    dasel

    Select, put and delete data from JSON, TOML, YAML, XML and CSV files

    Dasel (short for data-selector) allows you to query and modify data structures using selector strings. Say goodbye to learning new tools just to work with a different data format. Dasel uses a standard selector syntax no matter the data format. This means that once you learn how to use dasel you immediately have the ability to query/modify any of the supported data types without any additional tools or effort. Dasel can be imported and used just like any other go package. This can be very useful if you need to manipulate data from your own applications. From then on the rest of the docs and comments should be enough to get you going. Uses a standard query/selector syntax across all data formats. Zero runtime dependencies. Available on Linux, Mac and Windows. Available to import and use in your own projects. Run via Docker.
    Downloads: 14 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB