Showing 4024 open source projects for "data"

View related business solutions
  • Cut Data Warehouse Costs up to 54% with BigQuery Icon
    Cut Data Warehouse Costs up to 54% with BigQuery

    Migrate from Snowflake, Databricks, or Redshift with free migration tools. Exabyte scale without the Exabyte price.

    BigQuery delivers up to 54% lower TCO than cloud alternatives. Migrate from legacy or competing warehouses using free BigQuery Migration Service with automated SQL translation. Get serverless scale with no infrastructure to manage, compressed storage, and flexible pricing—pay per query or commit for deeper discounts. New customers get $300 in free credit.
    Try BigQuery Free
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 1
    MDN data

    MDN data

    This repository contains general data for Web technologies

    This repository contains general data for Web technologies and is maintained by the MDN team at Mozilla.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    Spring Data Redis

    Spring Data Redis

    Provides support to increase developer productivity in Java

    Provides support to increase developer productivity in Java when using Redis, a key-value store. Uses familiar Spring concepts such as a template class for core API usage and lightweight repository-style data access. The primary goal of the Spring Data project is to make it easier to build Spring-powered applications that use new data access technologies such as non-relational databases, map-reduce frameworks, and cloud-based data services. Connection package as low-level abstraction across multiple Redis drivers (Lettuce and Jedis). Exception translation to Spring’s portable Data Access exception hierarchy for Redis driver exceptions. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    browser-compat-data

    browser-compat-data

    This repository contains compatibility data for Web technologies

    The browser-compat-data ("BCD") project contains machine-readable browser (and JavaScript runtime) compatibility data for Web technologies, such as Web APIs, JavaScript features, CSS properties, and more. Our goal is to document accurate compatibility data for Web technologies, so web developers may write cross-browser compatible websites more easily. BCD is used in web apps and software such as MDN Web Docs, CanIUse, Visual Studio Code, WebStorm and more.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    sq data wrangler

    sq data wrangler

    sq data wrangler

    sq is a command line tool that provides jq-style access to structured data sources: SQL databases, or document formats like CSV or Excel. sq executes jq-like queries, or database-native SQL. It can join across sources: join a CSV file to a Postgres table, or MySQL with Excel. sq outputs to a multitude of formats including JSON, Excel, CSV, HTML, Markdown and XML, and can insert query results directly to a SQL database. sq can also inspect sources to view metadata about the source structure (tables, columns, size). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Run Any Workload on Compute Engine VMs Icon
    Run Any Workload on Compute Engine VMs

    From dev environments to AI training, choose preset or custom VMs with 1–96 vCPUs and industry-leading 99.95% uptime SLA.

    Compute Engine delivers high-performance virtual machines for web apps, databases, containers, and AI workloads. Choose from general-purpose, compute-optimized, or GPU/TPU-accelerated machine types—or build custom VMs to match your exact specs. With live migration and automatic failover, your workloads stay online. New customers get $300 in free credits.
    Try Compute Engine
  • 5
    License List Data

    License List Data

    Various data formats for the SPDX License List

    Various data formats for the SPDX License List including RDFa, HTML, Text, and JSON.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    jq

    jq

    Lightweight and flexible command-line JSON processor

    ...The output(s) of the filter are written to standard out, again as a sequence of whitespace-separated JSON data.
    Downloads: 49 This Week
    Last Update:
    See Project
  • 7
    QSV

    QSV

    Blazing-fast Data-Wrangling toolkit

    qsv is a fast, command-line CSV data toolkit written in Rust that extends the capabilities of xsv. It’s designed to make working with CSV files at scale easy and efficient, offering over 40 powerful subcommands for tasks like querying, sampling, splitting, deduplicating, and more. qsv is ideal for data engineers, analysts, and developers who need high-performance CSV manipulation on the command line.
    Downloads: 42 This Week
    Last Update:
    See Project
  • 8
    keycloak-config-cli

    keycloak-config-cli

    Import YAML/JSON-formatted configuration files into Keycloak

    keycloak-config-cli is a Keycloak utility to ensure the desired configuration state for a realm based on a JSON/YAML file. The format of the JSON/YAML file is based on the export realm format. Store and handle the configuration files inside git just like normal code. A Keycloak restart isn't required to apply the configuration. The config files are based on the keycloak export files. You can use them to re-import your settings. But keep your files as small as possible. Remove all UUIDs and...
    Downloads: 145 This Week
    Last Update:
    See Project
  • 9
    Pandoc

    Pandoc

    The universal markup converter

    Pandoc is a universal document converter able to convert files from a multitude of markup formats into another. With Pandoc, you have a swiss-army knife of a converter, able to convert practically any markup format into any other. Pandoc contains a Haskell library for conversions as well as a command-line tool that uses this library. It can convert to and from just about anything-- lightweight markup formats, HTML formats, documentation formats, ebooks, TeX formats, word processor formats...
    Downloads: 134 This Week
    Last Update:
    See Project
  • Build on Google Cloud with $300 in Free Credit Icon
    Build on Google Cloud with $300 in Free Credit

    New to Google Cloud? Get $300 in free credit to explore Compute Engine, BigQuery, Cloud Run, Vertex AI, and 150+ other products.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query exabytes in BigQuery, or build AI apps with Vertex AI and Gemini. Once your credits are used, keep building with 20+ products with free monthly usage, including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. Sign up to start building right away.
    Start Free Trial
  • 10
    OpenPDF

    OpenPDF

    open source Java library for creating and editing PDF files

    OpenPDF is a Java library for creating and editing PDF files with a LGPL and MPL open source license. OpenPDF is the LGPL/MPL open source successor of iText, and is based on a fork, of a fork, of iText 4 svn tag.
    Downloads: 57 This Week
    Last Update:
    See Project
  • 11
    PDF.js

    PDF.js

    A PDF Reader in JavaScript

    PDF.js is a web standards-based platform for parsing and rendering Portable Document Formats (PDFs). Open source and built with HTML5, this PDF viewer is supported by a great community and Mozilla Labs. PDF.js can be used on both modern and older browsers, and is built into version 19+ of Firefox.
    Downloads: 89 This Week
    Last Update:
    See Project
  • 12
    TeXworks

    TeXworks

    A simple interface for working with TeX documents

    TeXworks is a free and simple working environment for authoring TeX (LaTeX, ConTeXt and XeTeX) documents. Inspired by Dick Koch's award-winning TeXShop program for Mac OS X, it makes entry into the TeX world easier for those using desktop operating systems other than OS X. It provides an integrated, easy-to-use environment for users on other platforms particularly GNU/Linux and Windows and features a clean, simple interface accessible to casual and non-technical users.
    Downloads: 116 This Week
    Last Update:
    See Project
  • 13
    JSONViewer for Notepad++

    JSONViewer for Notepad++

    A JSON viewer plugin for Notepad++

    This plugin is meant to display a JSON string in a Treeview. It also marks the error position in case of parsing errors, and that's it. The plugin can be used in Notepad++.
    Downloads: 43 This Week
    Last Update:
    See Project
  • 14
    OCRmyPDF

    OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files

    OCRmyPDF adds an optical character recognition (OCR) text layer to scanned PDF files, allowing them to be searched. PDF is the best format for storing and exchanging scanned documents. Unfortunately, PDFs can be difficult to modify. OCRmyPDF makes it easy to apply image processing and OCR (recognized, searchable text) to existing PDFs.
    Downloads: 97 This Week
    Last Update:
    See Project
  • 15
    jackson-core

    jackson-core

    Core part of Jackson that defines Streaming API

    ...It is licensed under Apache License 2.0. Alternate data format implementations (like Smile (binary JSON), XML, CSV, Protobuf, and CBOR) also build on this base package, implementing the core interfaces, and making it possible to use standard data-binding packages regardless of underlying data format.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 16
    anime-offline-database

    anime-offline-database

    A JSON based anime data set with important meta data

    The purpose of this repository is to create an offline database containing anime metadata aggregated by different anime metadata providers (such as myanimelist.net, anidb.net, kitsu.io, and more) and allow cross-references between those metadata providers. This file is supposed to be used by and created for manami. A JSON-based anime data set containing the most important meta data as well as cross-references to various anime sites such as MAL, ANIDB, ANILIST, KITSU, and more.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 17
    Cube

    Cube

    Universal semantic layer platform for AI, BI, spreadsheets

    Cube is the semantic layer for building data applications. It helps data engineers and application developers access data from modern data stores, organize it into consistent definitions, and deliver it to every application. Cube was designed to work with all SQL-enabled data sources, including cloud data warehouses like Snowflake or Google BigQuery, query engines like Presto or Amazon Athena, and application databases like Postgres. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 18
    Jansson

    Jansson

    C library for encoding, decoding and manipulating JSON data

    Jansson is a C library for encoding, decoding and manipulating JSON data.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 19
    Better BibTeX for Zotero

    Better BibTeX for Zotero

    Make Zotero effective for us LaTeX holdouts

    Better BibTeX (BBT) is a plugin for Zotero and Juris-M that makes it easier to manage bibliographic data, especially for people authoring documents using text-based toolchains (e.g. based on LaTeX / Markdown). Zotero does all its work in UTF-8 Unicode, which is absolutely the right thing to do. Unfortunately, for those shackled to BibTeX and who cannot (yet) move to BibLaTeX, unicode is a major PITA. Also, Zotero supports some simple HTML markup in your items that Bib(La)TeX won’t understand.
    Downloads: 42 This Week
    Last Update:
    See Project
  • 20
    Miller

    Miller

    Miller is like awk, sed, cut, join, and sort for name-indexed data

    ...Miller operates on key-value-pair data while the familiar Unix tools operate on integer-indexed fields: if the natural data structure for the latter is the array, then Miller's natural data structure is the insertion-ordered hash map. Miller handles a variety of data formats, including but not limited to the familiar CSV, TSV, and JSON/JSON Lines. (Miller can handle positionally-indexed data too!)
    Downloads: 6 This Week
    Last Update:
    See Project
  • 21
    Countries States Cities Database

    Countries States Cities Database

    Discover our global repository of countries, states, and cities

    Full Database of city state country available in JSON, SQL, XML, YAML & CSV format. All Countries, States and cities are Covered and populated with Different Combinations and versions. The API database is regularly updated to ensure the latest data is available to users. Discover our global repository of countries, states, and cities! Get comprehensive data in JSON, SQL, XML, YAML, and CSV formats. Access ISO2, ISO3 codes, country code, capital, native language, time zones (for countries), and more.
    Downloads: 17 This Week
    Last Update:
    See Project
  • 22
    BentoPDF

    BentoPDF

    A Privacy First PDF Toolkit

    ...Because it runs entirely on your own system or server, BentoPDF helps ensure that sensitive documents never leave your environment — a critical consideration for privacy-conscious users or organizations with strict data policies. Its design aims to balance robust feature coverage with a simple setup process, typically involving a Docker image so you can start using the toolkit with minimal configuration. BentoPDF also supports automation via command line parameters or API calls, so it can be integrated into batch workflows or document pipelines.
    Downloads: 44 This Week
    Last Update:
    See Project
  • 23
    PDF4QT

    PDF4QT

    Open source PDF editor

    PDF4QT is open source PDF editor based on Qt framework. It contains a C++ library, applications for viewing/editing PDF documents, and a command line tool. PDF4QT is an open-source PDF editor for Windows/Linux. It is a modern solution for viewing/editing/rendering PDF documents, for users and developers alike. For developers, there is a C++ library and a command line tool for use in scripts. For users, there are four applications offering many features. The project is hosted on Github and...
    Downloads: 81 This Week
    Last Update:
    See Project
  • 24
    R3nzSkin

    R3nzSkin

    Skin changer for League of Legends (LOL)

    R3nzSkin is an internal skin changer for League of Legends. Change the skin of your champion, your ward, other champions, towers, minions, and jungle monsters in the game. If your CPU supports the AVX / AVX2 / AVX-512 instruction set, you can enable it in project settings. This should result in more performant code, optimized for your CPU. Currently, SSE2 instructions are selected in project settings. Change skins anytime and unlimited times in a single game. Supports all popular languages...
    Downloads: 45 This Week
    Last Update:
    See Project
  • 25
    MiKTeX

    MiKTeX

    the MiKTeX source code

    MiKTeX is a modern TeX distribution for Windows, Linux, and macOS. MiKTeX's integrated package manager installs missing components from the Internet if required. This allows you to keep your TeX installation as minimal as possible (“Just enough TeX”). MiKTeX is open source. You are welcome to redistribute MiKTeX under certain conditions.
    Downloads: 27 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB
Gen AI apps are built with MongoDB Atlas
Atlas offers built-in vector search and global availability across 125+ regions. Start building AI apps faster, all in one place.
Try Free →