Showing 2000 open source projects for "data"

  • 1
    data-diff

    Efficiently diff rows across two different databases

    We're excited to announce the launch of a new open-source product, data-diff, that makes comparing datasets across databases fast at any scale. data-diff automates data quality checks for data replication and migration. In modern data platforms, data is constantly moving between systems, and at today's data volume and complexity, systems go out of sync all the time. Until now, there has not been any tooling to ensure that the data is correctly copied. ...
    Downloads: 2 This Week
  • 2
    atk4/data

    Data Access PHP Framework for SQL & high-latency databases

    ATK Data is a data persistence and modeling framework for PHP, developed as part of the Agile Toolkit. It provides a high-level abstraction for working with databases, making it easier to define and manipulate data models with minimal boilerplate code. It supports various SQL and NoSQL databases and integrates seamlessly with Agile UI and other PHP frameworks.
    Downloads: 5 This Week
  • 3
    Azure Data Studio

    A data management tool that enables working with other SQL tools

    Azure Data Studio is a cross-platform database tool for data professionals who use on-premises and cloud data platforms on Windows, macOS, and Linux. Azure Data Studio offers a modern editor experience with IntelliSense, code snippets, source control integration, and an integrated terminal. It's engineered with the data platform user in mind, with built-in charting of query result sets and customizable dashboards.
    Downloads: 16 This Week
  • 4
    Spring Data Neo4j

    Provide support to increase developer productivity in Java

    ...The template programming model is equivalent to other Spring templates; it forms the basis for interaction with the graph and is also used for the Spring Data repository support. Spring Data Neo4j is a core part of the Spring Data project, which aims to provide convenient data access for NoSQL databases. Spring Data builds on the Spring Framework; check the spring.io website for a wealth of reference documentation. If you are just starting out with Spring, try one of the guides.
    Downloads: 1 This Week
  • 5
    DBeaver

    Free universal database tool

    ...It is useful for developers, SQL programmers, database administrators and analysts. DBeaver comes with plenty of great features such as metadata and SQL editors, ERD, data export/import/migration and more. Plugins are available for certain databases, and there are also several database management utilities. DBeaver’s Enterprise Edition provides even more features and supports non-JDBC datasources.
    Downloads: 171 This Week
  • 6
    HDF5

    Official HDF5® Library Repository

    HDF5 (Hierarchical Data Format v5) is a widely-used data management library and file format for storing large and complex scientific data sets efficiently.
    Downloads: 24 This Week
  • 7
    Greenplum Database

    Massively parallel data platform for analytics, machine learning and AI

    Rapidly create and deploy models for complex applications in cybersecurity, predictive maintenance, risk management, fraud detection, and many other areas. With its unique cost-based query optimizer designed for large-scale data workloads, Greenplum scales interactive and batch-mode analytics to large datasets in the petabytes without degrading query performance and throughput. Based on PostgreSQL, Greenplum provides you with more control over the software you deploy, reducing vendor lock-in, and allowing open influence on product direction. Greenplum reduces data silos by providing you with a single, scale-out environment for converging analytic and operational workloads, like streaming ingestion. ...
    Downloads: 21 This Week
  • 8
    Logstash

    Centralize, transform and stash your data

    Logstash is a server-side data processing pipeline that dynamically ingests data from numerous sources, transforms it, and ships it to your favorite “stash” regardless of format or complexity. It supports and ingests data of all shapes, sizes and sources, dynamically transforms and prepares this data, and transports it to the output of your choice. Logstash is extensible, with over 200 plugins available to let you create and configure your pipeline how you choose.
    Downloads: 19 This Week
  • 9
    HeidiSQL

    A lightweight client for managing MariaDB, MySQL, SQL Server, etc.

    HeidiSQL is a lightweight, open-source client for managing relational databases. It provides a user-friendly interface for browsing and editing data, creating and modifying tables, and executing SQL queries. HeidiSQL supports various database systems, making it a versatile tool for developers and database administrators.
    Downloads: 60 This Week
  • 10
    Nextcloud Server

    A safe home for all your data

    Nextcloud Server is free and open source server software that allows you to store all of your data on a server of your choosing. With Nextcloud you can easily access and store data in the data center you trust, sync data among various devices, and share your data for collaboration purposes. It offers the best security in the self-hosted file sync and share world, and is expandable with hundreds of apps.
    Downloads: 21 This Week
  • 11
    etcd

    Distributed reliable key-value store for the most critical data

    ...Mainly used as a separate coordination service in distributed systems, and designed to hold small amounts of data that can fit entirely in memory. Read and write values using standard HTTP tools, such as curl. Store data in hierarchically organized directories, as in a standard filesystem. Watch specific keys or directories for changes and react to changes in values.
    Downloads: 30 This Week
  • 12
    DbGate

    Database manager for MySQL, PostgreSQL, SQL Server, MongoDB

    ...Runs as an application for Windows, Linux and Mac, or in a Docker container on the server and in a web browser on the client. Import and export from/to CSV, Excel, JSON, XML. Free table editor: quick table data editing (clean up data after import or before export, prototype tables, etc.). Archives: back up your data in JSON files on the local filesystem (or on the DbGate server, when using the web application). Charts, with export of a chart to an HTML page.
    Downloads: 64 This Week
  • 13
    Redash

    Connect to any data source, easily visualize and share your data

    Redash is an essential tool to help you make sense of your data. It allows everyone, regardless of their level of technical know-how, to harness the power of data. SQL users connect, query, visualize and share data easily and efficiently, allowing everyone in their organization to use the data. Redash combines the power and comfort of an SQL client with the collaborative benefits of a cloud-based service.
    Downloads: 15 This Week
  • 14
    StarRocks

    StarRocks is a next-gen sub-second MPP database for full analytics

    ...From streaming data to change data capture, StarRocks meets the data ingestion demands of real-time analytics. Scale storage and computing power horizontally and support tens of thousands of concurrent users. All of your BI tools work with StarRocks through standard SQL. StarRocks provides superior performance. It is also a unified OLAP covering most data analytics scenarios.
    Downloads: 7 This Week
  • 15
    Countries States Cities Database

    Discover our global repository of countries, states, and cities

    Full database of cities, states, and countries, available in JSON, SQL, XML, YAML, and CSV formats. All countries, states, and cities are covered and populated with different combinations and versions. The API database is regularly updated to ensure the latest data is available to users. Access ISO2 and ISO3 codes, country code, capital, native language, time zones (for countries), and more.
    Downloads: 21 This Week
  • 16
    Npgsql

    Npgsql is the .NET data provider for PostgreSQL

    ...Npgsql is the open source .NET data provider for PostgreSQL. It allows you to connect and interact with PostgreSQL server using .NET.
    Downloads: 60 This Week
  • 17
    Apache HBase

    Get random, realtime read/write access to your Big Data

    Use Apache HBase™ when you need random, realtime read/write access to your Big Data. This project's goal is the hosting of very large tables, billions of rows by millions of columns, atop clusters of commodity hardware. Apache HBase is an open-source, distributed, versioned, non-relational database modeled after Google's Bigtable: A Distributed Storage System for Structured Data by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, Apache HBase provides Bigtable-like capabilities on top of Hadoop and HDFS. ...
    Downloads: 8 This Week
  • 18
    Jailer Database Tool

    Database subsetting and relational data browsing tool

    Jailer is a tool for database subsetting and for schema and data browsing. It creates small slices from your production database and lets you navigate through your database following the relationships. It is ideal for creating small samples of test data or for local problem analysis with relevant production data. The slices can be imported into your development and test environment (consistent and referentially intact).
    Downloads: 8 This Week
  • 19
    TimescaleDB

    An open-source time-series SQL database optimized for fast ingest

    TimescaleDB is the open-source relational database for time-series and analytics. Build powerful data-intensive applications. Become instantly productive with full SQL. Rely on the same PostgreSQL you know, love, and trust. Hyperfunctions make time series easier. Achieve 10-100x faster queries than with vanilla PostgreSQL, InfluxDB, MongoDB. Write millions of data points per second per node. Horizontally scale to petabytes. Don’t worry about cardinality.
    Downloads: 64 This Week
  • 20
    DuckDB

    DuckDB is an in-process SQL OLAP Database Management System

    ...For more information on the goals of DuckDB, please refer to the Why DuckDB page on our website. Processing and storing tabular datasets, e.g. from CSV or Parquet files. Interactive data analysis, e.g. joining and aggregating multiple large tables. Concurrent large changes to multiple large tables, e.g. appending rows, adding/removing/updating columns. Large result set transfer to client. For development, DuckDB requires CMake, Python3 and a C++11-compliant compiler. Run make in the root directory to compile the sources. ...
    Downloads: 16 This Week
  • 21
    OpenRefine

    A free, open source, powerful tool for working with messy data

    OpenRefine is a powerful Java-based tool designed to work with messy data and improve it. With OpenRefine you can load data, understand it, clean it up, transform it, reconcile it, and augment it with web services and external data. It allows you to do this all from a web browser and in the convenience and privacy of your own computer. OpenRefine keeps all data securely in your computer by running a small server on it, using your web browser to interact with it. ...
    Downloads: 4 This Week
  • 22
    Datacap

    DataCap is integrated software for data transformation

    Datacap is an open-source data catalog and governance tool that helps organizations manage and document their data assets. It provides metadata management, lineage tracking, and collaboration features to ensure data transparency and quality. Datacap is designed for teams that need a lightweight, self-hosted solution to organize and govern their data ecosystems.
    Downloads: 4 This Week
  • 23
    pgModeler

    Open-source data modeling tool designed for PostgreSQL

    pgModeler is an open-source data modeling tool for PostgreSQL, enabling visual creation and management of database schemas. It supports reverse engineering from existing databases, model validation, and SQL export, providing a full-featured GUI for database design. pgModeler is suited for developers, DBAs, and analysts who want to streamline schema creation and documentation.
    Downloads: 54 This Week
  • 24
    pg_flo

    Stream, transform, and route PostgreSQL data in real-time

    The easiest way to move and transform data between PostgreSQL databases.
    Downloads: 5 This Week
  • 25
    Google Node.js Datastore

    Node.js client for Google Cloud Datastore

    Google’s Node.js Datastore client is a library for interacting with Google Cloud Datastore, a fully managed NoSQL database. It enables developers to store and query structured data in a scalable and serverless manner. The library provides an easy-to-use API for integrating Datastore into Node.js applications.
    Downloads: 6 This Week