Showing 352 open source projects for "data"

View related business solutions
  • Go from Data Warehouse to Data and AI platform with BigQuery Icon
    Go from Data Warehouse to Data and AI platform with BigQuery

    Build, train, and run ML models with simple SQL. Automate data prep, analysis, and predictions with built-in AI assistance from Gemini.

    BigQuery is more than a data warehouse—it's an autonomous data-to-AI platform. Use familiar SQL to train ML models, run time-series forecasts, and generate AI-powered insights with native Gemini integration. Built-in agents handle data engineering and data science workflows automatically. Get $300 in free credit, query 1 TB, and store 10 GB free monthly.
    Try BigQuery Free
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 1
    Azure Data Studio

    Azure Data Studio

    A data management tool that enables working with other SQL tools

    Azure Data Studio is a cross-platform database tool for data professionals who use on-premises and cloud data platforms on Windows, macOS, and Linux. Azure Data Studio offers a modern editor experience with IntelliSense, code snippets, source control integration, and an integrated terminal. It's engineered with the data platform user in mind, with the built-in charting of query result sets and customizable dashboards.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 2
    Syncthing

    Syncthing

    Open Source Continuous File Synchronization

    Syncthing is an open source continuous file synchronization program, which essentially works by synchronizing files between two or more computers in real time. Syncthing is designed to first and foremost keep users’ data safe and protected against data loss and against attackers who would unlawfully try and access this data. It doesn’t store data anywhere else except on your computers and uses encryption to secure all your data. It is very easy to use, cross-platform and automatic. As such, it is very user-friendly and ideal for all users looking for a secure and easy-to-use file synchronization solution. ...
    Downloads: 52 This Week
    Last Update:
    See Project
  • 3
    lakeFS

    lakeFS

    lakeFS - Git-like capabilities for your object storage

    Increase data quality and reduce the painful cost of errors. Data engineering best practices using git-like operations on data. lakeFS is an open-source data version control for data lakes. It enables zero-copy Dev / Test isolated environments, continuous quality validation, atomic rollback on bad data, reproducibility, and more. Data is dynamic, it changes over time.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 4
    Data Crow

    Data Crow

    The ultimate cataloguer

    Data Crow allows you to use the standard movie & video (divx, xvid, DVD, Blu-ray, etc), book (and eBooks), images, board games, comic books, games & software, music (mp3 and other music files) cataloguing modules. Besides these modules, which you can change to fit your requirements, you can create new modules (want to catalogue your stamps, equipment, or anything else?).
    Leader badge
    Downloads: 323 This Week
    Last Update:
    See Project
  • Ship AI Apps Faster with Vertex AI Icon
    Ship AI Apps Faster with Vertex AI

    Go from idea to deployed AI app without managing infrastructure. Vertex AI offers one platform for the entire AI development lifecycle.

    Ship AI apps and features faster with Vertex AI—your end-to-end AI platform. Access Gemini 3 and 200+ foundation models, fine-tune for your needs, and deploy with enterprise-grade MLOps. Build chatbots, agents, or custom models. New customers get $300 in free credit.
    Try Vertex AI Free
  • 5
    Zipkin

    Zipkin

    Distributed tracing system to gather timing data

    Zipkin is a distributed tracing system. It helps gather timing data needed to troubleshoot latency problems in service architectures. Features include both the collection and lookup of this data. If you have a trace ID in a log file, you can jump directly to it. Otherwise, you can query based on attributes such as service, operation name, tags and duration. Some interesting data will be summarized for you, such as the percentage of time spent in a service, and whether or not operations failed. ...
    Downloads: 26 This Week
    Last Update:
    See Project
  • 6
    restic

    restic

    Fast, secure and efficient backup program

    ...It supports backends for storing backups natively, including on the local directory, sftp server (via SSH), HTTP REST server (protocol rest server), OpenStack Swift, among many others. It also uses cryptography to ensure the confidentiality and integrity of your data. Restic supports Windows, macOS and Linux operating systems, as well as FreeBSD and OpenBSD.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 7
    Disk Image Manager

    Disk Image Manager

    Visual inspection tool for .dsk Spectrum/Amstrad disk images

    Disk Image Manager is an application for examining and manipulating disk images in the Standard and Extended DSK format used by many Spectrum, Amstrad PCW and CPC emulators. Many of the features and functions of this tool were driven by the Spectrum Disk Preservation team which used this format and tool to help manage the selection and testing of disk images archived to The World of Spectrum and The TZX Vault. These images are traditionally created with CPDRead under DOS but Simon Owen's...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 8
    BFG Repo-Cleaner

    BFG Repo-Cleaner

    Remove large or troublesome blobs

    The BFG is a simpler, faster alternative to git-filter-branch for cleansing bad data out of your Git repository history. You can use it for removing crazy big files, and for removing passwords, credentials and other private data. The git-filter-branch command is enormously powerful and can do things that the BFG can't, but the BFG is much better for the tasks above, because is faster and simpler. The BFG isn't particularily clever, but is focused on making the above tasks easy. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 9
    Brotli

    Brotli

    Brotli compression format

    ...After the overflow happens, memcpy is invoked with a gigantic num value, that will likely cause the crash. Brotli is a generic-purpose lossless compression algorithm that compresses data using a combination of a modern variant of the LZ77 algorithm, Huffman coding and 2nd order context modeling, with a compression ratio comparable to the best currently available general-purpose compression methods. It is similar in speed with deflate but offers more dense compression. The specification of the Brotli Compressed Data Format is defined in RFC 7932. ...
    Downloads: 35 This Week
    Last Update:
    See Project
  • Run Any Workload on Compute Engine VMs Icon
    Run Any Workload on Compute Engine VMs

    From dev environments to AI training, choose preset or custom VMs with 1–96 vCPUs and industry-leading 99.95% uptime SLA.

    Compute Engine delivers high-performance virtual machines for web apps, databases, containers, and AI workloads. Choose from general-purpose, compute-optimized, or GPU/TPU-accelerated machine types—or build custom VMs to match your exact specs. With live migration and automatic failover, your workloads stay online. New customers get $300 in free credits.
    Try Compute Engine
  • 10
    Cryptomator

    Cryptomator

    Multi-platform transparent client-side encryption for your cloud files

    Cryptomator is a free and open source project that offers multi-platform, transparent client side encryption of your files in the cloud. It works with just about any cloud storage service; is totally transparent so you can work with your files how you normally would; and secure with a 256-bit key length AES encryption. It’s also simple and easy to use with no need for different accounts, key management, cloud access grants or cipher configurations.
    Downloads: 27 This Week
    Last Update:
    See Project
  • 11
    LakeSoul

    LakeSoul

    An end-to-end, realtime and cloud native Lakehouse framework

    LakeSoul is a high-performance, unified table storage framework for big data lakes, supporting both streaming and batch data in a single format. Built on top of Apache Spark and leveraging Apache Arrow and Parquet, LakeSoul provides ACID transactions, schema evolution, and time travel. It is designed for large-scale data lake architectures that require consistency, efficiency, and easy integration with modern data stacks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Velero

    Velero

    Backup and migrate Kubernetes applications and their persistent volume

    ...Velero is an open source tool to safely backup and restore, perform disaster recovery, and migrate Kubernetes cluster resources and persistent volumes. Reduces time to recovery in case of infrastructure loss, data corruption, and/or service outages. Enables cluster portability by easily migrating Kubernetes resources from one cluster to another​. Offers key data protection features such as scheduled backups, retention schedules, and pre or post-backup hooks for custom actions. Backup your Kubernetes resources and volumes for an entire cluster, or part of a cluster by using namespaces or label selectors. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13
    MemGuard

    MemGuard

    Secure software enclave for storage of sensitive information in memory

    This package attempts to reduce the likelihood of sensitive data being exposed when in memory. It aims to support all major operating systems and is written in pure Go. Sensitive data is encrypted and authenticated in memory with XSalsa20Poly1305. The scheme used also defends against cold-boot attacks. Memory allocation bypasses the language runtime by using system calls to query the kernel for resources directly.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Space Radar Electron

    Space Radar Electron

    Disk And Memory Space Visualization App built with Electron & d3.js

    Space Radar Electron is an application that offers an interactive and comprehensive visualization of disk space and memory usage of your computer. Built with Electron & d3.js, it currently offers visualizations in the form of Sunburst, Treemap and Flamegraph charts. As it scans the contents of your disk, it produces a preview visualization so you can already see what's been scanned. It allows for drilldown of directories, breadcrumbs and navigation. Space Radar works fast, and is...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    ArchiveBox

    ArchiveBox

    Open source self-hosted web archiving

    ...Archive.org does a great job as a centralized service, but saved URLs have to be public, and they can't save every type of content. ArchiveBox is an open source tool that lets organizations & individuals archive both public & private web content while retaining control over their data. It can be used to save copies of bookmarks, preserve evidence for legal cases, backup photos from FB/Insta/Flickr or media from YT/Soundcloud/etc., save research papers, and more. ArchiveBox is an open-source, self-hosted web archiving tool for saving websites offline. It helps organizations and individuals preserve bookmarks, research papers, and social media content, among others.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 16
    SeaweedFS

    SeaweedFS

    Distributed storage system for blobs, objects, files, and data lake

    SeaweedFS is a distributed storage system for blobs, objects, files, and data lake, to store and serve billions of files fast! Blob store has O(1) disk seek, local tiering, cloud tiering. Filer supports cross-cluster active-active replication, Kubernetes, POSIX, S3 API, encryption, Erasure Coding for warm storage, FUSE mount, Hadoop, WebDAV. SeaweedFS is an independent Apache-licensed open source project with its ongoing development made possible because of the community.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 17
    Curve

    Curve

    Curve is a sandbox project hosted by the CNCF Foundation

    A cloud-native distributed storage system. A sandbox project hosted by the CNCF Foundation. Curve is a modern storage system developed by netease, currently supporting file storage(CurveFS) and block storage(CurveBS). Now it's hosted at CNCF as a sandbox project. The performance, mixed, capacity cloud disk or persistent volume of virtual machine/container, and remote disks of physical machines. High-performance separation of storage and computation architecture: high-performance and low...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    RecoverPy

    RecoverPy

    Interactively find and recover deleted or overwritten files

    RecoverPy is a powerful tool that leverages your system capabilities to recover lost files. Unlike others, you can not only recover deleted files but also overwritten data. Every block of your partition will be scanned. You can even find a string in binary files.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19
    Kopia

    Kopia

    Cross-platform backup tool for Windows, macOS and Linux

    ...Its architecture supports end-to-end encryption, optional compression, and deduplication, so multiple backups can share data efficiently, and repositories can be stored securely even in untrusted locations.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    Convert to it!

    Convert to it!

    Truly universal online file converter

    Convert to it! is a lightweight utility project designed to provide simple and fast file or data conversion workflows through a developer-friendly interface. The repository focuses on practical transformations between formats, making it useful in scripting pipelines, automation tasks, and small backend utilities where reliability matters more than heavy frameworks. Its structure emphasizes clarity and minimal dependencies so developers can quickly understand and extend the codebase. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    Outline Server

    Outline Server

    Outline Manager, developed by Jigsaw

    ...VPNs are useful to protect the privacy of your communications and allow you to keep access to the Internet open. However, with most VPN providers, you don't always know who controls the VPN and who has access to your data. Creating your own VPN server used to be a complicated process…until now. Outline allows anyone to create their own VPN, run it, and share access to it. Outline is designed to offer block resistance. It allows you to control your server settings, including its location. Furthermore, Outline is a 100% open-source project that uses transparent technology and has undergone audits by two security companies.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 22
    PlainApp

    PlainApp

    Open-source application that allows you to manage your phone

    ...Access files, videos, music, contacts, sms, calls, and more from your desktop using a secure, easy-to-use web interface. We take your privacy seriously and use TLS + AES-GCM-256 encryption to ensure your data is secure. PlainApp has a modern, customizable interface with multi-language support, light/dark theme options. You can manage your phone from your desktop by visiting a self-hosted webpage wirelessly.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 23
    Zippy

    Zippy

    Pure Nim implementation of deflate, zlib, gzip and zip

    Zippy is a pure Nim implementation of DEFLATE, ZLIB, GZIP, and ZIP data compression formats, aiming to be small, performant, and dependency-free.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    MobX

    MobX

    A Simple, scalable state management

    MobX is a battle tested library that makes state management simple and scalable by transparently applying functional reactive programming (TFRP). Write minimalistic, boilerplate free code that captures your intent. Trying to update a record field? Use the good old JavaScript assignment. Updating data in an asynchronous process? No special tools are required, the reactivity system will detect all your changes and propagate them out to where they are being used. All changes to and uses of your data are tracked at runtime, building a dependency tree that captures all relations between state and output. This guarantees that computations depending on your state, like React components, run only when strictly needed. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Stash Kubernetes

    Stash Kubernetes

    Backup your Kubernetes Stateful Applications

    A complete Kubernetes native disaster recovery solution for backup and restore your volumes and databases in Kubernetes on any public and private clouds. Backup your data in Kubernetes native way. Stash uses Custom Resource Definition (CRD) to specify targets and behaviors of the backup and restore process. Schedule a backup for your workloads in the same way you deployed them into Kubernetes. Back up your Kubernetes volumes mounted in a workload. Stash supports Deployment, DaemonSet, StatefulSet, ReplicaSet, ReplicationController, OpenShift DeploymentConfig etc. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB