Showing 171 open source projects for "data"

  • 1
    QSV

    Blazing-fast Data-Wrangling toolkit

    qsv is a fast, command-line CSV data toolkit written in Rust that extends the capabilities of xsv. It’s designed to make working with CSV files at scale easy and efficient, offering over 40 powerful subcommands for tasks like querying, sampling, splitting, deduplicating, and more. qsv is ideal for data engineers, analysts, and developers who need high-performance CSV manipulation on the command line.
    Downloads: 42 This Week
  • 2
    RustDesk

    An open-source remote desktop, and alternative to TeamViewer

    ...Desktop versions use Flutter or Sciter (deprecated) for the GUI. This tutorial covers Sciter only, since it is easier and friendlier to start with. Check out our CI for building the Flutter version. You have full control of your data, with no concerns about security. You can use our rendezvous/relay server, self-host one, or write your own rendezvous/relay server.
    Downloads: 468 This Week
  • 3
    Scanopy

    Clean network diagrams. One-time setup, zero upkeep

    Scanopy is a powerful multi-modal data capture and analysis toolkit that enables users to collect, process, and visualize structured and unstructured information from a variety of sources in a flexible pipeline. It is built to handle complex scanning tasks — such as OCR, document analysis, audio transcription, network data capture, and image extraction — while providing unified APIs and workflows that make managing heterogeneous data sources seamless.
    Downloads: 16 This Week
  • 4
    Cube

    Universal semantic layer platform for AI, BI, spreadsheets

    Cube is the semantic layer for building data applications. It helps data engineers and application developers access data from modern data stores, organize it into consistent definitions, and deliver it to every application. Cube was designed to work with all SQL-enabled data sources, including cloud data warehouses like Snowflake or Google BigQuery, query engines like Presto or Amazon Athena, and application databases like Postgres. ...
    Downloads: 5 This Week
  • 5
    Zenith

    Sort of like top or htop but with zoom-able charts, CPU, GPU

    In-terminal graphical metrics for your *nix system, written in Rust. The Makefile supports building fully static versions on Linux against the musl C library, which requires musl-gcc to be installed on the system. Install the "musl-tools" package on Debian/Ubuntu derivatives, "musl-gcc" on Fedora, and the equivalent on other distributions from their standard repos. If one needs to build with NVIDIA support in a virtual environment, then it requires some more setup since typically the VM software is...
    Downloads: 24 This Week
  • 6
    gping

    Ping, but with a graph

    Graphical Ping displays a color-coded, real-time graph of continuous pings to a specified host. The program is completely free to use and comes with no warranties. Its options let you graph the execution time of a list of commands rather than pinging hosts, resolve ping targets to IPv4 or IPv6 addresses, draw with dot characters instead of braille, set the number of seconds displayed in the graph, and set the watch interval in seconds (partial seconds such as '0.5' are accepted).
    Downloads: 7 This Week
  • 7
    Relay

    JavaScript framework for building data-driven React applications

    Relay is data-fetching turned declarative. Components declare their data dependencies, without worrying about how to fetch them. Relay guarantees that the data each component needs is fetched and available. This keeps components decoupled and promotes reuse. With Relay, components and their data dependencies can be quickly modified without modifying other parts of the system.
    Downloads: 3 This Week
  • 8
    ReductStore

    The fastest time series object store for Edge AI

    ...Capture and access blob data as time series, tailored for edge computing, computer vision, and IoT. ReductStore handles blob data without size limits; your disk capacity is the only boundary. FIFO quotas keep storage under control and prevent disk space shortages in real time. Manage your time-series blob data with ease: annotate, filter, and save AI labels or metadata.
    Downloads: 2 This Week
  • 9
    RisingWave

    RisingWave: the next-generation streaming database in the cloud

    ...Create materialized views for data that needs to be incrementally aggregated. Query data in RisingWave, including persisted data and data you add or import. RisingWave makes it easy to manage streams and data. All you need to interact with RisingWave is Postgres-compatible SQL; no Java or Scala code is needed.
    Downloads: 1 This Week
  • 10
    Spice.ai OSS

    A self-hostable CDN for databases

    Spice is a portable runtime offering developers a unified SQL interface to materialize, accelerate, and query data from any database, data warehouse, or data lake. Spice connects, fuses, and delivers data to applications, machine-learning models, and AI backends, functioning as an application-specific, tier-optimized Database CDN. The Spice runtime, written in Rust, is built with industry-leading technologies such as Apache DataFusion, Apache Arrow, Apache Arrow Flight, SQLite, and DuckDB. ...
    Downloads: 5 This Week
  • 11
    Graph Node

    Graph Node indexes data from blockchains such as Ethereum

    ...All data is stored and processed on open networks with verifiable integrity. The Graph makes querying this data fast, reliable, and secure. Entrepreneurs are creating next-level apps to scale human coordination on the internet. It's a new frontier and we're just getting started.
    Downloads: 4 This Week
  • 12
    Delta

    A viewer for git and diff output

    Code evolves, and we all spend time studying diffs. Delta aims to make this both efficient and enjoyable: it allows you to make extensive changes to the layout and styling of diffs, as well as allowing you to stay arbitrarily close to the default git/diff output. Language syntax highlighting with color themes. Within-line highlights based on a Levenshtein edit inference algorithm. Git style strings (foreground color, background color, font attributes) are supported for >20 stylable elements....
    Downloads: 8 This Week
  • 13
    Rerun

    Visualize streams of multimodal data

    Rerun is an open-source tool that helps developers visualize real-time multimodal data streams, such as images, point clouds, and tensors, for debugging and understanding ML and robotics systems. Designed for use with Python and Rust, it captures logged data and renders it through an interactive desktop interface, making it easier to understand how complex systems behave over time.
    Downloads: 2 This Week
  • 14
    Supabase Wrappers

    Postgres Foreign Data Wrapper development framework in Rust

    Supabase Wrappers is a collection of PostgreSQL extensions (written in Rust) that allow Postgres to interact with external systems like APIs, files, and other databases. It uses Foreign Data Wrappers (FDWs) to treat external resources as if they were native SQL tables, enabling seamless integration with external services directly from SQL queries.
    Downloads: 3 This Week
  • 15
    CocoIndex

    ETL framework to index data for AI, such as RAG

    ...CocoIndex leverages vector embeddings and integrates with various models and frameworks, including OpenAI and Hugging Face, to provide high-quality semantic understanding. It’s built for transparency, ease of use, and local control over your search data, distinguishing itself from closed, black-box systems. The tool is suitable for developers working on personal knowledge bases, AI search interfaces, or private LLM applications.
    Downloads: 2 This Week
  • 16
    Meetily

    Privacy-first AI meeting assistant with 4x faster Parakeet/Whisper

    ...It’s built for organizations that want meeting intelligence without sending recordings or transcripts to third-party cloud services, which helps address compliance and data sovereignty requirements. The app supports live transcription with local model options (including Whisper- and Parakeet-based workflows) and presents the transcript as the meeting happens, making it useful both for note-taking and accessibility. After or during the session, it can produce structured, AI-generated summaries, and it’s designed to be flexible about where that summarization comes from, supporting local providers as well as external endpoints when allowed by policy.
    Downloads: 31 This Week
  • 17
    RedisJSON

    RedisJSON - a JSON data type for Redis

    RedisJSON is a Redis module that implements ECMA-404 The JSON Data Interchange Standard as a native data type. It allows storing, updating and fetching JSON values from Redis keys (documents).
    Downloads: 0 This Week
  • 18
    Vaultwarden

    Bitwarden compatible server written in Rust

    A nearly complete implementation of the Bitwarden API is provided, including organizations support, attachments, vault API support, serving the static files for the Vault interface, the website icons API, authenticator and U2F support, and YubiKey and Duo support. Pull the Docker image and mount a volume from the host for persistent storage. This will preserve any persistent data under /vw-data/; you can adapt the path to whatever suits you. Some web browsers, like Chrome, disallow the use of Web Crypto APIs in insecure contexts. In this case, you might get an error like "Cannot read property 'importKey'". To solve this problem, you need to access the web vault over HTTPS. This can be configured in Vaultwarden directly or by using a third-party reverse proxy. ...
    Downloads: 35 This Week
  • 19
    Lance

    Modern columnar data format for ML and LLMs implemented in Rust

    Lance is a columnar data format that is easy and fast to version, query, and train on. It’s designed to be used with images, videos, 3D point clouds, audio, and of course tabular data. It supports any POSIX file system, as well as cloud storage such as AWS S3 and Google Cloud Storage.
    Downloads: 2 This Week
  • 20
    Serde JSON

    Strongly typed JSON library for Rust

    Serde is a framework for serializing and deserializing Rust data structures efficiently and generically.
    Downloads: 0 This Week
  • 21
    Arroyo

    Distributed stream processing engine in Rust

    Arroyo is a distributed stream processing engine written in Rust, designed to efficiently perform stateful computations on streams of data. Unlike traditional batch processing, streaming engines can operate on both bounded and unbounded sources, emitting results as soon as they are available.
    Downloads: 0 This Week
  • 22
    Meilisearch

    An open-source, lightning-fast, and hyper-relevant search engine

    ...Search-as-you-type returns answers in less than 50 milliseconds. That's faster than the blink of an eye! Deploy in a matter of minutes. Smart presets let you start searching through your data with zero configuration. Send data to Meilisearch however you want, no need to match a schema or convert your dataset to a compatible format. Everyone makes mistakes! If typos break your search experience, many users will leave thinking what they were looking for just wasn't there. Start searching through your dataset in less than 5 minutes and quickly connect your codebase to Meilisearch with our official libraries. ...
    Downloads: 11 This Week
  • 23
    Lantern Database

    PostgreSQL vector database extension for building AI applications

    Lantern is an open-source PostgreSQL extension for storing vector data, generating embeddings, and running vector search, so AI applications can be built directly on Postgres.
    Downloads: 0 This Week
  • 24
    jql

    A JSON Query Language CLI tool

    jql is a JSON Query Language tool built with Rust.
    Downloads: 1 This Week
  • 25
    rip2

    A safe and ergonomic alternative to rm

    ...Deleted files get sent to the graveyard (typically /tmp/graveyard-$USER, see notes on changing this) under their absolute path, giving you a chance to recover them. No data is overwritten. If files that share the same path are deleted, they will be renamed as numbered backups.
    Downloads: 46 This Week