Showing 159 open source projects for "data"

  • 1
    data-diff

    Efficiently diff rows across two different databases

    We're excited to announce the launch of a new open-source product, data-diff, that makes comparing datasets across databases fast at any scale. data-diff automates data quality checks for data replication and migration. In modern data platforms, data is constantly moving between systems, and at modern data volumes and complexity, systems go out of sync all the time. Until now, there has not been any tooling to ensure that the data is correctly copied. ...
    Downloads: 0 This Week
  • 2
    Grafana

    Leading open-source visualization and observability platform

    Grafana OSS is a leading open-source visualization and observability platform that lets you query, visualize, alert on, and explore your data—regardless of where it’s stored. With support for 100+ data source plugins (such as Prometheus, Loki, Elasticsearch, InfluxDB, SQL/NoSQL databases, OTel, and more), you can unify metrics, logs, traces, and other observability signals in one place. Grafana OSS empowers you to build dynamic, reusable dashboards with rich visualizations, template variables, interactive filtering, and cross-panel linking. ...
    Downloads: 31 This Week
  • 3
    OpenViking

    Context database designed specifically for AI Agents

    OpenViking is an open-source context database engineered for efficient indexing and retrieval of large amounts of unstructured or semi-structured context data used by AI applications. It’s primarily designed to serve as a high-performance, scalable backend for storing app context, embeddings, conversational histories, and other textual artifacts that need rapid lookup and semantic search, which makes it especially useful for systems like chatbots or memory-augmented agents. The project is implemented with performance in mind, often leveraging optimized data structures that balance fast reads and writes with minimal resource consumption. ...
    Downloads: 5 This Week
  • 4
    OSV.dev

    Open source vulnerability DB and triage service

    ...This repository contains the full infrastructure code for deploying osv.dev on Google Cloud Platform, including Terraform configurations, APIs, data pipelines, indexers, and background workers for vulnerability ingestion and impact analysis. It also integrates with automated feeds from sources like NVD and OSS-Fuzz, enabling continuous updates and high data accuracy.
    Downloads: 1 This Week
  • 5
    pgsync

    Postgres to Elasticsearch/OpenSearch sync

    pgsync is a lightweight tool for syncing Postgres databases across environments, such as from production to staging. It allows selective table syncing, data masking, and parallel copying for fast and safe data migration. pgsync is ideal for developers who need realistic test data without exposing sensitive information.
    Downloads: 0 This Week
  • 6
    CogDB

    Micro Graph Database for Python Applications

    Cog is a lightweight, embedded graph database for Python that provides a simple interface for storing and querying graph-based data structures, making it useful for knowledge representation and graph analytics.
    Downloads: 2 This Week
  • 7
    DictDataBase

    A python NoSQL dictionary database, with concurrent access and ACID

    DictDataBase (DictDB) is a lightweight, Python-based in-memory database that uses dictionaries as its primary data structure. It provides a simple and efficient way to store, retrieve, and manipulate data without requiring an external database server. DictDB is useful for applications needing fast lookups, temporary storage, or embedded database functionalities.
    Downloads: 0 This Week
  • 8
    SuperDuperDB

    Integrate, train and manage any AI models and APIs with your database

    Build and manage AI applications easily without needing to move your data to complex pipelines and specialized vector databases. Integrate AI and vector search directly with your database, including real-time inference and model training, using just Python. A single scalable deployment of all your AI models and APIs is automatically kept up to date as new data is processed. No need to introduce an additional database and duplicate your data to use vector search and build on top of it. ...
    Downloads: 0 This Week
  • 9
    SQLModel

    SQL databases in Python, designed for simplicity, compatibility

    SQLModel is a library for interacting with SQL databases from Python code, with Python objects. It is designed to be intuitive, easy to use, highly compatible, and robust. SQLModel is based on Python type annotations and powered by Pydantic and SQLAlchemy.
    Downloads: 1 This Week
  • 10
    Bracket

    Self-hosted tournament system

    Bracket is an open-source tool that tracks and manages data access across your PostgreSQL database. It provides visibility into which parts of your codebase are accessing which tables and columns, enabling data governance, security auditing, and architectural insights. Bracket is particularly helpful for growing teams needing better observability in complex applications.
    Downloads: 0 This Week
  • 11
    SpeechRecognition

    Speech recognition module for Python

    Library for performing speech recognition, with support for several engines and APIs, online and offline. It can recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file, show extended recognition results, and calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). It can also listen to a microphone in the background, among other useful recognizer features. The easiest way to install it is with pip install SpeechRecognition. ...
    Downloads: 16 This Week
  • 12
    SQLAlchemy

    The Database Toolkit for Python

    ...SQLAlchemy provides a full suite of well-known enterprise-level persistence patterns, designed for efficient and high-performing database access, adapted into a simple and Pythonic domain language. An industrial-strength ORM, built from the core on the identity map, unit of work, and data mapper patterns. These patterns allow the transparent persistence of objects using a declarative configuration system. Domain models can be constructed and manipulated naturally, and changes are synchronized with the current transaction automatically. A relationally oriented query system, exposing the full range of SQL's capabilities explicitly, including joins, subqueries, correlation, and most everything else, in terms of the object model. ...
    Downloads: 6 This Week
  • 13
    SQL Explorer

    Easily share data across your company via SQL queries

    SQL Explorer aims to make the flow of data between people fast, simple, and confusion-free. It is a Django-based application that you can add to an existing Django site, or use as a standalone business intelligence tool. Quickly write and share SQL queries in a simple, usable SQL editor, preview the results in the browser, share links, download CSV, JSON, or Excel files (and even expose queries as API endpoints, if desired), and keep the information flowing!
    Downloads: 1 This Week
  • 14
    Finance Database

    This is a database of 300.000+ symbols containing Equities, ETFs, etc.

    ...It, therefore, allows you to obtain a broad overview of sectors, industries, types of investments and much more. The aim of this database is explicitly not to provide up-to-date fundamentals or stock data as those can be obtained with ease (with the help of this database) by using yfinance, FundamentalAnalysis or ThePassiveInvestor.
    Downloads: 4 This Week
  • 15
    Enferno

    Modern Flask framework optimized for AI-assisted development

    Enferno is a framework for building developer-first cloud backends using PostgreSQL and TypeScript. It offers primitives for defining data models, APIs, and access rules directly in code, enabling quick iteration and deployment. Enferno is designed to accelerate SaaS and internal tool development by combining the benefits of traditional backends with developer ergonomics.
    Downloads: 0 This Week
  • 16
    pgai

    A suite of tools to develop RAG, semantic search, and other AI apps

    pgai is a suite of PostgreSQL extensions developed by Timescale to empower developers in building AI applications directly within their databases. It integrates tools for vector storage, advanced indexing, and AI model interactions, facilitating the development of applications like semantic search and Retrieval-Augmented Generation (RAG) without leaving the SQL environment.
    Downloads: 0 This Week
  • 17
    django-pgtrigger

    Write Postgres triggers for your Django models

    ...It allows developers to create database-level logic like automatic field updates, auditing, or validation without writing raw SQL. It’s ideal for teams that want stronger data integrity while keeping logic version-controlled.
    Downloads: 0 This Week
  • 18
    ArangoDB-Community/pyArango

    Python Driver for ArangoDB with built-in validation

    PyArango is a Python driver for ArangoDB, a multi-model NoSQL database. It provides a Pythonic way to interact with ArangoDB, allowing developers to manage collections, execute AQL queries, and integrate ArangoDB's document, graph, and key-value storage models into Python applications.
    Downloads: 0 This Week
  • 19
    ZeusDB Vector Database

    Blazing-fast vector DB with similarity search and metadata filtering

    ...Hybrid search is a core design goal, allowing you to mix vector, keyword, and filter queries in a single request for practical relevance. Observability and safety round out the system, with metrics, tracing, and guardrails to manage recalls, deletions, and privacy-sensitive data at scale.
    Downloads: 1 This Week
  • 20
    EdgeDB

    A next-generation graph-relational database

    ...Powered by the Postgres query engine under the hood, EdgeDB thinks about schema the same way you do: as objects with properties connected by links. It's like a relational database with an object-oriented data model, or a graph database with strict schema. We call it a graph-relational database. The core unit of schema in the graph-relational model is the object type, analogous to a table in SQL. Object types contain properties and can be linked to other object types to form a schema graph.
    Downloads: 1 This Week
  • 21
    pagodo

    Automate Google Hacking Database scraping and searching

    ...To start off, pagodo.py needs a list of all the current Google dorks. The repo contains a dorks/ directory with the dorks that were current when ghdb_scraper.py was last run. It's advised to run ghdb_scraper.py to get the freshest data before running pagodo.py.
    Downloads: 1 This Week
  • 22
    Full Stack FastAPI and PostgreSQL

    Full stack, modern web application generator

    ...REST backend tests based on Pytest, integrated with Docker, so you can test the full API interaction, independent of the database. As it runs in Docker, it can build a new data store from scratch each time.
    Downloads: 0 This Week
  • 23
    sqlit

    A user-friendly TUI for SQL databases

    ...For querying, it emphasizes productivity features like syntax highlighting, searchable query history, and vim-style keybindings so power users can move fast. For exploring data at scale, it can load and inspect very large result sets and provides filtering and fuzzy search to find rows and values efficiently.
    Downloads: 0 This Week
  • 24
    Arctic TimeSeries and Tick store

    High performance datastore for time series and tick data

    ...Has different types of Stores, each with its own benefits. E.g., VersionStore allows you to version and snapshot data, TickStore is for storage and highly efficient retrieval of streaming data, and ChunkStore allows you to chunk and efficiently retrieve ranges of chunks.
    Downloads: 0 This Week
  • 25

    pyDaSSQLiteMan

    An easy way to manage SQLite databases and query CSV files

    Welcome to the project page for pyDaSSQLiteMan (Python Data and SQLite Manager). This is a Python Tkinter application designed to enable easy SQLite database file manipulation and to provide an easy way to run SQL-style queries on CSV files. If you're not downloading the pre-compiled executable, you will need Python to run this app. You can download it from http://www.python.org/.
    Downloads: 0 This Week