Showing 552 open source projects for ".sql"

  • 1
    MCP Snowflake Server

    A Model Context Protocol (MCP) server implementation

    An MCP server implementation that facilitates database interactions with Snowflake, allowing execution of SQL queries and presentation of data insights as resources.
    Downloads: 0 This Week
  • 2
    Supabase MCP Server

    Query MCP enables end-to-end management of Supabase via chat interface

    An open-source MCP server that enables comprehensive management of Supabase projects through natural language interactions, providing capabilities such as SQL execution, schema management, and API integration.
    Downloads: 0 This Week
  • 3
    Maltrail

    Malicious traffic detection system

    Maltrail is a malicious traffic detection system that uses publicly available (black)lists of malicious and/or generally suspicious trails, along with static trails compiled from various AV reports and custom user-defined lists. A trail can be anything from a domain name, URL, or IP address (e.g. 185.130.5.231 for the known attacker) to an HTTP User-Agent header value (e.g. sqlmap, an automatic SQL injection and database takeover tool). It also offers optional advanced heuristic mechanisms that can help discover unknown threats (e.g. new malware). A sensor is a standalone component running on the monitoring node (e.g. Linux platform connected passively to the SPAN/mirroring port or transparently inline on a Linux bridge) or at the standalone machine (e.g. ...
    Downloads: 42 This Week
  • 4
    Smallpond

    A lightweight data processing framework built on DuckDB and 3FS

    ...The idea is to preserve DuckDB’s fast analytics engine but lift it from single-node to multi-node settings, giving you the ability to operate on large datasets (e.g. petabyte scale) without moving to a heavyweight system like Spark. Users write Python-like code (via DataFrame APIs or SQL strings) to express their transformations; behind the scenes, tasks are scheduled (often via Ray) and pushed into DuckDB instances operating on partitioned data. Because the storage layer (3FS) is optimized for random access and high throughput, smallpond can shuffle data, repartition, and manage intermediate results across nodes.
    Downloads: 0 This Week
  • 5
    MCP Timeplus

    Execute SQL queries and manage databases seamlessly with Timeplus

    An MCP server designed for integration with Timeplus, enabling real-time data streaming and analytics through natural language interactions.
    Downloads: 5 This Week
  • 6
    Memobase

    Fast backend for long-term AI user memory via structured profiles

    ...The system focuses on three principal performance metrics: high search performance, reduced large language model (LLM) costs through batch processing techniques, and low latency with minimal SQL operations. Memobase supports integration with existing LLM workflows via APIs and SDKs (including Python, Node, and Go), making it easy to adopt within diverse application stacks.
    Downloads: 3 This Week
  • 7
    EdgeDB

    A next-generation graph-relational database

    ...We call it a graph-relational database. The core unit of schema in the graph-relational model is the object type, analogous to a table in SQL. Object types contain properties and can be linked to other object types to form a schema graph.
    Downloads: 2 This Week
  • 8
    Logfire MCP

    The Logfire MCP Server is here

    The Logfire MCP Server is a Model Context Protocol server that allows AI applications to access OpenTelemetry traces and metrics sent to Logfire. It enables retrieval and analysis of telemetry data, enhancing debugging and observability workflows.
    Downloads: 2 This Week
  • 9
    LlamaParse

    Parse files for optimal RAG

    LlamaParse is a GenAI-native document parser that can parse complex document data for any downstream LLM use case (RAG, agents). Load in 160+ data sources and data formats, from unstructured and semi-structured to structured data (APIs, PDFs, documents, SQL, etc.). Store and index your data for different use cases. Integrate with 40+ vector stores, document stores, graph stores, and SQL database providers.
    Downloads: 0 This Week
  • 10
    sqlite-utils

    Python CLI utility and library for manipulating SQLite databases

    ...As a library, it exposes high-level APIs for inserting records, creating or transforming tables, normalizing schemas, and running migrations that SQLite’s limited ALTER TABLE cannot handle directly. The project also embraces an ecosystem of plugins, so you can add custom SQL functions, extra commands, or UIs (including a terminal UI) via separate packages. Because it’s designed by someone who uses SQLite heavily in real projects, the tool includes many small quality-of-life features.
    Downloads: 3 This Week
  • 11
    web2py

    Free and open source full-stack enterprise framework

    Free open source full-stack framework for rapid development of fast, scalable, secure and portable database-driven web-based applications. Written and programmable in Python (version 3 and 2.7). Everything you need in one package including fast multi-threaded web server, SQL database and web-based interface. No third-party dependencies but works with third-party tools. Create, modify, deploy and manage applications from anywhere using your browser. One web2py instance can run multiple web sites using different databases. Try the interactive demo. Start with some quick examples, then read the manual and the Sphinx docs, watch videos, and join a user group for discussion. ...
    Downloads: 6 This Week
  • 12
    django-pgtrigger

    Write Postgres triggers for your Django models

    django-pgtrigger is a Django library for defining and managing PostgreSQL triggers directly in Python code. It allows developers to create database-level logic like automatic field updates, auditing, or validation without writing raw SQL. It’s ideal for teams that want stronger data integrity while keeping logic version-controlled.
    Downloads: 2 This Week
  • 13
    Haystack

    Haystack is an open source NLP framework to interact with your data

    ...Pick any Transformer model from Hugging Face's Model Hub, experiment, find the one that works. Use Haystack NLP components on top of Elasticsearch, OpenSearch, or plain SQL. Boost search performance with Pinecone, Milvus, FAISS, or Weaviate vector databases, and dense passage retrieval.
    Downloads: 3 This Week
  • 14
    mycli

    CLI for MySQL with auto-completion and syntax highlighting

    mycli is an enhanced command-line client for MySQL and MariaDB that features syntax highlighting and intelligent auto-completion for database queries. Built for developers and DBAs, it improves productivity with features like query history, table suggestions, and colored output. The tool is designed to be both powerful and user-friendly, making interactive database work more efficient and visually intuitive.
    Downloads: 1 This Week
  • 15
    VisiData

    A terminal spreadsheet multitool for discovering and arranging data

    ...VisiData is a free, open-source tool that lets you quickly open, explore, summarize, and analyze datasets in your computer’s terminal. VisiData works with CSV files, Excel spreadsheets, SQL databases, and many other data sources.
    Downloads: 6 This Week
  • 16
    ERAlchemy

    Entity Relation Diagrams generation tool

    ERAlchemy is a tool that generates Entity-Relationship (ER) diagrams from databases or SQLAlchemy models and vice versa. It’s useful for database documentation, reverse engineering, and understanding complex schemas. ERAlchemy can export diagrams in formats like Graphviz and Mermaid, making it easy to include in reports or markdown files.
    Downloads: 1 This Week
  • 17
    SageMaker Spark Container

    Docker image used to run data processing workloads

    ...It provides high-level APIs in Scala, Java, Python, and R, and an optimized engine that supports general computation graphs for data analysis. It also supports a rich set of higher-level tools including Spark SQL for SQL and DataFrames, MLlib for machine learning, GraphX for graph processing, and Structured Streaming for stream processing. The SageMaker Spark Container is a Docker image used to run batch data processing workloads on Amazon SageMaker using the Apache Spark framework. The container images in this repository are used to build the pre-built container images that are used when running Spark jobs on Amazon SageMaker using the SageMaker Python SDK. ...
    Downloads: 1 This Week
  • 18
    AWS SDK for pandas

    Easy integration with Athena, Glue, Redshift, Timestream, Neptune

    ...The library abstracts efficient patterns like partitioning, compression, and vectorized I/O so you get performant data lake operations without hand-rolling boilerplate. It also supports Redshift, OpenSearch, and other services, enabling ETL tasks that blend SQL engines and Python transformations. Operational helpers handle IAM, sessions, and concurrency while exposing knobs for encryption, versioning, and catalog consistency. The result is a productive workflow that keeps your analytics in Python while leveraging AWS-native storage and query engines at scale.
    Downloads: 0 This Week
  • 19
    DataProfiler

    Extract schema, statistics and entities from datasets

    DataProfiler is an AI-powered tool for automatic data analysis and profiling, designed to detect patterns, anomalies, and schema inconsistencies in structured and unstructured datasets. It is a Python library designed to make data analysis, monitoring, and sensitive data detection easy. With a single command, the library automatically formats and loads files into a DataFrame. When profiling the data, the library identifies the schema, statistics, entities (PII / NPI), and...
    Downloads: 2 This Week
  • 20
    MetricFlow

    MetricFlow allows you to define, build, and maintain metrics in code

    ...It works alongside a data stack—typically built with dbt—and allows you to express metrics as YAML‐based definitions tied to semantic models and dimension tables, rather than embedding logic ad-hoc across many dashboards or scripts. When a user or tool requests a metric (e.g., “monthly revenue by region”), MetricFlow generates optimized, warehouse-specific SQL to compute that metric, handling joins, filters, time grains, offsets, and other complexities under the hood. Because metric definitions live centrally, you avoid duplication across teams and tools, reduce risk of inconsistent numbers, and make it easier to audit and evolve the logic over time. The project emphasizes explainability, performance and portability: you define metrics once and then they can be consumed in BI tools, notebooks, or even AI/agent-driven workflows.
    Downloads: 0 This Week
  • 21
    peewee

    A small, expressive ORM that supports PostgreSQL, MySQL, and SQLite

    ...The real strength of our database is in how it allows us to retrieve data through queries. Relational databases are excellent for making ad-hoc queries. Peewee provides a magical helper fn(), which can be used to call any SQL function.
    Downloads: 2 This Week
  • 22
    TinyDB

    Document-oriented database optimized for you

    TinyDB is a lightweight document-oriented database optimized for your happiness :) It's written in pure Python and has no external dependencies. The target is small apps that would be blown away by an SQL database or an external database server. The current source code has 1800 lines of code (with about 40% documentation) and 1600 lines of tests. Like MongoDB, you can store any document (represented as dict) in TinyDB. TinyDB is designed to be simple and fun to use by providing a simple and clean API. TinyDB neither needs an external server (as e.g. ...
    Downloads: 1 This Week
  • 23
    gusty

    Making DAG construction easier

    gusty allows you to control your Airflow DAGs, Task Groups, and Tasks with greater ease. gusty manages collections of tasks, represented as any number of YAML, Python, SQL, Jupyter Notebook, or R Markdown files. A directory of task files is instantly rendered into a DAG by passing a file path to gusty's create_dag function. gusty also manages dependencies (within one DAG) and external dependencies (dependencies on tasks in other DAGs) for each task file you define. All you have to do is provide a list of dependencies or external_dependencies inside of a task file, and gusty will automatically set each task's dependencies and create external task sensors for any external dependencies listed. gusty works with both Airflow 1.x and Airflow 2.x, and has even more features, all of which aim to make the creation, management, and iteration of DAGs more fluid, so that you can intuitively design your DAG and build your tasks.
    Downloads: 2 This Week
  • 24
    LlamaIndex

    Central interface to connect your LLMs with external data

    LlamaIndex (GPT Index) is a project that provides a central interface to connect your LLMs with external data. LlamaIndex is a simple, flexible interface between your external data and LLMs. It provides the following tools in an easy-to-use fashion. Provides indices over your unstructured and structured data for use with LLMs. These indices help to abstract away common boilerplate and pain points for in-context learning. Dealing with prompt limitations (e.g. 4096 tokens for Davinci) when...
    Downloads: 2 This Week
  • 25
    peepDB

    CLI tool and Python library to inspect databases fast

    peepDB is an open-source command-line tool and Python library designed for developers and database administrators who need a fast and efficient way to inspect their database tables without writing SQL queries. With support for MySQL, PostgreSQL, and MariaDB, peepDB is lightweight, secure, and incredibly easy to use.
    Downloads: 0 This Week
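
The sqlite-utils entry above mentions migrations that SQLite's limited ALTER TABLE cannot handle directly. As a rough illustration of the general workaround (rebuild the table, copy the rows, swap the names), here is a minimal sketch using only Python's standard `sqlite3` module. It is not the sqlite-utils API; the helper name `change_column_type` and the `items` table are hypothetical, and the sketch deliberately ignores constraints, defaults, and indexes.

```python
import sqlite3


def change_column_type(conn, table, column, new_type):
    """Hypothetical helper: rebuild `table` so that `column` gets `new_type`.

    Simplified sketch: constraints, defaults, and indexes are not preserved.
    """
    # PRAGMA table_info rows are (cid, name, type, notnull, dflt_value, pk).
    cols = conn.execute(f'PRAGMA table_info("{table}")').fetchall()
    names = [f'"{name}"' for _cid, name, *_rest in cols]
    defs = [
        f'"{name}" {new_type if name == column else ctype}'
        for _cid, name, ctype, *_rest in cols
    ]
    tmp = f"{table}_new"
    col_list = ", ".join(names)
    # `with conn` commits on success and rolls back an open transaction on error.
    with conn:
        conn.execute(f'CREATE TABLE "{tmp}" ({", ".join(defs)})')
        conn.execute(
            f'INSERT INTO "{tmp}" ({col_list}) SELECT {col_list} FROM "{table}"'
        )
        conn.execute(f'DROP TABLE "{table}"')
        conn.execute(f'ALTER TABLE "{tmp}" RENAME TO "{table}"')


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE items (id INTEGER, price TEXT)")
    conn.execute("INSERT INTO items VALUES (1, '9.5')")
    change_column_type(conn, "items", "price", "REAL")
    print(conn.execute("PRAGMA table_info(items)").fetchall())
```

A tool such as sqlite-utils would be expected to handle the details this sketch skips; the point here is only the create, copy, drop, rename sequence.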