Showing 259 open source projects for "python data analysis"

View related business solutions
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    TURF

    TURF

    A modular geospatial engine written in JavaScript

    Turf is a JavaScript library for spatial analysis. It includes traditional spatial operations, helper functions for creating GeoJSON data, and data classification and statistics tools. Turf can be added to your website as a client-side plugin, or you can run Turf server-side with Node.js. Modular, simple-to-understand JavaScript functions that speak GeoJSON. Turf is a collection of small modules, you only need to take what you want to use.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    OpenViking

    OpenViking

    Context database designed specifically for AI Agents

    OpenViking is an open-source context database engineered for efficient indexing and retrieval of large amounts of unstructured or semi-structured context data used by AI applications. It’s primarily designed to serve as a high-performance, scalable backend for storing app context, embeddings, conversational histories, and other textual artifacts that need rapid lookup and semantic search, which makes it especially useful for systems like chatbots or memory-augmented agents. The project is...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    SpeechRecognition

    SpeechRecognition

    Speech recognition module for Python

    ...The first software requirement is Python 2.6, 2.7, or Python 3.3+. This is required to use the library. PyAudio is required if and only if you want to use microphone input (Microphone). PyAudio version 0.2.11+ is required, as earlier versions have known memory management bugs when recording from microphones in certain situations. To hack on this library, first make sure you have all the requirements listed in the "Requirements" section.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    chDB

    chDB

    chDB is an in-process OLAP SQL Engine

    chDB is an in-process SQL OLAP Engine powered by ClickHouse. It is developed by ClickHouse, Inc and open-source contributors.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 5
    Apache AGE

    Apache AGE

    Graph database optimized for fast analysis

    Apache AGE (A Graph Extension) is a PostgreSQL extension that adds support for graph database functionality using the openCypher query language. AGE allows users to store and query graph-structured data alongside traditional relational data in a single database. Built for performance and scalability, it brings the power of graph queries to the robustness of PostgreSQL, making it suitable for use cases like social networks, recommendation systems, and fraud detection.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    django-pgtrigger

    django-pgtrigger

    Write Postgres triggers for your Django models

    django-pgtrigger is a Django library for defining and managing PostgreSQL triggers directly in Python code. It allows developers to create database-level logic like automatic field updates, auditing, or validation without writing raw SQL. It’s ideal for teams that want stronger data integrity while keeping logic version-controlled.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Mikro Orm

    Mikro Orm

    TypeScript ORM for Node.js based on Data Mapper, Unit of Work

    TypeScript ORM for Node.js based on Data Mapper, Unit of Work and Identity Map patterns. MikroORM allows handling transactions automatically. When you call em.flush(), all computed changes are wrapped inside a database transaction. Uses source code analysis so you do not have to repeat yourself when defining entities. Simply define correct TypeScript types and you are good to go!
    Downloads: 4 This Week
    Last Update:
    See Project
  • 8
    Apache Impala

    Apache Impala

    Apache Impala

    ...Like Hive, Impala supports SQL, so you don't have to worry about reinventing the implementation wheel. With Impala, more users, whether using SQL queries or BI applications, can interact with more data through a single repository and metadata stored from source through analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    pgai

    pgai

    A suite of tools to develop RAG, semantic search, and other AI apps

    pgai is a suite of PostgreSQL extensions developed by Timescale to empower developers in building AI applications directly within their databases. It integrates tools for vector storage, advanced indexing, and AI model interactions, facilitating the development of applications like semantic search and Retrieval-Augmented Generation (RAG) without leaving the SQL environment.
    Downloads: 1 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    Enferno

    Enferno

    Modern Flask framework optimized for AI-assisted development

    Enferno is a framework for building developer-first cloud backends using PostgreSQL and TypeScript. It offers primitives for defining data models, APIs, and access rules directly in code, enabling quick iteration and deployment. Enferno is designed to accelerate SaaS and internal tool development by combining the benefits of traditional backends with developer ergonomics.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    SurrealDB

    SurrealDB

    A scalable, distributed, collaborative, document-graph database

    With an SQL-style query language, real-time queries with highly-efficient related data retrieval, advanced security permissions for multi-tenant access, and support for performant analytical workloads, SurrealDB is the next generation serverless database. SurrealDB is the ultimate cloud database for tomorrow's applications. SurrealDB is an innovative NewSQL cloud database, suitable for serverless applications, jamstack applications, single-page applications, and traditional applications. It...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    SQL Explorer

    SQL Explorer

    Easily share data across your company via SQL queries

    SQL Explorer aims to make the flow of data between people fast, simple, and confusion-free. It is a Django-based application that you can add to an existing Django site, or use as a standalone business intelligence tool. Quickly write and share SQL queries in a simple, usable SQL editor, preview the results in the browser, share links, download CSV, JSON, or Excel files (and even expose queries as API endpoints, if desired), and keep the information flowing! Comes with support for multiple...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    Stoolap

    Stoolap

    A Modern Embedded SQL Database written in Rust

    Stoolap is a lightweight, self-hosted analysis and visualization tool designed to help developers and operations teams explore log data, metrics, and debugging information from distributed systems or local applications. Instead of relying on heavy commercial observability platforms, stoolap provides a fast, focused interface where users can filter, query, and visualize time-series data, logs, traces, and error metrics in a cohesive environment.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Dokploy

    Dokploy

    Open Source Alternative to Vercel, Netlify and Heroku

    Streamline your operations with our all-in-one platform, perfect for managing projects, data, and system health with simplicity and efficiency. Simplify your project and data management, ensure robust monitoring, and secure your backups—all without the fuss over minute details. Elevate your infrastructure with tools that offer precise control, detailed monitoring, and enhanced security, ensuring seamless management and robust performance. Streamline your deployments with our PaaS....
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    pagodo

    pagodo

    Automate Google Hacking Database scraping and searching

    pagodo automates Google searching for potentially vulnerable web pages and applications on the Internet. It replaces manually performing Google dork searches with a web GUI browser. There are 2 parts. The first is ghdb_scraper.py that retrieves the latest Google dorks and the second portion is pagodo.py that leverages the information gathered by ghdb_scraper.py. This version of pagodo also supports native HTTP(S) and SOCKS5 application support, so no more wrapping it in a tool like...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 16
    sqlit

    sqlit

    A user friendly TUI for SQL databases

    sqlit is a keyboard-first terminal UI that lets you connect to, browse, and query SQL databases quickly without relying on heavyweight GUI clients. It positions itself as a “lazygit-style” experience for databases, aiming for fast startup, intuitive navigation, and developer-friendly workflows directly inside your terminal. The tool supports a wide range of database providers, so you can use one interface across local databases, remote servers, and cloud-hosted instances rather than juggling...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    ZeusDB Vector Database

    ZeusDB Vector Database

    Blazing-fast vector DB with similarity search and metadata filtering

    ZeusDB is a vector database built for fast, scalable similarity search with strong production ergonomics. It combines high-performance approximate nearest neighbor indexes with clean APIs and metadata filtering so applications can retrieve semantically relevant items at low latency. The storage layer is designed for durability and growth, supporting sharding, replication, and background compaction while keeping query tails predictable. Developers get multiple ingestion paths—batch,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    ArangoDB-Community/pyArango

    ArangoDB-Community/pyArango

    Python Driver for ArangoDB with built-in validation

    PyArango is a Python driver for ArangoDB, a multi-model NoSQL database. It provides a Pythonic way to interact with ArangoDB, allowing developers to manage collections, execute AQL queries, and integrate ArangoDB's document, graph, and key-value storage models into Python applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    manticoresearch

    manticoresearch

    Easy to use open source fast database for search

    Manticore Search is an easy to use open source fast database for search. Modern MPP architecture and smart query parallelization capabilities allow to fully utilize all your CPU cores to lower response time as much as possible, when needed. Powerful and fast full-text searching which works fine for small and big datasets. Columnar storage support via the Manticore Columnar Library for bigger datasets (much bigger than can fit in RAM). SQL-first: Manticore's native syntax is SQL. It speaks...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    An innovative Open Source CEP (Complex Event Processing) engine. It implements the event stream processing as a library embeddable in C++ and Perl. You can think of the Complex Event Processing engine as an in-memory database driven by triggers, or a data-flow machine, or a spreadsheet on steroids (and without the GUI part).
    Downloads: 16 This Week
    Last Update:
    See Project
  • 21

    xsd2pgschema

    Relational database replication tool based on XML Schema

    ...Obtained PostgreSQL database can be optimized at user's discretion. Moreover, differential update, and XPath query evaluation over PostgreSQL are possible. PgSchema server, serialized relational data model server, can be used to speed up the analysis of complex XML Schema. Large XML file can be split through xmlsplitter, a flexible XML splitter based on XPath and StAX.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 22

    pyDaSSQLiteMan

    An easy way to manage SQLite databases and query CSV files

    Welcome to the project page for pyDaSSQLiteMan (Python Data and SQLite Manager). This application is a Python Tkinter application designed to enable easy SQLite database file manipulation, and an easy method to run SQL styled queries on CSV files. If you're not downloading the pre-compiled executable you will need Python to run this app. You can download it from http://www.python.org/ .
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Arctic TimeSeries and Tick store

    Arctic TimeSeries and Tick store

    High performance datastore for time series and tick data

    Arctic is a timeseries/dataframe database that sits atop MongoDB. Arctic supports serialization of a number of datatypes for storage in the mongo document model. Serializes a number of data types eg. Pandas DataFrames, Numpy arrays, Python objects via pickling etc. so you don't have to handle different datatypes manually. Uses LZ4 compression by default on the client side to get big savings on network / disk. Allows you to version different stages of an object and snapshot the state (In some ways similar to git), and allows you to freely experiment and then just revert back the snapshot. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    Tulip
    Tulip is an information visualization framework dedicated to the analysis and visualization of relational data. Tulip aims to provide the developer/user with a complete library/software, supporting the design of interactive information visualization
    Leader badge
    Downloads: 78 This Week
    Last Update:
    See Project
  • 25
    Orbit

    Orbit

    ORBIT : Operating Business Intelligence Tool

    ORBIT : Operating Business Intelligence Tool Making Data Accessible Through Centralized Database Access ORBIT is a business intelligence tool designed to make data accessible to a broad audience within your company by centralizing access to databases. With this application, users can easily create reports, perform interactive analyses, and extract insights from raw data. The application simplifies data handling by providing easy-to-use features for non-technical users while maintaining...
    Downloads: 1 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB