Showing 306 open source projects for "python data analysis"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 1
    pagodo

    pagodo

    Automate Google Hacking Database scraping and searching

    ... if you need proxy support. You can specify multiple proxies to use in a round-robin fashion by providing a comma-separated string of proxies using the -p switch. To start off, pagodo.py needs a list of all the current Google dorks. The repo contains a dorks/ directory with the current dorks when the ghdb_scraper.py was last run. It's advised to run ghdb_scraper.py to get the freshest data before running pagodo.py.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    DictDataBase

    DictDataBase

    A python NoSQL dictionary database, with concurrent access and ACID

    DictDataBase (DictDB) is a lightweight, Python-based in-memory database that uses dictionaries as its primary data structure. It provides a simple and efficient way to store, retrieve, and manipulate data without requiring an external database server. DictDB is useful for applications needing fast lookups, temporary storage, or embedded database functionalities.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    CogDB

    CogDB

    Micro Graph Database for Python Applications

    Cog is a lightweight, embedded graph database for Go that provides a simple interface for storing and querying graph-based data structures, making it useful for knowledge representation and graph analytics.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    HugeGraph

    HugeGraph

    A graph database that supports more than 100+ billion data

    HugeGraph is a convenient, efficient, and adaptable graph database compatible with the Apache TinkerPop3 framework and the Gremlin query language. HugeGraph supports fast import performance in the case of more than 10 billion Vertices and Edges Graph, millisecond-level OLTP query capability, and can be integrated into big data platforms like Hadoop or Spark for OLAP analysis. The main scenarios of HugeGraph include correlation search, fraud detection, and knowledge graph. Not only supports...
    Downloads: 1 This Week
    Last Update:
    See Project
  • MongoDB 8.0 on Atlas | Run anywhere Icon
    MongoDB 8.0 on Atlas | Run anywhere

    Now available in even more cloud regions across AWS, Azure, and Google Cloud.

    MongoDB 8.0 brings enhanced performance and flexibility to Atlas—with expanded availability across 125+ regions globally. Build modern apps anywhere your users are, with the power of a modern database behind you.
    Learn More
  • 5
    Apache Impala

    Apache Impala

    Apache Impala

    ..., and resource management frameworks as your Hadoop deployment, with no redundant infrastructure or data conversion/duplication. For Apache Hive users, Impala utilizes the same metadata and ODBC driver. Like Hive, Impala supports SQL, so you don't have to worry about reinventing the implementation wheel. With Impala, more users, whether using SQL queries or BI applications, can interact with more data through a single repository and metadata stored from source through analysis.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    Arctic TimeSeries and Tick store

    Arctic TimeSeries and Tick store

    High performance datastore for time series and tick data

    Arctic is a timeseries/dataframe database that sits atop MongoDB. Arctic supports serialization of a number of datatypes for storage in the mongo document model. Serializes a number of data types eg. Pandas DataFrames, Numpy arrays, Python objects via pickling etc. so you don't have to handle different datatypes manually. Uses LZ4 compression by default on the client side to get big savings on network / disk. Allows you to version different stages of an object and snapshot the state (In some...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    pgai

    pgai

    A suite of tools to develop RAG, semantic search, and other AI apps

    pgai is a suite of PostgreSQL extensions developed by Timescale to empower developers in building AI applications directly within their databases. It integrates tools for vector storage, advanced indexing, and AI model interactions, facilitating the development of applications like semantic search and Retrieval-Augmented Generation (RAG) without leaving the SQL environment.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    SQLAlchemy

    SQLAlchemy

    The Database Toolkit for Python

    SQLAlchemy is the Python SQL toolkit and Object Relational Mapper that gives application developers the full power and flexibility of SQL. SQLAlchemy provides a full suite of well known enterprise-level persistence patterns, designed for efficient and high-performing database access, adapted into a simple and Pythonic domain language. An industrial strength ORM, built from the core on the identity map, unit of work, and data mapper patterns. These patterns allow the transparent persistence...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    SQLModel

    SQLModel

    SQL databases in Python, designed for simplicity, compatibility

    SQLModel, SQL databases in Python, designed for simplicity, compatibility, and robustness. SQLModel is a library for interacting with SQL databases from Python code, with Python objects. It is designed to be intuitive, easy to use, highly compatible, and robust. SQLModel is based on Python-type annotations, and powered by Pydantic and SQLAlchemy.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Secure remote access solution to your private network, in the cloud or on-prem. Icon
    Secure remote access solution to your private network, in the cloud or on-prem.

    Deliver secure remote access with OpenVPN.

    OpenVPN is here to bring simple, flexible, and cost-effective secure remote access to companies of all sizes, regardless of where their resources are located.
    Get started — no credit card required.
  • 10
    MobilityDB

    MobilityDB

    MobilityDB is a geospatial trajectory data management & analysis

    MobilityDB is an extension for PostgreSQL and PostGIS that adds support for managing and querying moving object data, such as vehicles, vessels, or wildlife. It allows developers to store temporal and spatiotemporal data types and execute complex mobility analytics using SQL. It is particularly suited for GIS applications and location-based services.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Cloudberry

    Cloudberry

    One advanced and mature open-source MPP

    Apache Cloudberry is a distributed real-time analytics engine designed for querying massive social media datasets. It integrates with Apache AsterixDB and supports efficient ad-hoc queries and aggregations across large volumes of data. Cloudberry is especially useful for dashboards, trend analysis, and time-series social data exploration.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    CursusDB

    CursusDB

    CursusDB is an open-source distributed in-memory database

    CursusDB is a time-series database built for high-performance analytics and data processing, optimized for handling large volumes of sequential data efficiently.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    ArangoDB-Community/pyArango

    ArangoDB-Community/pyArango

    Python Driver for ArangoDB with built-in validation

    PyArango is a Python driver for ArangoDB, a multi-model NoSQL database. It provides a Pythonic way to interact with ArangoDB, allowing developers to manage collections, execute AQL queries, and integrate ArangoDB's document, graph, and key-value storage models into Python applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    django-pgtrigger

    django-pgtrigger

    Write Postgres triggers for your Django models

    django-pgtrigger is a Django library for defining and managing PostgreSQL triggers directly in Python code. It allows developers to create database-level logic like automatic field updates, auditing, or validation without writing raw SQL. It’s ideal for teams that want stronger data integrity while keeping logic version-controlled.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Bracket

    Bracket

    Selfhosted tournament system

    Bracket is an open-source tool that tracks and manages data access across your PostgreSQL database. It provides visibility into which parts of your codebase are accessing which tables and columns, enabling data governance, security auditing, and architectural insights. Bracket is particularly helpful for growing teams needing better observability in complex applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Apache AGE

    Apache AGE

    Graph database optimized for fast analysis

    Apache AGE (A Graph Extension) is a PostgreSQL extension that adds support for graph database functionality using the openCypher query language. AGE allows users to store and query graph-structured data alongside traditional relational data in a single database. Built for performance and scalability, it brings the power of graph queries to the robustness of PostgreSQL, making it suitable for use cases like social networks, recommendation systems, and fraud detection.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Enferno

    Enferno

    Modern Flask framework optimized for AI-assisted development

    Enferno is a framework for building developer-first cloud backends using PostgreSQL and TypeScript. It offers primitives for defining data models, APIs, and access rules directly in code, enabling quick iteration and deployment. Enferno is designed to accelerate SaaS and internal tool development by combining the benefits of traditional backends with developer ergonomics.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Mikro Orm

    Mikro Orm

    TypeScript ORM for Node.js based on Data Mapper, Unit of Work

    TypeScript ORM for Node.js based on Data Mapper, Unit of Work and Identity Map patterns. MikroORM allows handling transactions automatically. When you call em.flush(), all computed changes are wrapped inside a database transaction. Uses source code analysis so you do not have to repeat yourself when defining entities. Simply define correct TypeScript types and you are good to go! Supports MongoDB, MySQL, MariaDB, PostgreSQL and SQLite databases, and more can be supported via custom drivers...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Cinemagoer

    Cinemagoer

    Python package useful to retrieve and manage the data of IMDb

    Cinemagoer (previously known as IMDbPY) is a Python package for retrieving and managing the data of the IMDb movie database about movies and people. You can use the search_movie method of the access object to search for movies with a given (or similar) title. Similarly, you can search for people and companies using the search_person and the search_company methods. Movie, person, and company objects have id attributes which -when fetched through the IMDb web server- store the IMDb id...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    Tulip
    Tulip is an information visualization framework dedicated to the analysis and visualization of relational data. Tulip aims to provide the developer/user with a complete library/software, supporting the design of interactive information visualization
    Leader badge
    Downloads: 71 This Week
    Last Update:
    See Project
  • 21
    DiLu Converter

    DiLu Converter

    A powerful Import and Export tool between Excel and Database

    DiLu Converter is a powerful Excel import and export tool that supports more than 10 databases such as MySQL, Oracle, SQL Server, PostgreSQL and so on. The supported file formats include xls, xlsx, xlsm, xlsb, csv, txt, xml, json, and dbf. Its native user interface brings users a comfortable experience of simplified Excel import and export, making Excel import and export easier than ever before. Whether you want one-click, batch, and personalized import and export, or want to use...
    Downloads: 23 This Week
    Last Update:
    See Project
  • 22

    xsd2pgschema

    Relational database replication tool based on XML Schema

    ... database can be optimized at user's discretion. Moreover, differential update, and XPath query evaluation over PostgreSQL are possible. PgSchema server, serialized relational data model server, can be used to speed up the analysis of complex XML Schema. Large XML file can be split through xmlsplitter, a flexible XML splitter based on XPath and StAX.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 23
    Orbit

    Orbit

    ORBIT : Operating Business Intelligence Tool

    ORBIT : Operating Business Intelligence Tool Making Data Accessible Through Centralized Database Access ORBIT is a business intelligence tool designed to make data accessible to a broad audience within your company by centralizing access to databases. With this application, users can easily create reports, perform interactive analyses, and extract insights from raw data. The application simplifies data handling by providing easy-to-use features for non-technical users while maintaining...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 24
    An innovative Open Source CEP (Complex Event Processing) engine. It implements the event stream processing as a library embeddable in C++ and Perl. You can think of the Complex Event Processing engine as an in-memory database driven by triggers, or a data-flow machine, or a spreadsheet on steroids (and without the GUI part).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    DBD2

    A multi-threaded, multi-database tcp-based database insertion app.

    DBD2 inserts database records from TCP input. It is a companion project to Syslgod2 and an application in its own right. It ships with a backgrounding client for generic use. The client sends whatever information the user puts on the command-line to the back-end daemon via TCP. Upon successful transmission, the client exits. On failure, it backs off and retries until its life-time timer expires. The back-end daemon accepts data from Syslogd2 or its own client and inserts that data...
    Downloads: 0 This Week
    Last Update:
    See Project
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.