Showing 43 open source projects for "python data analysis"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 1
    ArcticDB

    ArcticDB

    ArcticDB is a high performance, serverless DataFrame database

    Built for the modern Python Data Science ecosystem, ArcticDB transforms your ability to handle complex real-world data with an Incredibly fast proven Petabyte scale. ArcticDB is designed with quant users in mind. It allows you to self-manage your data leveraging your preferred infrastructure. Giving you the keys to protect your most valuable asset. Supports large concurrent writes to many tables ensuring datasets can be onboarded fast and in the most convenient format. ...
    Downloads: 40 This Week
    Last Update:
    See Project
  • 2
    chDB

    chDB

    chDB is an in-process OLAP SQL Engine

    chDB is an in-process SQL OLAP Engine powered by ClickHouse. It is developed by ClickHouse, Inc and open-source contributors.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 3
    DuckDB

    DuckDB

    DuckDB is an in-process SQL OLAP Database Management System

    ...For more information on the goals of DuckDB, please refer to the Why DuckDB page on our website. Processing and storing tabular datasets, e.g. from CSV or Parquet files. Interactive data analysis, e.g. Joining & aggregate multiple large tables. Concurrent large changes, to multiple large tables, e.g. appending rows, adding/removing/updating columns. Large result set transfer to client. For development, DuckDB requires CMake, Python3 and a C++11 compliant compiler. Run make in the root directory to compile the sources. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 4
    osm2pgsql

    osm2pgsql

    Import OpenStreetMap data into a PostgreSQL/PostGIS database

    osm2pgsql is a powerful tool for importing OpenStreetMap (OSM) data into a PostgreSQL/PostGIS database, enabling geographic data analysis and map rendering. It supports various rendering schemas like "flex" and "lua" to customize how data is loaded and indexed. Designed for performance and scalability, osm2pgsql is widely used in map tile generation pipelines and by GIS professionals handling large-scale spatial datasets.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    Apache Impala

    Apache Impala

    Apache Impala

    ...Like Hive, Impala supports SQL, so you don't have to worry about reinventing the implementation wheel. With Impala, more users, whether using SQL queries or BI applications, can interact with more data through a single repository and metadata stored from source through analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    manticoresearch

    manticoresearch

    Easy to use open source fast database for search

    Manticore Search is an easy to use open source fast database for search. Modern MPP architecture and smart query parallelization capabilities allow to fully utilize all your CPU cores to lower response time as much as possible, when needed. Powerful and fast full-text searching which works fine for small and big datasets. Columnar storage support via the Manticore Columnar Library for bigger datasets (much bigger than can fit in RAM). SQL-first: Manticore's native syntax is SQL. It speaks...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    An innovative Open Source CEP (Complex Event Processing) engine. It implements the event stream processing as a library embeddable in C++ and Perl. You can think of the Complex Event Processing engine as an in-memory database driven by triggers, or a data-flow machine, or a spreadsheet on steroids (and without the GUI part).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Tulip
    Tulip is an information visualization framework dedicated to the analysis and visualization of relational data. Tulip aims to provide the developer/user with a complete library/software, supporting the design of interactive information visualization
    Leader badge
    Downloads: 26 This Week
    Last Update:
    See Project
  • 9
    odaba

    odaba

    ODABA is an terminology-oriented database management system and develo

    ODABA is an terminology-oriented database management system (TODBMS) on a high conceptual level. It provides a number of enhanced features based on natural language analysis. Terminology orientation allows designing user oriented data models. Enhanced event model and Active Data Link technology support rapid development of GUI applications. Several development, documentation, test and analyzing tools are provided in order to support developers and administrators. .NET interface and a powerful object-orjented script language (OSI) improve development efficiency.
    Downloads: 10 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10
    Cinemagoer

    Cinemagoer

    Python package to retrieve and manage data of the IMDb

    Cinemagoer is a Python package useful to retrieve and manage the data of the IMDb movie database about movies, people, characters and companies. Platform-independent, it can retrieve data from both the IMDb's web server and a local copy of the whole db.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 11
    The Lemur Project

    The Lemur Project

    Search engine and data mining applications and ClueWeb datasets.

    The Lemur Project develops search engines, browser toolbars, text analysis tools, and data resources that support research and development of information retrieval and text mining software, including the Indri search engine in C++, the Galago search engine research framework in Java, the RankLib learning to rank library, ClueWeb09 and ClueWeb12 datasets and the Sifaka data mining application.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 12

    GetData

    Scientific Database Format

    The GetData library provides an API to interface with Dirfile databases. The Dirfile database format is designed to provide a fast, simple, scalable format for storing and reading binary, synchronously-sampled, time-ordered data.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    SSDB

    SSDB

    A fast NoSQL database, an alternative to Redis

    ...SSDB is stable, production-ready and is widely used by many Internet companies including QIHU 360. LevelDB client-server support, written in C/C++. Designed to store collection data. Persistent key-value, key-zset, key-map('hashmap'), key-list storage. Redis clients are supported. Client API supports including C++, PHP, Python, Cpy, Java, nodejs, Ruby, Go(see all). Persistent queue service. Replication(master-slave), load balance. GUI administration tool(phpssdbadmin). Built-in CLI nagios self-checks. Benchmarks are taken by ssdb-bench(SSDB) and redis-benchmark(Redis). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Patlac--Xml2cpp is a totally customizable code generator that generates classes (c++ for now) automatically from a xsd schema. Classes are described in google-ctemplate files and a xml options file allow precise definition of projects.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Tera

    Tera

    An Internet-scale database

    ...Support RAMDISK/SSD/DFS tiered cache. Block cache and Bloom Filters for real-time queries. Multi-type table support (RAMDISK/SSD/DISK table). Easy to use C++/Java/Python/REST-ful API. Column-oriented storage and locality group support. Ranged and hashed sharding strategies.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Smart-M3 is a functional platform that provides a cross domain search extent for triple based information. Smart-M3 enables smart cross domain applications that rely on information level interoperability.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 17

    E-Reader

    Extreem Reader for Cosmological simulation data P-Gadet2 and ART codes

    Extreem Reader for Cosmological simulation data P-Gadet2 and ART codes.It conserves memory by using memory mapped files. Contains some basic analysis tools such a KD-Tree, Minimum Spanning Tree, and rendering module using PMViewer algorithm.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    xlsLib

    C++/C library to construct Excel .xls files in code.

    A multiplatform C++ library for dynamic generation of Excel .xls files containing multiple worksheets. Unlike .csv files, these can be directly opened by Excel and thus provide an excellent way to output large data sets that require further analysis. To see the latest changes, select "Files" and view the README text displayed at the bottom of that pane. IMPORTANT: Major changes are contained in the current SVN source. If you have time please try to use it or the xlslib-package-2.4.0b1.zip archive, and enter bug reports on any problems! ...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 19
    KeplerDB

    KeplerDB

    Timeseries databases management system

    KeplerDB is a temporal database to store time/value entries where the type of value could be integer, float/double, boolean and string. KeplerDB is dedicated to be scalable and to create clusters of server allowing the user to analyse and store massing amount of data to monitor systems like computers, clusters, building and captors or financial systems like markets and accounts. The user can use KeplerDB to make data analysis on enormous amount of data (statistics and modelling).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Darkfall Emulator

    Darkfall Emulator

    Develop an emulated Darkfall Online Server.

    This project is the development of an open source fully emulated Darkfall Online Server and reverse engineering the proprietary game engine that DFO uses, the SF3D engine as well as its source code.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21

    HackerEX

    Free tool designed to make client side hacking easy

    Free tool designed to make client side hacking easy. ALL actions this software takes are user initiated so we take no responsibility for any damaged dealt to software, data or hardware. Codelists found on the web are the responsibility of both the user and the original hacker.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 22
    BlackRay is a persistent in-memory high performance relational database written in C++. It combines regular database features with search engine like performance. Please visit https://forge.softmethod.de for the main development site.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    representation by relations vs data storage. pile unifies the contradicted qualities of tables/trees (databases) and complex networks, order and flexibility. dynamical data generation on call. fast. natively compressing similarities. search across scopes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    An experimental CEP (Complex Event Processing) engine. It implements the event stream processing as a library embeddable in C++ and Perl. Since then it has been renamed to Triceps, so please look at the new location https://sourceforge.net/projects/t
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    A collection of tools for working with the comparative data analysis ontology including import/export facilities for common phylogenetic file formats, and also a triple-store framework.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB