Showing 306 open source projects for "python data analysis"

  • 1
    data-diff

    Efficiently diff rows across two different databases

    We're excited to announce the launch of a new open-source product, data-diff, that makes comparing datasets across databases fast at any scale. data-diff automates data quality checks for data replication and migration. In modern data platforms, data is constantly moving between systems, and at modern data volume and complexity, systems go out of sync all the time. Until now, there has not been any tooling to ensure that the data is correctly copied. Replicating data at scale, across...
    Downloads: 2 This Week
  • 2
    HDF5

    Official HDF5® Library Repository

    HDF5 (Hierarchical Data Format v5) is a widely-used data management library and file format for storing large and complex scientific data sets efficiently.
    Downloads: 26 This Week
  • 3
    Grafana

    The open observability and monitoring platform

    Grafana is an open source analytics and monitoring platform designed for every database. It allows you to visualize and understand your metrics through dynamic and reusable data-driven dashboards that you can create, explore and share with others. Grafana offers a multitude of visualization options and lets you explore your metrics and logs like never before. It can also be set to alert you on your most important metrics. Thousands of companies have been using Grafana to monitor everything...
    Downloads: 22 This Week
  • 4
    Finance Database

    This is a database of 300.000+ symbols containing Equities, ETFs, etc.

    ... unknown. This database tries to solve that. It features 300.000+ symbols containing Equities, ETFs, Funds, Indices, Currencies, Cryptocurrencies and Money Markets. It, therefore, allows you to obtain a broad overview of sectors, industries, types of investments and much more. The aim of this database is explicitly not to provide up-to-date fundamentals or stock data as those can be obtained with ease (with the help of this database) by using yfinance, FundamentalAnalysis or ThePassiveInvestor.
    Downloads: 8 This Week
  • 5
    SpeechRecognition

    Speech recognition module for Python

    Library for performing speech recognition, with support for several engines and APIs, online and offline. Recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file. Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). Listening to a microphone in the background, various other useful recognizer features. The easiest way to install this is using pip...
    Downloads: 9 This Week
  • 6
    DuckDB

    DuckDB is an in-process SQL OLAP Database Management System

    ... data analysis, e.g. joining and aggregating multiple large tables. Concurrent large changes to multiple large tables, e.g. appending rows or adding/removing/updating columns. Large result set transfer to client. For development, DuckDB requires CMake, Python 3, and a C++11-compliant compiler. Run make in the root directory to compile the sources. For development, use make debug to build a non-optimized debug version.
    Downloads: 8 This Week
  • 7
    ArcticDB

    ArcticDB is a high performance, serverless DataFrame database

    Built for the modern Python data science ecosystem, ArcticDB transforms your ability to handle complex real-world data, with incredibly fast performance proven at petabyte scale. ArcticDB is designed with quant users in mind. It allows you to self-manage your data on your preferred infrastructure, giving you the keys to protect your most valuable asset. It supports large concurrent writes to many tables, ensuring datasets can be onboarded fast and in the most convenient format. Scale-out architecture...
    Downloads: 7 This Week
  • 8
    TURF

    A modular geospatial engine written in JavaScript

    Turf is a JavaScript library for spatial analysis. It includes traditional spatial operations, helper functions for creating GeoJSON data, and data classification and statistics tools. Turf can be added to your website as a client-side plugin, or you can run Turf server-side with Node.js. Modular, simple-to-understand JavaScript functions that speak GeoJSON. Turf is a collection of small modules; you only need to take what you want to use. Takes advantage of the newest algorithms and doesn't...
    Downloads: 6 This Week
  • 9
    SuperDuperDB

    Integrate, train and manage any AI models and APIs with your database

    Build and manage AI applications easily without needing to move your data to complex pipelines and specialized vector databases. Integrate AI and vector search directly with your database including real-time inference and model training. Just using Python. A single scalable deployment of all your AI models and APIs which is automatically kept up-to-date as new data is processed immediately. No need to introduce an additional database and duplicate your data to use vector search and build on top...
    Downloads: 4 This Week
  • 10
    SQL Explorer

    Easily share data across your company via SQL queries

    SQL Explorer aims to make the flow of data between people fast, simple, and confusion-free. It is a Django-based application that you can add to an existing Django site, or use as a standalone business intelligence tool. Quickly write and share SQL queries in a simple, usable SQL editor, preview the results in the browser, share links, download CSV, JSON, or Excel files (and even expose queries as API endpoints, if desired), and keep the information flowing! Comes with support for multiple...
    Downloads: 5 This Week
  • 11
    IoTDB

    Apache IoTDB

    Apache IoTDB (Database for Internet of Things) is an IoT native database with high performance for data management and analysis, deployable on the edge and the cloud. Due to its light-weight architecture, high performance and rich feature set together with its deep integration with Apache Hadoop, Spark and Flink, Apache IoTDB can meet the requirements of massive data storage, high-speed data ingestion and complex data analysis in the IoT industrial fields. In the scene of factories...
    Downloads: 3 This Week
  • 12
    pgsync

    Postgres to Elasticsearch/OpenSearch sync

    pgsync is a lightweight tool for syncing Postgres databases across environments, such as from production to staging. It allows selective table syncing, data masking, and parallel copying for fast and safe data migration. pgsync is ideal for developers who need realistic test data without exposing sensitive information.
    Downloads: 3 This Week
  • 13
    osm2pgsql

    Import OpenStreetMap data into a PostgreSQL/PostGIS database

    osm2pgsql is a powerful tool for importing OpenStreetMap (OSM) data into a PostgreSQL/PostGIS database, enabling geographic data analysis and map rendering. It supports several outputs, including the "flex" output, which is configured with Lua scripts to customize how data is loaded and indexed. Designed for performance and scalability, osm2pgsql is widely used in map tile generation pipelines and by GIS professionals handling large-scale spatial datasets.
    Downloads: 3 This Week
  • 14
    pagodo

    Automate Google Hacking Database scraping and searching

    ... if you need proxy support. You can specify multiple proxies to use in a round-robin fashion by providing a comma-separated string of proxies using the -p switch. To start off, pagodo.py needs a list of all the current Google dorks. The repo contains a dorks/ directory with the current dorks when the ghdb_scraper.py was last run. It's advised to run ghdb_scraper.py to get the freshest data before running pagodo.py.
    Downloads: 4 This Week
  • 15
    SQL Notebook

    SQL Notebook — Casual data exploration in SQL

    SQL Notebook is a free Windows application for querying and analyzing data across multiple sources, including SQLite, PostgreSQL, Excel, and CSV files. It combines a SQL editor with a notebook interface, allowing for data exploration, transformation, and visualization in one place. SQL Notebook is ideal for analysts and data enthusiasts.
    Downloads: 2 This Week
  • 16
    SurrealDB

    A scalable, distributed, collaborative, document-graph database

    With an SQL-style query language, real-time queries with highly-efficient related data retrieval, advanced security permissions for multi-tenant access, and support for performant analytical workloads, SurrealDB is the next generation serverless database. SurrealDB is the ultimate cloud database for tomorrow's applications. SurrealDB is an innovative NewSQL cloud database, suitable for serverless applications, jamstack applications, single-page applications, and traditional applications...
    Downloads: 3 This Week
  • 17
    EventStoreDB

    The stream database optimised for event sourcing

    ... Sourcing offers some great benefits over state-oriented systems; the key ones are explained below. An event-sourced system stores your data as a series of immutable events over time, providing one of the strongest audit log options available. All state changes are kept, so it is possible to move systems backward and forward in time, which is extremely valuable for debugging and “what if” analysis.
    Downloads: 2 This Week
  • 18
    EdgeDB

    A next-generation graph-relational database

    ... with properties connected by links. It's like a relational database with an object-oriented data model, or a graph database with strict schema. We call it a graph-relational database. The core unit of schema in the graph-relational model is the object type, analogous to a table in SQL. Object types contain properties and can be linked to other object types to form a schema graph.
    Downloads: 3 This Week
  • 19
    Dokploy

    Open Source Alternative to Vercel, Netlify and Heroku

    Streamline your operations with our all-in-one platform, perfect for managing projects, data, and system health with simplicity and efficiency. Simplify your project and data management, ensure robust monitoring, and secure your backups—all without the fuss over minute details. Elevate your infrastructure with tools that offer precise control, detailed monitoring, and enhanced security, ensuring seamless management and robust performance. Streamline your deployments with our PaaS. Effortlessly...
    Downloads: 3 This Week
  • 20
    chDB

    chDB is an in-process OLAP SQL Engine

    chDB is an in-process SQL OLAP Engine powered by ClickHouse. It is developed by ClickHouse, Inc and open-source contributors.
    Downloads: 2 This Week
  • 21
    Jailer Database Tool

    Database subsetting and relational data browsing tool

    Jailer is a tool for database subsetting, schema and data browsing. It creates small slices from your database and lets you navigate through your database following the relationships. Ideal for creating small samples of test data or for local problem analysis with relevant production data. Creates small slices from your production database and imports the data into your development and test environment (consistent and referentially intact). Improves database performance by removing...
    Downloads: 2 This Week
  • 22
    Blue Whale Configuration Platform

    Blue Whale smart cloud configuration platform

    Built on experience supporting hundreds of Tencent businesses, the platform is compatible with a wide range of complex system architectures; it grew out of operations and maintenance work and is specialized for it. From configuration management to job execution, task scheduling, and self-healing monitoring, through to big data analysis of operations that assists operational decision-making, it covers the full-cycle assurance management of business operations in a comprehensive manner...
    Downloads: 2 This Week
  • 23
    manticoresearch

    Easy to use open source fast database for search

    ... over HTTP and uses the MySQL protocol (you can use your preferred MySQL client). JSON over HTTP: to provide a more programmatic way to manage your data and schemas, Manticore provides an HTTP JSON protocol. Written fully in C++: starts fast, doesn't take much RAM, and low-level optimizations provide good performance. Can sync from MySQL/PostgreSQL/ODBC/XML/CSV out of the box. Not fully ACID-compliant, but supports transactions and binlog for safe writes.
    Downloads: 3 This Week
  • 24
    Bdash

    Simple SQL Client for lightweight data analysis

    Simple SQL Client for lightweight data analysis. You can share the result with gist. Supports MySQL, PostgreSQL (Amazon Redshift), SQLite3, Google BigQuery, Treasure Data, Amazon Athena. You can download and install from Web Site or Releases.
    Downloads: 1 This Week
  • 25
    CodeChecker

    CodeChecker is an analyzer tooling, defect database

    CodeChecker is a static analysis infrastructure built on the LLVM/Clang Static Analyzer toolchain, replacing scan-build in a Linux or macOS (OS X) development environment. Executes Clang-Tidy and Clang Static Analyzer with Cross-Translation Unit analysis, Statistical Analysis (when checkers are available). Creates the JSON compilation database by wiretapping any build process (e.g., CodeChecker log -b "make"). Automatically analyzes GCC cross-compiled projects: detecting GCC or Clang compiler...
    Downloads: 1 This Week