Showing 28 open source projects for "python data analysis"

  • 1
    Apache Doris

    MPP-based interactive SQL data warehousing for reporting and analysis

    Apache Doris is a modern MPP analytical database. It provides sub-second queries and efficient real-time data analysis. With its distributed architecture, it supports datasets of up to 10 PB and remains easy to operate. Apache Doris can meet various data analysis demands, including historical data reports, real-time data analysis, interactive data analysis, and exploratory data analysis. Make your data analysis easier! It supports standard SQL and is compatible with the MySQL protocol. ...
    Downloads: 0 This Week
  • 2
    DataChain

    AI-data warehouse to enrich, transform and analyze unstructured data

    DataChain enables multimodal API calls and local AI inference to run in parallel over many samples as chained operations. The resulting datasets can be saved, versioned, and sent directly to PyTorch and TensorFlow for training. DataChain can persist features of Python objects returned by AI models and enables vectorized analytical operations over them. Typical use cases are data curation, LLM analytics and validation, image segmentation, pose detection, and GenAI alignment. DataChain is especially helpful when batch operations can be optimized, for instance when synchronous API calls can be parallelized or when an LLM API offers batch processing.
    Downloads: 4 This Week
  • 3
    gravitino

    Unified metadata lake for data & AI assets.

    Apache Gravitino is a high-performance, geo-distributed, and federated metadata lake. It manages metadata directly in different sources, types, and regions, providing users with unified metadata access for data and AI assets.
    Downloads: 9 This Week
  • 4
    InterMine

    A powerful open source data warehouse system

    InterMine is an open-source data warehouse system tailored for the integration and analysis of complex biological data. It enables researchers to create databases from diverse data sources and provides sophisticated web query tools for data exploration.
    Downloads: 0 This Week
  • 5
    TensorBase

    TensorBase is a new big data warehouse built with a modern approach

    ...TensorBase is firmly opposed to forking communities, reinventing the wheel, or gaming traffic for so-called reputation (such as GitHub stars). After some thought, we decided to temporarily leave the general data warehousing field. If you want to learn how a database system is built, how to apply modern Rust to high-performance computing, or how to embed a lightweight data analysis system into a larger one of your own, you can still try, ask about, or contribute to TensorBase. The committers are still active in the community, and we will help with the interesting problems that we, and perhaps you, pursue in the project. ...
    Downloads: 2 This Week
  • 6
    Open Source Data Quality and Profiling

    World's first open source data quality & data preparation project

    This project is dedicated to open source data quality and data preparation solutions. Data quality includes profiling, filtering, governance, similarity checks, data enrichment alteration, real-time alerting, basket analysis, bubble chart warehouse validation, single customer view, etc., as defined by strategy. The project is developing a high-performance integrated data management platform that will seamlessly handle data integration, data profiling, data quality, data preparation, dummy data creation, metadata discovery, anomaly discovery, data cleansing, reporting, and analytics. ...
    Downloads: 1 This Week
  • 7
    CloverDX

    Design, automate, operate and publish data pipelines at scale

    ...Simple data manipulation jobs can be created visually. More complex business logic can be implemented using Clover's domain-specific language, CTL, in Java, or in languages like Python or JavaScript. Its DataServices functionality lets you quickly turn data pipelines into REST API endpoints. The platform lets you easily scale data jobs across multiple cores or nodes/machines.
    Downloads: 1 This Week
  • 8
    DataCleaner

    Data quality analysis, profiling, cleansing, duplicate detection +more

    DataCleaner is a data quality analysis application and a platform for DQ solutions. Its core is a strong, extensible data profiling engine, to which data cleansing, transformations, enrichment, deduplication, matching, and merging can be added. Website: http://datacleaner.github.io
    Downloads: 2 This Week
  • 9
    A command-line interface for the OpenGroupware Coils collaboration platform and OIE workflow solution. Using snurtle, you can manage and examine the content of your Coils server, as well as manage workflows, all from a convenient command-line interface.
    Downloads: 0 This Week
  • 10

    Workshop resource manager

    Easy and fast tool for managing home workshop resources

    Storekeeper is a small tool that helps you keep an eye on your resources in a home lab or workshop. It can be used in several places and, thanks to its single-file and synchronization options, can merge data between users.
    Downloads: 0 This Week
  • 11

    ebay mine

    OO PHP libraries for mining data from eBay into a MySQL database

    I started this project for use in a new business and decided that the development time for the end result was going to be too long. This is basically an OO PHP API to retrieve data from eBay to be stored in a MySQL database for analysis. In a test run I retrieved over 804,000 completed item auction records from the consumer electronics category on eBay.
    Downloads: 0 This Week
  • 12
    A tool that parses SQL SELECT statements and generates a diagram. The diagram shows parts of the underlying SQL directly, for example x=30, GROUP BY (year), HAVING MIN(age) > 18. This makes Cartesian joins and loops easy to spot.
    Downloads: 0 This Week
  • 13
    Graphical on-screen representation of events using SDL. External applications send a simple message, and visual event renders an animation on screen depicting the event taking place.
    Downloads: 0 This Week
  • 14
    Long name: Operational Data Business Express. ODBExpress is a reporting suite for business intelligence; it includes reporting, analysis (OLAP), etc.
    Downloads: 0 This Week
  • 15
    SPASE Model is a collection of tools for working with the structured data model information. Tools can convert the relational version of the data model into various expressions, including XSD, XMI and PDF documentation.
    Downloads: 0 This Week
  • 16
    Datacleaning Open Source
    A group of subprojects for data cleaning, mainly as a step in a data mining project. Visit www.datacleaningopensource.com to review our current applications or to add yours. NOTE: PROGRAMMING SKILLS ARE REQUIRED.
    Downloads: 0 This Week
  • 17
    Spire is a print stream converter/manipulator. It can transform print streams from Metacode to PostScript and from PostScript to Metacode. PCL support will be added soon. Spire is also capable of sorting documents (think postal sortation) and adding barcodes.
    Downloads: 0 This Week
  • 18
    Drop is a graphical interface for the GPL project Wets (hosted on SourceForge). Wets is ETL software.
    Downloads: 0 This Week
  • 19
    SYRAH aims to discover and represent concepts expressed in natural languages. Keywords: NLP, lemma, lexicon, Italian, network, semantics, clustering
    Downloads: 0 This Week
  • 20
    Openminer is a data mining engine developed in Java for analyzing datasets with data mining methods. With Openminer, you can discover knowledge of interest that is hidden in the raw data.
    Downloads: 0 This Week
  • 21
    Use Pentaho open source business intelligence tools and MySQL to collect and distribute web analytics (clickstream) data. Extract data from logs, load database tables, and present the information in dashboards, analysis cubes, and reports for business users. This project has moved to GitHub: https://github.com/cjlavigne/breadboard
    Downloads: 0 This Week
  • 22
    Trauma registry suite: a data collection application and server scripts to build a trauma data warehouse and perform web-based analysis and reporting. Cross-platform compatible with Windows, Apple, Unix, and Linux.
    Downloads: 1 This Week
  • 23
    Monitors web pages for changes and emails the differences to subscribers. Supports user accounts and registration. PHP/MySQL.
    Downloads: 0 This Week
  • 24
    FormatCheck screens flat files for violations in the format of the data. It uses a set of XML files that define the rules for each file format. The Swing front end lets the user run the verification and view and print the errors.
    Downloads: 0 This Week
  • 25
    Utilities for managing images of documents and their summaries produced during the discovery process for litigation.
    Downloads: 0 This Week