Data Analytics Tools for Linux

Browse free open source Data Analytics tools and projects for Linux below.

  • 1
    Gwyddion

    Scanning probe microscopy data visualisation and analysis

    A data visualization and processing tool for scanning probe microscopy (SPM, i.e. AFM, STM, MFM, SNOM/NSOM, ...) and profilometry data, useful also for general image and 2D data analysis.
    Downloads: 1,863 This Week
  • 2
    SciDAVis

    SciDAVis is a user-friendly data analysis and visualization program primarily aimed at high-quality plotting of scientific data. It strives to combine an intuitive, easy-to-use graphical user interface with powerful features such as Python scriptability.
    Downloads: 1,428 This Week
  • 3
    pandas

    Fast, flexible and powerful Python data analysis toolkit

    pandas is a Python data analysis library that provides high-performance, user-friendly data structures and data analysis tools for the Python programming language. It enables you to carry out entire data analysis workflows in Python without having to switch to a more domain-specific language. With pandas, performance, productivity and collaboration in doing data analysis in Python can significantly increase. pandas is continuously being developed to be a fundamental high-level building block for doing practical, real-world data analysis in Python, as well as a powerful and flexible open source data analysis/manipulation tool for any language.
    Downloads: 73 This Week
  • 4
    CyberChef

    A web app for encryption, encoding, compression and data analysis

    CyberChef, developed by GCHQ, is a versatile web application dubbed the "Cyber Swiss Army Knife." It enables users to perform a wide array of operations on data, including encryption, encoding, compression, and analysis, all within a browser interface.
    Downloads: 39 This Week
  • 5
    Orange Data Mining

    Orange: Interactive data analysis

    Open source machine learning and data visualization. Build data analysis workflows visually, with a large, diverse toolbox. Perform simple data analysis with clever data visualization. Explore statistical distributions, box plots and scatter plots, or dive deeper with decision trees, hierarchical clustering, heatmaps, MDS and linear projections. Even your multidimensional data can become sensible in 2D, especially with clever attribute ranking and selections. Interactive data exploration for rapid qualitative analysis with clean visualizations. The graphical user interface allows you to focus on exploratory data analysis instead of coding, while clever defaults make fast prototyping of a data analysis workflow extremely easy. Place widgets on the canvas, connect them, load your datasets and harvest the insight! When teaching data mining, we like to illustrate rather than only explain.
    Downloads: 39 This Week
  • 6
    Gephi

    Gephi the open graph Viz platform

    Gephi is the leading visualization and exploration software for all kinds of graphs and networks. Gephi is open-source and free. Gephi is an award-winning open-source platform for visualizing and manipulating large graphs. It runs on Windows, Mac OS X and Linux. Localization is available in English, French, Spanish, Japanese, Russian, Brazilian Portuguese, Chinese, Czech and German. Fast: powered by a built-in OpenGL engine, Gephi is able to push the envelope with very large networks. Visualize networks up to a million elements. All actions (e.g. layout, filter, drag) run in real time. Simple: easy to install and get started, with a UI that is centered around the visualization. Like Photoshop™ for graphs. Modular: extend Gephi with plug-ins. The architecture is built on top of the Apache NetBeans Platform and can be extended or reused easily through well-written APIs.
    Downloads: 31 This Week
  • 7
    Metabase

    The simplest, fastest way to share business intelligence and analytics

    Metabase is the easiest way to let everyone in your company access business data and analytics, learn from it and ask questions. Even if you or your colleagues have no experience in SQL, you can easily summarize and visualize your data, share it and let your team ask questions about it. Metabase creates beautiful graphs and charts, with an easy-to-use dashboard where everyone can create, organize and share exceptionally visualized data. It supports a great number of databases, including Postgres, MySQL, Druid, MongoDB, SQLite and more. Setup literally takes 5 minutes.
    Downloads: 24 This Week
  • 8
    Kapacitor

    Open source framework for processing, monitoring, and alerting

    Open source framework for processing, monitoring, and alerting on time series data. Kapacitor is a real-time data processing engine for monitoring and alerting, specifically designed to work with time-series data from InfluxDB.
    Downloads: 16 This Week
  • 9
    scikit-learn

    Machine learning in Python

    scikit-learn is an open source Python module for machine learning built on NumPy, SciPy and matplotlib. It offers simple and efficient tools for predictive data analysis and is reusable in various contexts.
    Downloads: 13 This Week
  • 10
    HEALPix

    Data Analysis, Simulations and Visualization on the Sphere

    Software for pixelization, hierarchical indexation, synthesis, analysis, and visualization of data on the sphere. Please acknowledge HEALPix by quoting the web page http://healpix.sourceforge.net (or https://healpix.sourceforge.io) and the publication: K.M. Gorski et al., 2005, Ap.J., 622, p.759.
    Full software documentation: https://healpix.sourceforge.io/documentation.php
    Wiki pages: https://sourceforge.net/p/healpix/wiki/Home
    Exchanging data with HEALPix (in FITS files): https://sourceforge.net/p/healpix/wiki/Exchanging%20Data%20with%20HEALPix/
    GDL and FL users should read https://sourceforge.net/p/healpix/wiki/HEALPix%20and%20GDL/
    Downloads: 313 This Week
  • 11
    Elasticsearch

    A Distributed RESTful Search Engine

    Elasticsearch is a distributed, RESTful search and analytics engine that lets you store, search and analyze with ease at scale. It lets you perform and combine many types of searches; it scales seamlessly, and offers answers incredibly fast with search results you can rank based on a variety of factors. Elasticsearch can be used for a wide variety of use cases, from maps and metrics to site search and workplace search, and with all data types.
    Downloads: 10 This Week
  • 12
    EEGLAB

    EEGLAB is an open source signal processing environment

    EEGLAB is an open source, MATLAB-based interactive environment for analyzing electrophysiological signals such as EEG and MEG. It incorporates powerful tools for data import, preprocessing, independent component analysis (ICA), time-frequency analysis, artifact rejection, and visualization—all within a GUI framework that also supports scripting and plugin extensions. EEGLAB is an open source signal processing environment for electrophysiological signals running on MATLAB and Octave (command line only for Octave). This folder contains original MATLAB functions from the EEGLAB (formerly ICA/EEG) MATLAB toolbox, all released under the GNU public license (see eeglablicence.txt). See the EEGLAB tutorial and reference paper (URLs given below) for more information. All distributed EEGLAB functions (admin, sigproc, pop, misc).
    Downloads: 9 This Week
  • 13
    LabPlot

    Data Visualization and Analysis

    LabPlot is a FREE, open source and cross-platform Data Visualization and Analysis software accessible to everyone.
    Downloads: 56 This Week
  • 14
    DuckDB

    DuckDB is an in-process SQL OLAP Database Management System

    DuckDB is a high-performance analytical database system. It is designed to be fast, reliable and easy to use. DuckDB provides a rich SQL dialect, with support far beyond basic SQL. DuckDB supports arbitrary and nested correlated subqueries, window functions, collations, complex types (arrays, structs), and more. For more information on the goals of DuckDB, please refer to the Why DuckDB page on our website. Typical use cases include processing and storing tabular datasets, e.g. from CSV or Parquet files; interactive data analysis, e.g. joining and aggregating multiple large tables; concurrent large changes to multiple large tables, e.g. appending rows or adding, removing and updating columns; and large result set transfer to the client. For development, DuckDB requires CMake, Python3 and a C++11 compliant compiler. Run make in the root directory to compile the sources, or use make debug to build a non-optimized debug version.
    Downloads: 8 This Week
  • 15
    F1 Race Replay

    An interactive Formula 1 race visualisation and data analysis tool

    F1 Race Replay is an interactive replay viewer that lets users watch and analyze recorded Formula 1 race sessions with precise control over camera angles, timing, and telemetry overlay, offering a rich experience beyond standard broadcast replays. It ingests official timing and positional data, then renders vehicle movements through track maps and 3D visualizations so fans, analysts, and engineers can review strategy, overtakes, tire degradation effects, and pit stop impacts in detail. Users can scrub through time, jump between cars, and overlay performance graphs such as speed, sector times, and gap differentials to evaluate performance trends across laps. This deep dive capability turns passive viewing into active exploration, empowering enthusiasts and professionals to discover insights usually hidden in raw data. The viewer also supports annotations and bookmark capabilities so users can mark moments of interest for future review or comparison.
    Downloads: 6 This Week
  • 16
    POCO

    Cross-platform C++ libraries for building network applications

    The POCO C++ Libraries are powerful cross-platform C++ libraries for building network- and internet-based applications that run on desktop, server, mobile, IoT, and embedded systems. Whether building automation systems, industrial automation, IoT platforms, air traffic management systems, enterprise IT application and infrastructure management, security and network analytics, automotive infotainment and telematics, financial or healthcare, C++ developers have been trusting the POCO C++ Libraries for 15+ years and have deployed them in millions of devices. Create software for connected embedded devices running Linux, Windows Embedded or QNX. Create cross-platform backends in C++ for iOS and Android applications and combine them with a native or HTML5-based user interface. Create software for IoT devices that talk to cloud backends over HTTP REST APIs. See macchina.io for an IoT platform built with POCO.
    Downloads: 6 This Week
  • 17
    BDS

    Blockchain data parsing and persisting results

    JD Cloud Blockchain Data Service (BDS) is a real-time data aggregation, analysis, and visualization service for chain-like unstructured data from all kinds of third-party blockchains. Splitter is the key module of BDS and provides its data analysis capability. Splitter is responsible for consuming blockchain data from a message queue (Kafka) and inserting it into persistent data storage services (relational database, data warehouse, etc.) for further processing. Before compiling and running BDS, you must install the Go compilation environment locally.
    Downloads: 3 This Week
  • 18
    ECharts

    A powerful, interactive charting and visualization library for browser

    ECharts is a free and open source charting and visualization library that gives you an easy way to add interactive, intuitive, custom charts to your commercial products, projects, presentations and more. It offers a rich set of features that includes rendering ability for ten-million-level data, WeChat and PowerPoint support, multi-dimension data analysis, and more. It also has a number of extensions for various applications. ECharts is written in pure JavaScript, and is based on zrender, a new and lightweight canvas library.
    Downloads: 3 This Week
  • 19
    mrtg

    MRTG - Multi Router Traffic Grapher

    MRTG is a free, open-source tool designed to monitor and measure the traffic load on network links. It generates HTML pages containing graphical representations (PNG images) of network traffic, providing visual insights into bandwidth usage over time. Originally developed to monitor router traffic, MRTG has evolved to graph various network devices and other metrics.
    Downloads: 3 This Week
  • 20
    R packages (maintained by YJLEE)

    R packages for PK/PD modeling, BE/BA, drug stability, IVIVC, etc.

    These R packages are developed for data analysis in PK/PD modeling and simulation, bioequivalence/bioavailability (BE/BA), drug stability, in-vitro and in-vivo correlation (IVIVC), and therapeutic drug monitoring (TDM).
    Downloads: 21 This Week
  • 21
    AlphaPlot

    Interactive scientific graphing and data analysis software.

    AlphaPlot can generate different types of 2D and 3D plots (such as line, scatter, bar, pie, and surface plots) from data that is either imported from ASCII files, entered by hand, or calculated using formulas. The data is held in spreadsheets, which are referred to as tables with column-based data (typically X and Y values for 2D plots) or matrices (for 3D plots). The spreadsheets, as well as graphs and note windows, are gathered in a project and can be organized using folders. The built-in analysis operations include column/row statistics, (de)convolution, FFT and FFT-based filters. The scripting console supports in-place evaluation of mathematical expressions and provides a scripting interface to an ECMAScript-like dynamic scripting language (JavaScript). The GUI of the application uses the Qt toolkit. Periodic test builds are available at http://alphaplot.sourceforge.net/test-build.html
    Downloads: 38 This Week
  • 22
    .NET for Apache Spark

    A free, open-source, and cross-platform big data analytics framework

    .NET for Apache Spark provides high-performance APIs for using Apache Spark from C# and F#. With these .NET APIs, you can access the most popular DataFrame and Spark SQL aspects of Apache Spark, for working with structured data, and Spark Structured Streaming, for working with streaming data. .NET for Apache Spark is compliant with .NET Standard, a formal specification of .NET APIs that are common across .NET implementations. This means you can use .NET for Apache Spark anywhere you write .NET code, allowing you to reuse all the knowledge, skills, code, and libraries you already have as a .NET developer. .NET for Apache Spark runs on Windows, Linux, and macOS using .NET Core, or Windows using .NET Framework. It also runs on all major cloud providers including Azure HDInsight Spark, Amazon EMR Spark, AWS & Azure Databricks.
    Downloads: 2 This Week
  • 23
    IoTDB

    Apache IoTDB

    Apache IoTDB (Database for Internet of Things) is an IoT-native database with high performance for data management and analysis, deployable on the edge and in the cloud. Thanks to its lightweight architecture, high performance and rich feature set, together with its deep integration with Apache Hadoop, Spark and Flink, Apache IoTDB can meet the requirements of massive data storage, high-speed data ingestion and complex data analysis in industrial IoT. In a factory setting, there may be tens of devices on a LAN. IoTDB can be installed on a local controller server in the factory to receive data from those devices. The local controller server (a normal PC or workstation) running IoTDB can persist data and query it through a SQL-like interface. In addition, with the TsFile-Sync tool, TsFiles on the local controller can be transmitted to a data center equipped with an IoTDB instance in the cloud.
    Downloads: 2 This Week
  • 24
    Symfony PropertyInfo

    Extracts information about PHP class properties using metadata

    Symfony PropertyInfo is a component that extracts information about the properties of PHP classes, such as their names, types, visibility, and documentation. It is particularly useful in scenarios like serialization, form generation, and validation, where understanding the structure of an object is essential. PropertyInfo can fetch data from PHPDoc annotations, reflection, and type hints, offering flexible integration with Symfony and other systems.
    Downloads: 2 This Week
  • 25
    Timesketch

    Collaborative forensic timeline analysis

    Timesketch is a collaborative forensic timeline analysis platform used to investigate security incidents by turning diverse evidence into a single, searchable chronology. Analysts ingest logs and artifacts from many sources—endpoints, servers, cloud services—and Timesketch normalizes them into events on a unified timeline. Powerful search, aggregations, and saved views help you pivot quickly, highlight anomalies, and preserve investigative steps for later review. The system supports tagging, sketch notes, and story building so teams can annotate findings and share context without losing the raw data trail. Integrations with popular DFIR pipelines make ingestion repeatable, while role-based access and audit logs support enterprise workflows. By combining scale, collaboration, and reproducibility, Timesketch moves incident response beyond ad-hoc spreadsheets to a durable, team-oriented investigation record.
    Downloads: 2 This Week