Data Warehousing Software for Linux

View 10 business solutions

Browse free open source Data Warehousing software and projects for Linux below. Use the toggles on the left to filter open source Data Warehousing software by OS, license, language, programming language, and project status.

  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    Greenplum Database

    Greenplum Database

    Massive parallel data platform for analytics, machine learning and AI

    Rapidly create and deploy models for complex applications in cybersecurity, predictive maintenance, risk management, fraud detection, and many other areas. With its unique cost-based query optimizer designed for large-scale data workloads, Greenplum scales interactive and batch-mode analytics to large datasets in the petabytes without degrading query performance and throughput. Based on PostgreSQL, Greenplum provides you with more control over the software you deploy, reducing vendor lock-in, and allowing open influence on product direction. Greenplum reduces data silos by providing you with a single, scale-out environment for converging analytic and operational workloads, like streaming ingestion. All major Greenplum contributions are part of the Greenplum Database project and share the same database core, including the MPP architecture, analytical interfaces, and security capabilities.
    Downloads: 17 This Week
    Last Update:
    See Project
  • 2
    ReportServer Community Edition

    ReportServer Community Edition

    ReportServer is a modern and versatile business intelligence platform

    ReportServer is a modern and versatile open source business intelligence (BI) platform with powerful reporting features. With ReportServer you are not limited to one provider's solutions. ReportServer integrates Jasper, Birt, Mondrian and Excel-based reporting: choose what best suits your needs! The source code is also available in GitHub: https://github.com/infofabrik/reportserver ReportServer scripting samples: https://github.com/infofabrik/reportserver-samples
    Downloads: 72 This Week
    Last Update:
    See Project
  • 3
    DataChain

    DataChain

    AI-data warehouse to enrich, transform and analyze unstructured data

    Datachain enables multimodal API calls and local AI inferences to run in parallel over many samples as chained operations. The resulting datasets can be saved, versioned, and sent directly to PyTorch and TensorFlow for training. Datachain can persist features of Python objects returned by AI models, and enables vectorized analytical operations over them. The typical use cases are data curation, LLM analytics and validation, image segmentation, pose detection, and GenAI alignment. Datachain is especially helpful if batch operations can be optimized – for instance, when synchronous API calls can be parallelized or where an LLM API offers batch processing.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    TensorBase

    TensorBase

    TensorBase is a new big data warehousing with modern efforts

    TensorBase hopes the open source not become a copy game. TensorBase has a clear-cut opposition to fork communities, repeat wheels, or hack traffic for so-called reputations (like Github stars). After thoughts, we decided to temporarily leave the general data warehousing field. For people who want to learn how a database system can be built up, or how to apply modern Rust to the high-performance field, or embed a lightweight data analysis system into your own big one. You can still try, ask or contribute to TensorBase. The committers are still around the community. We will help you in all kinds of interesting things pursued in the project by us and maybe you. We still maintain the project to look forward to meeting more database geniuses in this world, although no new feature will be added in the near future.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    SIDU admin GUI : MySQL PostgreSQL SQLite
    SIDU is a FREE database web GUI written in PHP. Handy and powerful for MySQL + PostgreSQL + SQLite + CUBRID. SIDU is simple and easy DB tool to use! SIDU has all features you need for database admin and web development. It's a great DB admin tool! No installation need. Best database front-end web based tools, cross platform looking no further
    Downloads: 7 This Week
    Last Update:
    See Project
  • 6
    Snowplow Analytics

    Snowplow Analytics

    Enterprise-strength marketing and product analytics platform

    Snowplow is ideal for data teams who want to manage the collection and warehousing of data across all their platforms and products.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    The aoetools are programs for users of the ATA over Ethernet (AoE) network storage protocol, a simple protocol for using storage over an ethernet LAN. The vblade program (storage target) exports a block device using AoE.
    Leader badge
    Downloads: 14 This Week
    Last Update:
    See Project
  • 8
    SQL*Plus Commander

    SQL*Plus Commander

    Text-based user interface to query data on Oracle DB in a smart way

    SQL*Plus Commander is Text-based user interface (TUI) / framework to query data on Oracle DB in a smart way. It consists in a fully customizable script shell for bash and ksh. It executes custom queries or procedures on DB with SQLPlus for Oracle. The results of queries can be browsed in a colorful text interface resulting data from a query can be selected and passed dinamically as parameters for others queries or procedures It may be useful for people who runs frequently a limited number of query and uses the results as parameters for other queries. suggested for DBA activities, log tables browsing. downloaded version contains a demo with HR data model from oracle.com Try it and let me know if you find it useful any idea or suggestion will be appreciated
    Downloads: 24 This Week
    Last Update:
    See Project
  • 9
    DataCleaner

    DataCleaner

    Data quality analysis, profiling, cleansing, duplicate detection +more

    DataCleaner is a data quality analysis application and a solution platform for DQ solutions. It's core is a strong data profiling engine, which is extensible and thereby adds data cleansing, transformations, enrichment, deduplication, matching and merging. Website: http://datacleaner.github.io
    Downloads: 3 This Week
    Last Update:
    See Project
  • Application Monitoring That Won't Slow Your App Down Icon
    Application Monitoring That Won't Slow Your App Down

    AppSignal's Rust-based agent is lightweight and stable. Already running in thousands of production apps.

    Full APM with errors, performance, logs, and uptime monitoring. 99.999% uptime SLA on the platform itself.
    Start Free
  • 10
    Palo ETL Server is a Java based Tool for Extraction, Transformation and Loading of mass data into the Palo OLAP Server. Palo ETL Server is one part of the Palo Suite.
    Leader badge
    Downloads: 8 This Week
    Last Update:
    See Project
  • 11
    gravitino

    gravitino

    Unified metadata lake for data & AI assets.

    Apache Gravitino is a high-performance, geo-distributed, and federated metadata lake. It manages metadata directly in different sources, types, and regions, providing users with unified metadata access for data and AI assets.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 12
    OpenFakt is an open source Java invoicing and billing system using XML definition files for various forms (address book, articles, invoices, stock management). It is language, database and platform independent and aims at small to big companies.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 13
    MailArchiva is a powerful, full featured email archiving (email archiver) and compliance solution for mail systems such as Microsoft Exchange. It stores all incoming, outgoing and internal emails for long term storage. A web based user interface is avail
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    Open Source Data Quality and Profiling

    Open Source Data Quality and Profiling

    World's first open source data quality & data preparation project

    This project is dedicated to open source data quality and data preparation solutions. Data Quality includes profiling, filtering, governance, similarity check, data enrichment alteration, real time alerting, basket analysis, bubble chart Warehouse validation, single customer view etc. defined by Strategy. This tool is developing high performance integrated data management platform which will seamlessly do Data Integration, Data Profiling, Data Quality, Data Preparation, Dummy Data Creation, Meta Data Discovery, Anomaly Discovery, Data Cleansing, Reporting and Analytic. It also had Hadoop ( Big data ) support to move files to/from Hadoop Grid, Create, Load and Profile Hive Tables. This project is also known as "Aggregate Profiler" Resful API for this project is getting built as (Beta Version) https://sourceforge.net/projects/restful-api-for-osdq/ apache spark based data quality is getting built at https://sourceforge.net/projects/apache-spark-osdq/
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    JRubik is a set of tools for navigating and reporting over OLAP Datasources, mainly Mondrian and XMLA servers. This project is based on the components developed by the JPivot project.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 16
    DBBrowser is an open source (GPL license), cross-platform tool which can be used to view the contents of a database. It works with Oracle and MySQL. The user can view, modify, delete records without writing SQL.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 17
    LucidDB is a DBMS optimized for business intelligence. Besides architectural innovations such as column-store, it supports many advanced features from SQL:2003, including SQL/MED and user-defined transformations written in Java.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    JBELT : link your design to the business

    JBELT : link your design to the business

    JBelt :: link your design to the business

    JBELT is a collection of procedure to create a PLM system oriented to connect the CAD application to the ERP database. Based on the web. The engine are JBoss, PostgreSQL and developed in Java on JBoss Seam framework.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    openwms.org
    openwms.org is a modularized warehouse management system split into a core project, a tms module and a wms module running in an OSGi environment to assure high availability and maintainability.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    The Financial Data Access Library provides an abstraction of market data sources. It verifies data consistency and provides transformation to different time frame e.g. daily to weely. Includes SQL, CSI, ASCII and online data sources such as Yahoo!
    Downloads: 3 This Week
    Last Update:
    See Project
  • 21
    OpenReports is a powerful, flexible, and easy to use web reporting solution that provides browser based, parameter driven, dynamic report generation and flexible report scheduling capabilities. Supports JasperReports, JFreeReport, JXLS, and Eclipse BIRT
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    CloverDX

    CloverDX

    Design, automate, operate and publish data pipelines at scale

    Please, visit www.cloverdx.com for latest product versions. Data integration platform; can be used to transform/map/manipulate data in batch and near-realtime modes. Suppors various input/output formats (CSV,FIXLEN,Excel,XML,JSON,Parquet, Avro,EDI/X12,HL7,COBOL,LOTUS, etc.). Connects to RDBMS/JMS/Kafka/SOAP/Rest/LDAP/S3/HTTP/FTP/ZIP/TAR. CloverDX offers 100+ specialized components which can be further extended by creation of "macros" - subgraphs - and libraries, shareable with 3rd parties. Simple data manipulation jobs can be created visually. More complex business logic can be implemented using Clover's domain-specific-language CTL, in Java or languages like Python or JavaScript. Through its DataServices functionality, it allows to quickly turn data pipelines into REST API endpoints. The platform allows to easily scale your data job across multiple cores or nodes/machines. Supports Docker/Kubernetes deployments and offers AWS/Azure images in their respective marketplace
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    Inventory, manufacturing, sales (POS) automation suite that consists of software written in Java 6 and custom developed hardware.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    JMagallanes is an open source end user application for Olap and Dynamic Reports written in Java/J2EE. Combines static reports, pivot table and charts. It reads from SQL and other data sources.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    Alfresco Audit Analysis and Reporting
    With Alfresco Audit Analysis and Reporting (A.A.A.R.) is provided a solution to extract, store and query audit data together with the document/folder informations at a very detailed level, with the goal to be useful to the end-user in a very easy way. To reach that goal, to make the data more friendly for the end-user, the data are published in reports in well-known formats (pdf, Microsoft Excel, csv, etc.) and stored directly in Alfresco as static documents organized in folders, versioned, authorized and published. On the top of the A.A.A.R. solution, the A.A.A.R. Analytics is a set of powerful tools to analyze data in an interactive and customizable way with a user console composed by dashboards, reports and free analysis.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB