Best Data Management Software for Linux - Page 16

Compare the Top Data Management Software for Linux as of October 2025 - Page 16

  • 1
    Apache Gobblin

    Apache Software Foundation

    A distributed data integration framework that simplifies common aspects of Big Data integration such as data ingestion, replication, organization, and lifecycle management for both streaming and batch data ecosystems. Gobblin supports several deployment modes. It runs as a standalone application on a single box, and also supports an embedded mode. It runs as a MapReduce application on multiple Hadoop versions, with Azkaban support for launching MapReduce jobs. It runs as a standalone cluster with primary and worker nodes; this mode supports high availability and can also run on bare metal. It runs as an elastic cluster on public cloud, a mode that also supports high availability. Gobblin as it exists today is a framework that can be used to build different data integration applications such as ingest and replication. Each of these applications is typically configured as a separate job and executed through a scheduler like Azkaban.
  • 2
    Feast

    Tecton

    Make your offline data available for real-time predictions without having to build custom pipelines. Ensure data consistency between offline training and online inference, eliminating train-serve skew. Standardize data engineering workflows under one consistent framework. Teams use Feast as the foundation of their internal ML platforms. Feast doesn’t require the deployment and management of dedicated infrastructure. Instead, it reuses existing infrastructure and spins up new resources when needed. Feast is a good fit if you are not looking for a managed solution and are willing to manage and maintain your own implementation, you have engineers who are able to support the implementation and management of Feast, you want to run pipelines that transform raw data into features in a separate system and integrate with it, and you have unique requirements and want to build on top of an open source solution.
  • 3
    DataOps DataFlow
    A holistic component-based platform for automating Data Reconciliation tests in modern Data Lake and Cloud Data Migration projects using Apache Spark. DataOps DataFlow is a modern, web browser-based solution for automating the testing of ETL, Data Warehouse, and Data Migration projects. Use DataFlow to inject data from any of the varied data sources, compare data, and load differences to S3 or a database. It is fast and easy to set up, so you can create and run a dataflow in minutes. A best-in-class tool for Big Data testing, DataOps DataFlow can integrate with all modern and advanced data sources including RDBMS, NoSQL, Cloud, and File-Based.
    Starting Price: Contact us
  • 4
    Semarchy xDI
    Experience Semarchy’s flexible unified data platform to empower better business decisions enterprise-wide. Integrate all your data with xDI, the high-performance, agile, and extensible data integration solution for all styles and use cases. Its single technology federates all forms of data integration, and mapping converts business rules into deployable code. xDI has an extensible and open architecture supporting on-premise, cloud, hybrid, and multi-cloud environments.
  • 5
    TABEX4

    BOI Software

    TABEX4 runs on all common operating systems and is applicable throughout the company, on both mainframe and server systems. Tables can be maintained efficiently and safely, independent of platform or database. TABEX4 supports import of table data from other software products and memory forms through optimized APIs. Export is possible in diverse ways as well, e.g. PDF, e-mail, and other data or storage formats. Our TABEX4 FAQs offer an in-depth overview of important TABEX4 topics. In the TABEX4 Wiki you will gain expert knowledge about technical questions and challenges. Master public audits smoothly: TABEX4 places absolute priority on transparency and security in processing master data and control data. The TABEX4 Relational Bridge extends TABEX4 with interfaces to relational databases and makes the entire set of TABEX4 functions available for RDBs.
  • 6
    DataSort

    Inventale

    A portal based on mobile and enriched third-party data that allows one to: reconstruct users’ sociodemographic profiles (gender, age); develop user segments (e.g., young parents, frequent travellers, blue-collar workers, university students, wealthy residents); provide analytics according to clients’ requirements (places with high user concentrations, customer loyalty, trends and variances, comparison with competitors); and determine the best location for opening a new kindergarten, supermarket, or mall based on user concentration, interests, and sociodemographic factors. The solution started as a custom project for one of our UAE clients but, due to high demand, developed into a full-scale product that helps different businesses answer important questions and solve key tasks such as launching granular targeted ad campaigns, finding the best location for opening a business unit, and identifying the best locations for placing outdoor banners.
    Starting Price: $50,000
  • 7
    Insigna

    Insigna

    Insigna - Unified Digital Operations Platform™ offers comprehensive solutions for unification, management & analysis of operations data, enabling insights for informed decisions and performance improvements. With Insigna, you unlock the full potential of your data. Insigna solutions focus on open integration, enabling Seamless Connectivity across your ops, Data Analytics, Workflow Simplification, Automation, & Optimization, empowering organizations to harness the power of Data Intelligence. A user-friendly, no-code configuration helps you easily create customized dashboards & reports for actionable insights at your fingertips. Experience a rapid return on investment as Insigna streamlines your workflows & automates repetitive tasks, freeing up valuable resources for strategic initiatives. With real-time analytics & intuitive intelligence, decision-makers can quickly identify trends and make informed choices that drive incremental growth.
  • 8
    Navicat for MongoDB
    Navicat for MongoDB is designed to streamline your routine database tasks. The new interface is easy to access and understand, giving you new ways to manage your MongoDB databases and making your work more efficient than ever. Our professional object designer is available for all database objects such as Collections, Views, Functions, Indexes, GridFS, and MapReduce, and allows you to create, modify, and design database objects, all without writing a script.
  • 9
    CYRISMA

    CYRISMA

    CYRISMA is an all-in-one cyber risk management platform that enables you to discover, understand, mitigate, and manage risk in a holistic and cost-effective manner. Identify and mitigate network and endpoint vulnerabilities, discover and secure sensitive data across cloud and on-prem environments, strengthen OS configuration settings, track compliance, and generate cyber risk assessment reports in a few easy steps. Platform capabilities (all included in the price): Vulnerability and Patch Management; Secure OS Configuration Scanning; Sensitive data discovery and data protection (both on-prem and cloud, including Microsoft Office 365 and Google Workspace); Dark web monitoring; Compliance Tracking (NIST CSF, CIS Critical Controls, SOC 2, PCI DSS, HIPAA, ACSC Essential Eight, NCSC Cyber Essentials); Active Directory Monitoring (both on-prem and Azure); Cyber risk quantification in multiple currencies; Cyber risk assessment and reporting.
  • 10
    Kestra

    Kestra

    Kestra is an open-source, event-driven orchestrator that simplifies data operations and improves collaboration between engineers and business users. By bringing Infrastructure as Code best practices to data pipelines, Kestra allows you to build reliable workflows and manage them with confidence. Thanks to the declarative YAML interface for defining orchestration logic, everyone who benefits from analytics can participate in the data pipeline creation process. The UI automatically adjusts the YAML definition any time you make changes to a workflow from the UI or via an API call. Therefore, the orchestration logic is defined declaratively in code, even if some workflow components are modified in other ways.
  • 11
    LiteX

    Jedis Singapore Pte. Ltd

    LiteX is offered as two components: a Windows client (Client) and a Linux server (LiteServer). The standalone Client provides SFTP capability; File System Management (FSM), both local and remote; and Remote Proxy FSM (PFSM), which copies between remote systems transparently via the Client. SSH [2] [SSL] is supported. In addition, the Client has a server peer (LiteServer), available on Linux, which provides DB maintenance and multi-domain, bit-level Merge/Compare functionality geared to the Client. Full Client and Server documentation is available, along with LiteServer examples and a toolkit. The LiteX Client is licensed free for SFTP and FSM. LiteServer is priced on application (POA) for licensing and commercial use.
  • 12
    NMTY Enterprise
    NMTY Enterprise helps you protect all privacy-sensitive data within your organization, regardless of whether it is stored in databases or files. Make NMTY Enterprise part of your IT environment and immediately anonymize all data sources that need to be protected. NMTY Enterprise can anonymize data however it is stored, whether in a database or in separate files such as CSV and XML. Data is always anonymized directly within the source. This prevents non-anonymized data from being duplicated unnecessarily. Connections to your data sources support integrated authentication and are always encrypted when stored. In addition to anonymizing datasets, it is also possible to directly anonymize data processed within documents and images. Our solutions are developed based on the latest innovations and integrate directly into your existing processes. This way we ensure we always achieve the maximum result.
  • 13
    PK Protect
    PK Protect is a data protection platform designed to help organizations safeguard sensitive information across diverse environments. It provides robust tools for data discovery, classification, encryption, and monitoring, ensuring that critical data is protected both at rest and in transit. With automated policies and compliance controls, PK Protect enables businesses to meet regulatory requirements like GDPR and HIPAA while minimizing the risk of data breaches. The platform integrates with various systems to provide a unified approach to managing data security across cloud, on-premises, and hybrid environments. By offering real-time visibility and proactive threat detection, PK Protect helps organizations maintain control over their sensitive data and reduce security vulnerabilities.
  • 14
    Odyx yHat

    Odyssey Analytics

    Odyx yHat is a Time Series Forecasting tool designed to simplify the intricate field of data science, making it accessible and user-friendly for individuals without any background in data science.
    Starting Price: $300/month
  • 15
    Document Companion
    FabSoft's Document Companion caters to individual and business needs and is designed for ease of use, flexibility, and affordability. This document composer and editor offers an office-style interface compatible with Windows 10 & 11, allowing users to create, convert, edit, share, and sign text and PDF files efficiently.
    Starting Price: $39/year/user
  • 16
    ORMIT™ Jasper
    ORMIT™ Jasper is the only seamless automated migration solution for migrating Oracle Reports to Jasper Reports, and it can take up to 90% less time than a manual upgrade. RENAPS ORMIT™ Jasper eliminates all migration risks associated with a manual migration. ORMIT™ Jasper improves code quality and maintainability, thus paving the way for even more savings over time. 100% open source: no licensing, support fees, or vendor lock-ins will ever apply to your migrated reports. Jasper Reports can be used on a regular JavaEE server such as Tomcat, JBoss, or Jetty and can also be used outside of pure Java application development.
  • 17
    Datafor

    Datafor

    Let everyone become a data analyst, easily analyze all data, gain keen insight, and realize business innovation. Datafor's unique and powerful data visualization and analytics capabilities help organizations get real value from their data, and everyone in the business can ask complex data questions and gain meaningful insights. Datafor can intelligently generate your data model, allowing you to focus on the understanding of business and data, and the standardization and unification of measurement standards. Every employee has access to accurate, customizable datasets and metrics. Easily create reports and beautiful dashboards by dragging and dropping without professional skills. Business analysts can easily share data stories and the value of data from any device anytime, anywhere. With the advent of the data age changing people's lifestyles and ever-changing data technologies, Datafor's scalability allows you to stay on top of technology trends.
  • 18
    Arroyo

    Arroyo

    Scale from zero to millions of events per second. Arroyo ships as a single, compact binary. Run locally on macOS or Linux for development, and deploy to production with Docker or Kubernetes. Arroyo is a new kind of stream processing engine, built from the ground up to make real-time easier than batch. Arroyo was designed from the start so that anyone with SQL experience can build reliable, efficient, and correct streaming pipelines. Data scientists and engineers can build end-to-end real-time applications, models, and dashboards, without a separate team of streaming experts. Transform, filter, aggregate, and join data streams by writing SQL, with sub-second results. Your streaming pipelines shouldn't page someone just because Kubernetes decided to reschedule your pods. Arroyo is built to run in modern, elastic cloud environments, from simple container runtimes like Fargate to large, distributed deployments on Kubernetes.
  • 19
    MINDely
    MIND is the first-ever data security platform that puts data loss prevention (DLP) and insider risk management (IRM) programs on autopilot, so you can automatically identify, detect, and prevent data leaks at machine speed. Continuously find your sensitive data in files spread across your IT environments, whether at rest, in motion, or in use. By integrating with data sources across your IT workloads, MIND continuously exposes blind spots of sensitive data across SaaS, AI apps, endpoints, on-premises file shares, and emails. MIND monitors and analyzes billions of data security events in real time, enriches each incident with context, and remediates autonomously. MIND automatically blocks sensitive data in real time from escaping your control, or collaborates with users to remediate risks and educate them on your policies.
  • 20
    rqlite

    rqlite

    The lightweight, user-friendly, distributed relational database built on SQLite. Fault tolerance and high availability with zero hassle. rqlite is a distributed relational database that combines the simplicity of SQLite with the robustness of a fault-tolerant, highly available system. It's developer-friendly, its operation is straightforward, and it's designed for reliability with minimal complexity. Deploy in seconds, with no complex configurations. Seamlessly integrates with modern cloud infrastructures. Built on SQLite, the world’s most popular database. Supports full-text search, Vector Search, and JSON documents. Access controls and encryption for secure deployments. Rigorous, automated testing ensures high quality. Clustering provides high availability and fault tolerance. Automatic node discovery simplifies clustering.
  • 21
    VaultFS

    Swiss Vault

    VaultFS, developed by Swiss Vault Global, is an advanced data archive solution designed to provide exceptional data robustness, scalability, and efficiency for long-term storage needs. By utilizing erasure coding, VaultFS divides data into fragments with redundant pieces, distributing them across various storage locations to ensure data can be reconstructed even if some fragments are lost or corrupted. This approach minimizes hardware overhead, reducing both initial investments and ongoing maintenance costs. VaultFS's peer-to-peer architecture eliminates single points of failure, and its auto-regeneration mechanisms automatically repair corrupted data, ensuring continuous availability. The system's dynamic configuration allows for seamless scalability by adding more disks or nodes without disrupting operations, making it a reliable and efficient choice for organizations seeking next-generation data storage solutions.
  • 22
    Scourhead

    Scourhead

    Scourhead is a free, open source AI agent that scours the web, organizes data, and delivers results in a spreadsheet. It runs locally on your computer with no cloud dependencies or fees, ensuring privacy and control over your data. Available for macOS, Windows, and Linux, Scourhead automates online research by gathering information from multiple sources and consolidating it into an easy-to-analyze spreadsheet format. This streamlines data collection and analysis, making it ideal for researchers, analysts, and professionals seeking efficient data management solutions. By operating directly on your machine, Scourhead eliminates the need for cloud services, enhancing data security and reducing costs. Its open source nature allows for customization and community contributions, fostering continuous improvement and adaptability to various research needs. Whether for market research, academic studies, or business intelligence, Scourhead simplifies complex research tasks.
  • 23
    Dadroit JSON Viewer
    Experience lightning-fast JSON data management with Dadroit JSON Viewer. Effortlessly open massive JSON files (up to 1GB) in seconds and quickly search through your data using advanced RegEx and JSONPath features. Easily export your JSON files to XML, available in both minified and formatted versions.
    Starting Price: $8.20/month
  • 24
    DataBahn

    DataBahn

    DataBahn.ai is redefining how enterprises manage the explosion of security and operational data in the AI era. Our AI-powered data pipeline and fabric platform helps organizations securely collect, enrich, orchestrate, and optimize enterprise data—including security, application, observability, and IoT/OT telemetry—for analytics, automation, and AI. With native support for over 400 integrations and built-in enrichment capabilities, DataBahn streamlines fragmented data workflows and reduces SIEM and infrastructure costs from day one. The platform requires no specialist training, enabling security and IT teams to extract insights in real time and adapt quickly to new demands. We've helped Fortune 500 and Global 2000 companies reduce data processing costs by over 50% and automate more than 80% of their data engineering workloads.
  • 25
    ZARUS

    Maiora

    ZARUS is Maiora’s end-to-end No-Code/Low-Code Data Infrastructure Platform designed to help enterprises integrate, govern, transform, visualise, and observe data seamlessly across cloud, on-premises, and legacy systems. Built for speed, scalability, and compliance, ZARUS eliminates data silos, streamlines workflows, and enables organisations to unlock real-time, AI-ready insights without the burden of high-code development or multiple toolchains. With pre-built connectors, advanced data quality management, observability dashboards, and secure governance frameworks, ZARUS empowers CIOs, CTOs, CDOs, and CFOs to make faster, smarter decisions while reducing operational complexity and costs.
  • 26
    Microsoft R Open
    Microsoft continues its commitment to and development of R, not only in the latest Machine Learning Server release, but also in the newest Microsoft R Client and Microsoft R Open releases. You can also find R and Python support in SQL Server Machine Learning Services on Windows and Linux, and R support in Azure SQL Database. R components are backwards compatible. You should be able to run existing R scripts on newer versions, with the exception of dependencies on packages or platforms that are no longer supported, or known issues that require a workaround or code change. Microsoft R Open is the enhanced distribution of R from Microsoft Corporation. The current release, Microsoft R Open 4.0.2, is based on the statistical language R-4.0.2 and includes additional capabilities for improved performance, reproducibility, and platform support. It is compatible with all packages, scripts, and applications that work with R-4.0.2.
  • 27
    Datafiniti

    Datafiniti

    At Datafiniti, we help businesses become data-driven by offering easy access to a variety of high-quality, comprehensive data sets. Our customers, spanning startups to Fortune 500s, use our data to power next-generation applications and analytics. A data set of over 120 million businesses, covering 196 countries and all industries. Contains firmographics, reviews, and more. Searching for information on a company or business? Access our business database using our business API or web portal to leverage our large catalog of companies from hundreds of online directories and review websites. Integrate with firmographics, reviews, and other data. While every business is different, Datafiniti gathers and structures a wide breadth of business information for each business tracked in our catalog.
  • 28
    OneStep-JV

    Business Control Systems

    The OneStep-JV™ Point of Sale system brings the most advanced technology available in a full-featured suite of applications for retailers and distributors, combining the power and flexibility of Java and Oracle. Written in Java with Oracle as the embedded database at its foundation, OneStep-JV™ point of sale systems bring the most advanced and reliable technology and inventory management software available to achieve operational stability and cross-platform portability for retailers and distributors. The use of Java enables the operation of OneStep-JV™ POS systems on single-user computers, small and very large-scale networks, and portable devices like Palm Tops, running over a multitude of operating systems such as Windows and Windows Networks, Novell, Unix, and Linux. The stability of Oracle gives OneStep-JV™ POS systems a resilient database foundation designed with auto-recovery features to protect the integrity of the database and inventory control software.
  • 29
    Drive Cloner Rx

    Horizon Datasys

    Drive Cloner Rx is a bare metal recovery utility that enables professionals to easily perform system backups, create images, and assist with deployments. Drive Cloner Rx images an entire hard drive, including Windows, drivers, system files, and all programs, onto any media or a hidden partition for restoration. Drive Cloner Rx enables you to create an image of your hard drive and to restore your PC to this image. You can also mount this image onto external media (CD/DVD/BRD), an external hard drive, a network drive, a USB flash drive, etc. You can even use the bootable system recovery CD/DVD/Blu-ray Disc creator to load your installation files from external media or create an ISO image for remote storage. Backups can be created manually or set to run automatically. Drive Cloner Rx can perform full system backups or supplemental backup images. You can incrementally update a previously created Drive Cloner image.
    Starting Price: $29 one-time payment
  • 30
    Time Machine

    Solution-Soft

    Time Machine® provides software virtual clocks that enable you to time travel your applications into the future or the past, facilitating time shift testing on your date and time-sensitive application logic, such as month-end, quarter-end, year-end processing, billing cycle, workflow, regulatory go live and policy life cycle. Time Machine is transparent to applications and databases so no code modification is required to do time shift testing and the system clock is never modified. Time Machine eliminates the need to reset the system clock, which is time-consuming, error-prone, and not possible under Active Directory or in a Kerberos secured environment. Mitigate risks for mission-critical application failures. Ensure large-scale software projects finish on time and under budget. Windows, Linux, Unix, Mainframe zLinux, Dockerized, Virtualized, On-Iron, or in the Cloud. Time Machine runs everywhere you need it.
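
The idea of time shift testing mentioned in the Time Machine entry can be illustrated with a small in-process sketch. This is not how Time Machine itself works (the listing says it is transparent to applications and requires no code changes); it is only a minimal illustration, assuming a hypothetical `VirtualClock` class and a hypothetical `is_month_end` check, of shifting an application's notion of "now" without touching the system clock.

```python
import calendar
from datetime import datetime, timedelta


class VirtualClock:
    """Hypothetical virtual clock: real time plus an adjustable offset.

    The system clock is never modified; only the offset changes.
    """

    def __init__(self, offset=timedelta(0), base=datetime.now):
        self._offset = offset
        self._base = base  # source of real time, injectable for tests

    def travel_to(self, target):
        # Shift the clock so that "now" becomes the target moment.
        self._offset = target - self._base()

    def now(self):
        return self._base() + self._offset


def is_month_end(clock):
    """Example of date-sensitive logic: is today the last day of the month?"""
    today = clock.now().date()
    last_day = calendar.monthrange(today.year, today.month)[1]
    return today.day == last_day


clock = VirtualClock()
clock.travel_to(datetime(2030, 2, 28, 12, 0))  # future month-end
print(is_month_end(clock))
clock.travel_to(datetime(2030, 3, 15, 12, 0))  # mid-month
print(is_month_end(clock))
```

Unlike this sketch, which requires the application to read time through the injected clock, a transparent tool of the kind described in the listing would not need the application code to change.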