Best Data Management Software for Linux - Page 16

Compare the Top Data Management Software for Linux as of May 2026 - Page 16

  • 1
    jBASE

    jBASE

    jBASE

    The future of your PICK system requires a database platform that continually evolves to meet the needs of today’s developers. jBASE is now officially certified for Docker containers, including built-in support for the MongoDB NoSQL database, and standard APIs for Salesforce, Avalara, and dozens of other platforms. Plus new enhancements to Objects that make life easier for developers. We are continuing to invest in jBASE because we believe in PICK! While others see a decline, we’ve seen 6 years of consecutive growth. We care about your long-term success and haven’t had a maintenance price increase in decades. We play well with others by collaborating and making jBASE integrate with modern technologies like VSCode, Mongo, Docker, and Salesforce. The migration routes from other PICK databases have been vastly simplified, licensing now supports flexible CPU and SaaS-based models, and our in-line operating system approach means our scalability, speed and stability are unmatched.
  • 2
    Sedna

    Sedna

    Sedna

    Sedna is a free native XML database which provides a full range of core database services - persistent storage, ACID transactions, security, indices, hot backup. Flexible XML processing facilities include W3C XQuery implementation, tight integration of XQuery with full-text search facilities and a node-level update language. It provides a number of easy exampes which can be run directly in command line and describes how to run examples provided with Sedna. Sedna distribution comes with an example set based on the XMark XML benchmark. This set allows you to investigate the features of Sedna easily. Examples include bulk load of a sample XML document and a number of sample XQuery queries and updates to this document. Below we will show how to run one of them.
  • 3
    LevelDB

    LevelDB

    Google

    LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values. Keys and values are arbitrary byte arrays. Data is stored sorted by key. Callers can provide a custom comparison function to override the sort order. Multiple changes can be made in one atomic batch. Users can create a transient snapshot to get a consistent view of data. Forward and backward iteration is supported over the data. Data is automatically compressed using the Snappy compression library. External activity (file system operations etc.) is relayed through a virtual interface so users can customize the operating system interactions. We use a database with a million entries. Each entry has a 16 byte key, and a 100 byte value. Values used by the benchmark compress to about half their original size. We list the performance of reading sequentially in both the forward and reverse direction, and also the performance of a random lookup.
  • 4
    rsync

    rsync

    rsync

    rsync is an open source utility that provides fast incremental file transfer. rsync is freely available under the GNU General Public License. The GPG signing key that is used to sign the release files is available from the public pgp key-server network. If you have automatic key-fetching enabled, just running a normal "gpg --verify" will grab my key automatically. Or, feel free to grab the gpp key for Wayne Davison manually. rsync is a file transfer program for Unix systems. rsync uses the "rsync algorithm" which provides a very fast method for bringing remote files into sync. It does this by sending just the differences in the files across the link, without requiring that both sets of files are present at one of the ends of the link beforehand. Optionally preserves symbolic links, hard links, file ownership, permissions, devices and times. Internal pipelining reduces latency for multiple files.
  • 5
    PoINT Data Replicator

    PoINT Data Replicator

    PoINT Software & Systems

    Today, organizations are typically storing unstructured data in file systems and increasingly in object and cloud storage. Cloud and object storage have numerous advantages, particularly with regard to inactive data. This leads to the requirement to migrate or replicate files (e.g. from legacy NAS) to cloud or object storage. More and more data is stored in cloud and object storage. This has created an underestimated security risk. In most cases, data stored in the cloud or in on-premises object storage is not backed up, as it is believed to be secure. This assumption is negligent and risky. High availability and redundancy as offered by cloud services and object storage products do not protect against human error, ransomware, malware, or technology failure. Thus, also cloud and object data need backup or replication, most appropriately on a separate storage technology, at a different location and in the original format as stored in the cloud and object storage.
  • 6
    IBM ProtecTIER
    ProtecTIER® is a disk-based data storage system. It uses data deduplication technology to store data to disk arrays. With Feature Code 9022, the ProtecTIER Virtual Tape Library (VTL) service emulates traditional automated tape libraries. With Feature Code 9024, a stand-alone TS7650G can be configured as FSI. Several software applications run on various TS7650G components and configurations. The ProtecTIER Manager workstation is a customer-supplied workstation that runs the ProtecTIER Manager software. The ProtecTIER Manager software provides the management GUI interface to the TS7650G. The ProtecTIER VTL service emulates traditional tape libraries. By emulating tape libraries, ProtecTIER VTL provides the capability to transition to disk backup without having to replace your entire backup environment. Your existing backup application can access virtual robots to move virtual cartridges between virtual slots and drives.
  • 7
    Apache Kudu

    Apache Kudu

    The Apache Software Foundation

    A Kudu cluster stores tables that look just like tables you're used to from relational (SQL) databases. A table can be as simple as a binary key and value, or as complex as a few hundred different strongly-typed attributes. Just like SQL, every table has a primary key made up of one or more columns. This might be a single column like a unique user identifier, or a compound key such as a (host, metric, timestamp) tuple for a machine time-series database. Rows can be efficiently read, updated, or deleted by their primary key. Kudu's simple data model makes it a breeze to port legacy applications or build new ones, no need to worry about how to encode your data into binary blobs or make sense of a huge database full of hard-to-interpret JSON. Tables are self-describing, so you can use standard tools like SQL engines or Spark to analyze your data. Kudu's APIs are designed to be easy to use.
  • 8
    Apache Parquet

    Apache Parquet

    The Apache Software Foundation

    We created Parquet to make the advantages of compressed, efficient columnar data representation available to any project in the Hadoop ecosystem. Parquet is built from the ground up with complex nested data structures in mind, and uses the record shredding and assembly algorithm described in the Dremel paper. We believe this approach is superior to simple flattening of nested namespaces. Parquet is built to support very efficient compression and encoding schemes. Multiple projects have demonstrated the performance impact of applying the right compression and encoding scheme to the data. Parquet allows compression schemes to be specified on a per-column level, and is future-proofed to allow adding more encodings as they are invented and implemented. Parquet is built to be used by anyone. The Hadoop ecosystem is rich with data processing frameworks, and we are not interested in playing favorites.
  • 9
    Hypertable

    Hypertable

    Hypertable

    Hypertable delivers scalable database capacity at maximum performance to speed up your big data application and reduce your hardware footprint. Hypertable delivers maximum efficiency and superior performance over the competition which translates into major cost savings. A proven scalable design that powers hundreds of Google services. All the benefits of open source with a strong and thriving community. C++ implementation for optimum performance. 24/7/365 support for your business-critical big data application. Unparalleled access to Hypertable brain power by the employer of all core Hypertable developers. Hypertable was designed for the express purpose of solving the scalability problem, a problem that is not handled well by a traditional RDBMS. Hypertable is based on a design developed by Google to meet their scalability requirements and solves the scale problem better than any of the other NoSQL solutions out there.
  • 10
    InfiniDB

    InfiniDB

    Database of Databases

    InfiniDB is a column-store DBMS optimized for OLAP workloads. It has a distributed architecture to support Massive Paralllel Processing (MPP). It uses MySQL as its front-end such that users familiar with MySQL can quickly migrate to InfiniDB. Due to this fact, users can connect to InfiniDB using any MySQL connector. InfiniDB applies MVCC to do concurrency control. It uses term System Change Number (SCN) to indicate a version of the system. In its Block Resolution Manager (BRM), it utilizes three structures, version buffer, version substitution structure, and version buffer block manager, to manage multiple versions. InfiniDB applies deadlock detection to resolve conflicts. InfiniDB uses MySQL as its front-end and supports all MySQL syntaxes, including foreign keys. InfiniDB is a columnar DBMS. For each column, InfiniDB applies range partitioning and stores the minimum and maximum value of each partition in a small structure called extent map.
  • 11
    qikkDB

    qikkDB

    qikkDB

    QikkDB is a GPU accelerated columnar database, delivering stellar performance for complex polygon operations and big data analytics. When you count your data in billions and want to see real-time results you need qikkDB. We support Windows and Linux operating systems. We use Google Tests as the testing framework. There are hundreds of unit tests and tens of integration tests in the project. For development on Windows, Microsoft Visual Studio 2019 is recommended, and its dependencies are CUDA version 10.2 minimal, CMake 3.15 or newer, vcpkg, boost. For development on Linux, the dependencies are CUDA version 10.2 minimal, CMake 3.15 or newer, and boost. This project is licensed under the Apache License, Version 2.0. You can use an installation script or dockerfile to install qikkDB.
  • 12
    RRDtool

    RRDtool

    RRDtool

    RRDtool is the OpenSource industry standard, high performance data logging and graphing system for time series data. RRDtool can be easily integrated in shell scripts, perl, python, ruby, lua or tcl applications.
  • 13
    Amadea

    Amadea

    ISoft

    Amadea technology relies on the fastest real-time calculation and modeling engine on the market. Speed up the creation, deployment and automation of your analytics projects within the same integrated environment. Data quality is the key to analytical projects. Thanks to the ISoft real-time calculation engine, the fastest on the market, Amadea allows companies to prepare and use massive and/or complex data in real-time, regardless of the volume. ISoft started from a simple observation, successful analytical projects must involve the business users at every stage. Founded on a no-code interface, accessible to all types of users, Amadea allows everyone involved in analytical projects to take part. As Amadea has the fastest real-time calculation engine on the market, it lets you specify, prototype and build your data applications simultaneously. Amadea incorporates the fastest real-time data analysis engine on the market, 10 million lines per second & per core for standard calculations.
  • 14
    IBM InfoSphere Optim Data Privacy
    IBM InfoSphere® Optim™ Data Privacy provides extensive capabilities to effectively mask sensitive data across non-production environments, such as development, testing, QA or training. To protect confidential data this single offering provides a variety of transformation techniques that substitute sensitive information with realistic, fully functional masked data. Examples of masking techniques include substrings, arithmetic expressions, random or sequential number generation, date aging, and concatenation. The contextually accurate masking capabilities help masked data retain a similar format to the original information. Apply a range of masking techniques on-demand to transform personally-identifying information and confidential corporate data in applications, databases and reports. Data masking features help you to prevent misuse of information by masking, obfuscating, and privatizing personal information that is disseminated across non-production environments.
  • 15
    Axibase Enterprise Reporter (AER)
    Axibase Enterprise Reporter (AER) is a unified IT reporting solution for performance monitoring and capacity planning based on linked data and self-service concepts. The linked data architecture implemented in AER allows it to deliver reporting capabilities on top of underlying monitoring systems simultaneously, without copying the data. AER is pre-integrated with IBM Tivoli, Microsoft System Center Operations Manager, HP Openview and Performance Manager, BMC ProactiveNet, VMWare vCenter, Oracle Enterprise Manager, SAP HANA, NetApp OnCommand, WhatsUp, Dynatrace, Entuity and other solutions. In addition, AER provides the universal adapter for integration with any monitoring system or a custom data source that supports JDBC connectivity. Leveraging AER as a single point of access to IT infrastructure metrics, systems administrators and application support teams are able to execute and automate performance monitoring and capacity planning tasks with minimal effort.
  • 16
    solidDB

    solidDB

    UNICOM Systems

    solidDB is known worldwide for delivering data with extreme speed. There are millions of deployments of solidDB in telecommunications networks, enterprise applications, and embedded software & systems. Market leaders such as Cisco, HP, Alcatel, Nokia and Siemens rely on it for their mission-critical applications. By keeping critical data in memory, rather than on disk, solidDB can perform significantly faster than conventional databases. It helps applications achieve throughput of hundreds of thousands to millions of transactions per second with response times measured in microseconds. Beyond game-changing performance, solidDB also provides built-in data availability features that help sustain uptime, prevent data loss and accelerate recovery. Additionally, solidDB supports administrators with the flexibility to tailor the software to precise application needs and features designed to simplify deployment and administration, helping drive down the total cost of ownership (TCO).
  • 17
    eMite

    eMite

    eMite

    eMite is the operational intelligence platform that combines advanced analytics, data correlation, KPI management and threshold alerting into a single, out-of-the-box browser-based solution that provides actionable insights from both real-time and historical data. eMite provides a very flexible and powerful data onboarding ETL (extract, transform, load) framework using several technologies to extract data, including APIs, XML, JSON, SQL, and others. eMite has developed over 80 pre-built adaptors to automatically ingest data from common third-party solutions from vendors like Salesforce, Microsoft, Oracle, Atlassian, Snare, and Genesys. eMite also provides adaptors to onboard data from more generic data sources like a database or an Excel file. eMite includes a KPI (Key Performance Indicator) management system, allowing users to build custom KPIs that are relevant to their operations.
  • 18
    Apache Gobblin

    Apache Gobblin

    Apache Software Foundation

    A distributed data integration framework that simplifies common aspects of Big Data integration such as data ingestion, replication, organization, and lifecycle management for both streaming and batch data ecosystems. Runs as a standalone application on a single box. Also supports embedded mode. Runs as an mapreduce application on multiple Hadoop versions. Also supports Azkaban for launching mapreduce jobs. Runs as a standalone cluster with primary and worker nodes. This mode supports high availability and can run on bare metals as well. Runs as an elastic cluster on public cloud. This mode supports high availability. Gobblin as it exists today is a framework that can be used to build different data integration applications like ingest, replication, etc. Each of these applications is typically configured as a separate job and executed through a scheduler like Azkaban.
  • 19
    Feast

    Feast

    Tecton

    Make your offline data available for real-time predictions without having to build custom pipelines. Ensure data consistency between offline training and online inference, eliminating train-serve skew. Standardize data engineering workflows under one consistent framework. Teams use Feast as the foundation of their internal ML platforms. Feast doesn’t require the deployment and management of dedicated infrastructure. Instead, it reuses existing infrastructure and spins up new resources when needed. You are not looking for a managed solution and are willing to manage and maintain your own implementation. You have engineers that are able to support the implementation and management of Feast. You want to run pipelines that transform raw data into features in a separate system and integrate with it. You have unique requirements and want to build on top of an open source solution.
  • 20
    DataOps DataFlow
    A holistic component-based platform for automating Data Reconciliation tests in modern Data Lake and Cloud Data Migration projects using Apache Spark. DataOps DataFlow is a modern, web browser-based solution for automating the testing of ETL, Data Warehouse, and Data Migration projects. Use Dataflow to inject data from any of the varied data sources, compare data, and load differences to S3 or a database. With fast and easy to set up, create and run dataflow in minutes. A best in the class testing tool for Big Data Testing DataOps DataFlow can integrate with all modern and advanced data sources including RDBMS, NoSQL, Cloud, and File-Based.
    Starting Price: Contact us
  • 21
    Semarchy xDI
    Experience Semarchy’s flexible unified data platform to empower better business decisions enterprise-wide. Integrate all your data with xDI, the high-performance, agile, and extensible data integration for all styles and use cases. Its single technology federates all forms of data integration, and mapping converts business rules into deployable code. xDI has extensible and open architecture supporting on-premise, cloud, hybrid, and multi-cloud environments.
  • 22
    TABEX4

    TABEX4

    BOI Software

    TABEX4 runs on all common operating systems and is applicable throughout the company – both on mainframe and server systems. Tables can be maintained efficiently and safely, independent of platform or database. TABEX4 supports import of table data from other software products and memory forms through optimized APIs. Export is possible in diverse ways as well: e.g. pdf, e-mail and other data or store formats. Our TABEX4 FAQs offer you a profound overview of important TABEX4 topics. In the TABEX4 Wiki you will gain expert knowledge about technical questions and challenges. Master public audits smoothly: TABEX4 places absolute priority to transparency and security in processing master data and control data. The TABEX4 Relational Bridge extends TABEX4 by interfaces to relational databases and makes the entire TABEX4 functions available for RDBs.
  • 23
    Insigna

    Insigna

    Insigna

    Insigna - Unified Digital Operations Platform™ offers comprehensive solutions for unification, management & analysis of operations data enabling insights for informed decisions and performance improvements. With Insigna, you unlock the full potential of your data. Insigna solutions focus on open integration, enabling Seamless Connectivity across your ops, Data Analytics, Workflow Simplification, Automation, & Optimization, empowering organizations to harness the power of Data Intelligence. A user-friendly, no-code configuration, helps you easily create customized dashboards & reports for actionable insights at your fingertips. Experience a rapid return on investment as Insigna streamlines your workflows & automates repetitive tasks, freeing up valuable resources for strategic initiatives. With real-time analytics & intuitive intelligence, decision-makers can quickly identify trends and make informed choices that drive incremental growth.
  • 24
    Navicat for MongoDB
    Available for all database objects such as Collections, Views, Functions, Indexes, GridFS, and MapReduce. Our professional object designer allows you to create, modify, and design database objects, all without writing a script. Navicat for MongoDB is designed to streamline your routine database tasks. The new interface is easy to access and understand -- giving you new ways to manage your MongoDB databases and making your work more efficient than ever. Available for all database objects such as Collections, Views, Functions, Indexes, GridFS, and MapReduce. Our professional object designer allows you to create, modify, and design database objects, all without writing a script.
  • 25
    CYRISMA

    CYRISMA

    CYRISMA

    CYRISMA is an all-in-one cyber risk management platform that enables you to discover, understand, mitigate, and manage risk in a holistic and cost-effective manner. Identify and mitigate network and endpoint vulnerabilities, discover and secure sensitive data across cloud and on-prem environments, strengthen OS configuration settings, track compliance, and generate cyber risk assessment reports in a few easy steps. Platform capabilities include (everything included in the price): -- Vulnerability and Patch Management -- Secure OS Configuration Scanning -- Sensitive data discovery; data protection (both on-prem cloud including Microsoft Office 365 and Google Workspace) -- Dark web monitoring -- Compliance Tracking (NIST CSF, CIS Critical Controls, SOC 2, PCI DSS, HIPAA, ACSC Essential Eight, NCSC Cyber Essentials) -- Active Directory Monitoring (both on-prem and Azure) -- Cyber risk quantification in multiple currencies -- Cyber risk assessment and reporting
  • 26
    Kestra

    Kestra

    Kestra

    Kestra is an open-source, event-driven orchestrator that simplifies data operations and improves collaboration between engineers and business users. By bringing Infrastructure as Code best practices to data pipelines, Kestra allows you to build reliable workflows and manage them with confidence. Thanks to the declarative YAML interface for defining orchestration logic, everyone who benefits from analytics can participate in the data pipeline creation process. The UI automatically adjusts the YAML definition any time you make changes to a workflow from the UI or via an API call. Therefore, the orchestration logic is defined declaratively in code, even if some workflow components are modified in other ways.
  • 27
    LiteX

    LiteX

    Jedis Singapore Pte. Ltd

    LiteX is offered in two components : Windows [ Client ] Linux Server [ LiteServer ]. The *standalone* Client functionality has : - SFTP capability, - File System Management (local and remote). - Remote Proxy FSM (PFSM). Remote system(s) to system(s) copy etc transparently via the Client. - SSH [2] [ SSL ] supported. In addition Client has an Server peer [ LiteServer ] available on Linux which gives DB maintenance and multi-domain bit level, Merge/Compare [ Client geared ] functionality. Full Client and Server Documentation is available. LiteServer examples and toolkit available. LiteX client is licensed free for SFTP and FSM. LiteServer is POA for license and Commercial use.
  • 28
    NMTY Enterprise
    NMTY Enterprise helps you protect all your privacy-sensitive data within your organization, regardless if it is stored in databases or files. Make NMTY Enterprise part of your IT environment and immediately anonymize all data sources that need to be protected. NMTY Enterprise makes it possible to anonymize data regardless of how it is stored, from databases to individual files. Anonymize data stored in a database or in separate files such as CSV and XML. Data is always anonymized directly within the source. This prevents non-anonymized data from being duplicated unnecessarily. Connections to your data sources support integrated authentication and are always encrypted when stored. In addition to anonymizing datasets, it is also possible to directly anonymize data processed within documents and images. Our solutions are developed based on the latest innovations and integrate directly into your existing processes. This way we ensure we always achieve the maximum result.
  • 29
    PK Protect
    PK Protect is a data protection platform designed to help organizations safeguard sensitive information across diverse environments. It provides robust tools for data discovery, classification, encryption, and monitoring, ensuring that critical data is protected both at rest and in transit. With automated policies and compliance controls, PK Protect enables businesses to meet regulatory requirements like GDPR and HIPAA while minimizing the risk of data breaches. The platform integrates with various systems to provide a unified approach to managing data security across cloud, on-premises, and hybrid environments. By offering real-time visibility and proactive threat detection, PK Protect helps organizations maintain control over their sensitive data and reduce security vulnerabilities.
  • 30
    Odyx yHat

    Odyx yHat

    Odyssey Analytics

    Odyx yHat is a Time Series Forecasting tool designed to simplify the intricate field of data science, making it accessible and user-friendly for individuals without any background in data science.
    Starting Price: $300/month
MongoDB Logo MongoDB