Best Data Management Software for Docker - Page 3

Compare the Top Data Management Software that integrates with Docker as of October 2025 - Page 3

This a list of Data Management software that integrates with Docker. Use the filters on the left to add additional filters for products that have integrations with Docker. View the products that work with Docker in the table below.

  • 1
    Chalk

    Chalk

    Chalk

    Powerful data engineering workflows, without the infrastructure headaches. Complex streaming, scheduling, and data backfill pipelines, are all defined in simple, composable Python. Make ETL a thing of the past, fetch all of your data in real-time, no matter how complex. Incorporate deep learning and LLMs into decisions alongside structured business data. Make better predictions with fresher data, don’t pay vendors to pre-fetch data you don’t use, and query data just in time for online predictions. Experiment in Jupyter, then deploy to production. Prevent train-serve skew and create new data workflows in milliseconds. Instantly monitor all of your data workflows in real-time; track usage, and data quality effortlessly. Know everything you computed and data replay anything. Integrate with the tools you already use and deploy to your own infrastructure. Decide and enforce withdrawal limits with custom hold times.
    Starting Price: Free
  • 2
    Zerve AI

    Zerve AI

    Zerve AI

    Merging the best of a notebook and an IDE into one integrated coding environment, experts can explore their data and write stable code at the same time with fully automated cloud infrastructure. Zerve’s data science development environment gives data science and ML teams a unified space to explore, collaborate, build, and deploy data science & AI projects like never before. Zerve offers true language interoperability, meaning that as well as being able to use Python, R, SQL, or Markdown all in the same canvas, users can connect these code blocks to each other. No more long-running code blocks or containers, with Zerve enjoying unlimited parallelization at any stage of the development journey. Analysis artifacts are automatically serialized, versioned, stored, and preserved for later use, meaning easily changing a step in the data flow without needing to rerun any preceding steps. Fine-grained selection of compute resources and extra memory for complex data transformation.
  • 3
    Citus

    Citus

    Citus Data

    Citus gives you the Postgres you love, plus the superpower of distributed tables. 100% open source. Now with schema-based and row-based sharding, plus Postgres 16 support. Scale Postgres by distributing data & queries. You can start with a single Citus node, then add nodes & rebalance shards when you need to grow. Speed up queries by 20x to 300x (or more) through parallelism, keeping more data in memory, higher I/O bandwidth, and columnar compression. Citus is an extension (not a fork) to the latest Postgres versions, so you can use your familiar SQL toolset & leverage your Postgres expertise. Reduce your infrastructure headaches by using a single database for both your transactional and analytical workloads. Download and use Citus open source for free. You can manage Citus yourself, embrace open source, and help us improve Citus via GitHub. Focus on your application & forget about your database. Run your app on Citus in the cloud with Azure Cosmos DB for PostgreSQL.
    Starting Price: $0.27 per hour
  • 4
    DataTrust

    DataTrust

    RightData

    DataTrust is built to accelerate test cycles and reduce the cost of delivery by enabling continuous integration and continuous deployment (CI/CD) of data. It’s everything you need for data observability, data validation, and data reconciliation at a massive scale, code-free, and easy to use. Perform comparisons, and validations, and do reconciliation with re-usable scenarios. Automate the testing process and get alerted when issues arise. Interactive executive reports with quality dimension insights. Personalized drill-down reports with filters. Compare row counts at the schema level for multiple tables. Perform checksum data comparisons for multiple tables. Rapid generation of business rules using ML. Flexibility to accept, modify, or discard rules as needed. Reconciling data across multiple sources. DataTrust solutions offers the full set of applications to analyze source and target datasets.
  • 5
    Tarantool

    Tarantool

    Tarantool

    Corporations need a way to ensure uninterrupted operation of their systems, high speed of data processing, and reliability of storage. The in-memory technologies have proven themselves well in solving these problems. For more than 10 years, Tarantool has been helping companies all over the world build smart caches, data marts, and golden client profiles while saving server capacity. Reduce the cost of storing credentials compared to siloed solutions and improve the service and security of client applications. Reduce data management costs of maintaining a large number of disparate systems that store customer identities. Increase sales by improving the speed and quality of customer recommendations for goods or services through the analysis of user behavior and user data. Improve mobile and web channel service by accelerating frontends to reduce user outflow. IT systems of large organizations operate in a closed loop of a local network, where data circulates unprotected.
  • 6
    ProxySQL

    ProxySQL

    ProxySQL

    ProxySQL is built with an advanced multi-core architecture to support hundreds of thousands of concurrent connections, multiplexed to thousands of servers. ProxySQL supports sharding by user, schema or table by means of the advanced query rule engine or through customized plugins. The development team no longer needs to rewrite queries generated by ORMs or packaged software, ProxySQL's query rewriting feature can modify SQL statements on the fly. Battle-tested doesn't even begin to cover it — ProxySQL is war-tested. Performance is the priority and the numbers prove it. ProxySQL is an open source high performance, high availability, database protocol aware proxy for MySQL and PostgreSQL. ProxySQL is a robust SQL proxy solution that acts as a pivotal bridge between database clients and servers, offering a plethora of features designed to streamline database operations. ProxySQL empowers organizations to harness the full potential of their database infrastructure.
  • 7
    CloudBeaver Enterprise
    CloudBeaver Enterprise is a lightweight, browser-based data management platform designed for secure, multi-source database operations. It enables seamless integration with SQL, NoSQL, and cloud databases, including AWS, Microsoft Azure, and Google Cloud Platform (GCP), through its cloud explorer feature. It supports a range of functionalities such as data visualization, SQL script execution with smart autocompletion, entity-relationship diagramming, and AI-assisted query generation. Deployment is simplified via a single Docker command, and the system supports offline server installations without requiring internet access. Advanced user management capabilities include integration with enterprise authentication systems like AWS SSO, SAML, and OpenID, allowing for secure access control and user provisioning. CloudBeaver Enterprise also facilitates collaboration among teams by enabling shared access to resources and connections.
  • 8
    Astro by Astronomer
    For data teams looking to increase the availability of trusted data, Astronomer provides Astro, a modern data orchestration platform, powered by Apache Airflow, that enables the entire data team to build, run, and observe data pipelines-as-code. Astronomer is the commercial developer of Airflow, the de facto standard for expressing data flows as code, used by hundreds of thousands of teams across the world.
  • 9
    Commvault Cloud
    Commvault Cloud is a comprehensive cyber resilience platform designed to protect, manage, and recover data across diverse IT environments, including on-premises, cloud, and SaaS applications. Powered by Metallic AI, it offers advanced features such as AI-driven threat detection, automated compliance tools, and rapid recovery capabilities like Cleanroom Recovery and Cloudburst Recovery. The platform ensures continuous data security through proactive risk scanning, threat hunting, and cyber deception, while facilitating seamless recovery and business continuity with infrastructure-as-code automation. With a unified management interface, Commvault Cloud enables organizations to safeguard their critical data assets, maintain compliance, and swiftly respond to cyber threats, thereby minimizing downtime and operational disruptions.
  • 10
    Nightfall

    Nightfall

    Nightfall

    Discover, classify, and protect your sensitive data. Nightfall™ uses machine learning to identify business-critical data, like customer PII, across your SaaS, APIs, and data infrastructure, so you can manage & protect it. Integrate in minutes with cloud services via APIs to monitor data without agents. Machine learning classifies your sensitive data & PII with high accuracy, so nothing gets missed. Setup automated workflows for quarantines, deletions, alerts, and more - saving you time and keeping your business safe. Nightfall integrates directly with all your SaaS, APIs, and data infrastructure. Start building with Nightfall’s APIs for sensitive data classification & protection for free. Via REST API, programmatically get structured results from Nightfall’s deep learning-based detectors for things like credit card numbers, API keys, and more. Integrate with just a few lines of code. Seamlessly add data classification to your applications & workflows using Nightfall's REST API.
  • 11
    Gilhari

    Gilhari

    Software Tree

    We’re thrilled to announce that Software Tree has won a 2021 DEVIES Award in the code frameworks/libraries category for its innovative Gilhari microservice framework. Gilhari makes it easy for developers to quickly develop high-performance, database-agnostic, and Docker-compatible RESTful applications that need to interact with JSON data in cloud or on-premises. The object-oriented world and the relational world are conceptually different. Manually writing the verbose mapping logic to bridge the gap between the object-oriented and relational artifacts is tedious and time-consuming. Software Tree’s ORM technology frameworks are lightweight in their design and implementation and provide a lightweight feel in their usage. The lightweight aspects of our ORM technology do not compromise on its power and functionality, though. This results in faster development and deployment of modern applications that require flexible object-oriented access to relational data.
  • 12
    IBM Databand
    Monitor your data health and pipeline performance. Gain unified visibility for pipelines running on cloud-native tools like Apache Airflow, Apache Spark, Snowflake, BigQuery, and Kubernetes. An observability platform purpose built for Data Engineers. Data engineering is only getting more challenging as demands from business stakeholders grow. Databand can help you catch up. More pipelines, more complexity. Data engineers are working with more complex infrastructure than ever and pushing higher speeds of release. It’s harder to understand why a process has failed, why it’s running late, and how changes affect the quality of data outputs. Data consumers are frustrated with inconsistent results, model performance, and delays in data delivery. Not knowing exactly what data is being delivered, or precisely where failures are coming from, leads to persistent lack of trust. Pipeline logs, errors, and data quality metrics are captured and stored in independent, isolated systems.
  • 13
    Elucidata Polly
    Harness the power of biomedical data with Polly. The Polly Platform helps to scale batch jobs, workflows, coding environments and visualization applications. Polly allows resource pooling and provides optimal resource allocation based on your usage requirements and makes use of spot instances whenever possible. All this leads to optimization, efficiency, faster response time and lower costs for the resources. Get access to a dashboard to monitor resource usage and cost real time and minimize overhead of resource management by your IT team. Version control is integral to Polly’s infrastructure. Polly ensures version control for your workflows and analyses through a combination of dockers and interactive notebooks. We have built a mechanism that allows the data, code and the environment co-exist. This coupled with data storage on the cloud and the ability to share projects ensures reproducibility of every analysis you perform.
  • 14
    Nebula Graph
    The graph database built for super large-scale graphs with milliseconds of latency. We are continuing to collaborate with the community to prepare, popularize and promote the graph database. Nebula Graph only allows authenticated access via role-based access control. Nebula Graph supports multiple storage engine types and the query language can be extended to support new algorithms. Nebula Graph provides low latency read and write , while still maintaining high throughput to simplify the most complex data sets. With a shared-nothing distributed architecture , Nebula Graph offers linear scalability. Nebula Graph's SQL-like query language is easy to understand and powerful enough to meet complex business needs. With horizontal scalability and a snapshot feature, Nebula Graph guarantees high availability even in case of failures. Large Internet companies like JD, Meituan, and Xiaohongshu have deployed Nebula Graph in production environments.
  • 15
    Cayley

    Cayley

    Cayley

    Cayley is an open-source database for Linked Data. It is inspired by the graph database behind Google's Knowledge Graph (formerly Freebase). Cayley is an open-source graph database designed for ease of use and storing complex data. Built-in query editor, visualizer and REPL. Cayley can use multiple query languages like Gizmo, a query language inspired by Gremlin, GraphQL-inspired query language, MQL a simplified version for Freebase fans. Cayley is modular, easy to connect to your favorite programming languages and back-end stores, production ready, well tested and used by various companies for their production workloads and fast with optimized specifically for usage in applications. Rough performance testing shows that, on 2014 consumer hardware and an average disk, 134m quads in LevelDB is no problem and a multi-hop intersection query- films starring X and Y - takes ~150ms. Cayley is configured by default to run in memory (That's what backend memstore means).
  • 16
    AnzoGraph DB

    AnzoGraph DB

    Cambridge Semantics

    With a huge collection of analytical features, AnzoGraph DB can enhance your analytical framework. Watch this video to learn how AnzoGraph DB is a Massively Parallel Processing (MPP) native graph database that is built for data harmonization and analytics. Horizontally scalable graph database built for online analytics and data harmonization. Take on data harmonization and linked data challenges with AnzoGraph DB, a market-leading analytical graph database. AnzoGraph DB provides industrialized online performance for enterprise-scale graph applications. AnzoGraph DB uses familiar SPARQL*/OWL for semantic graphs but also supports Labeled Property Graphs (LPGs). Access to many analytical, machine learning and data science capabilities help you achieve new insights, delivered at unparalleled speed and scale. Use context and relationships between data as first-class citizens in your analysis. Ultra-fast data loading and analytical queries.
  • 17
    TIBCO Graph Database
    To unveil the true value of constantly evolving business data, you need to understand the relationships in data in a much more profound way. Unlike other databases, a graph database puts relationships at the forefront, using Graph theory and Linear Algebra to traverse and show how complex data webs, data sources, and data points relate. TIBCO® Graph Database allows you to discover, store, and convert complex dynamic data into meaningful insights. Enable users to rapidly build data and computational models that establish dynamic relationships among organizational silos. These knowledge graphs deliver value by connecting your organization’s vast array of data and revealing relationships that let you accelerate optimization of assets and processes. Combined OLTP and OLAP features in a single enterprise-grade database. Optimistic ACID level transaction properties with native storage and access.
  • 18
    Tokern

    Tokern

    Tokern

    Open source data governance suite for databases and data lakes. Tokern is a simple to use toolkit to collect, organize and analyze data lake's metadata. Run as a command-line app for quick tasks. Run as a service for continuous collection of metadata. Analyze lineage, access control and PII datasets using reporting dashboards or programmatically in Jupyter notebooks. Tokern is an open source data governance suite for databases and data lakes. Improve ROI of your data, comply with regulations like HIPAA, CCPA and GDPR and protect critical data from insider threats with confidence. Centralized metadata management of users, datasets and jobs. Powers other data governance features. Track Column Level Data Lineage for Snowflake, AWS Redshift and BigQuery. Build lineage from query history or ETL scripts. Explore lineage using interactive graphs or programmatically using APIs or SDKs.
  • 19
    Wavo

    Wavo

    Wavo

    We’ve released a revolutionary big data platform that gathers all information about a music business, providing a single source of truth for decisions. Every music business has hundreds of data sources. But they are siloed and fragmented. Our platform identifies and connects them to build a foundation of quality data that can be applied to all daily music business operations. To work efficiently and securely—and to surface valuable insight no one else can—record labels and agencies require a sophisticated data management and governance system, so that data is available, relevant, and usable at all times. As data sources are ingested into Wavo’s Big Data Platform, machine learning is deployed to tag data based on personalized templates, making it easy to access and drill-down into important information. This enables everyone in a music business to activate and deliver business-ready data, backed up and organized for immediate value.
  • 20
    jBASE

    jBASE

    jBASE

    The future of your PICK system requires a database platform that continually evolves to meet the needs of today’s developers. jBASE is now officially certified for Docker containers, including built-in support for the MongoDB NoSQL database, and standard APIs for Salesforce, Avalara, and dozens of other platforms. Plus new enhancements to Objects that make life easier for developers. We are continuing to invest in jBASE because we believe in PICK! While others see a decline, we’ve seen 6 years of consecutive growth. We care about your long-term success and haven’t had a maintenance price increase in decades. We play well with others by collaborating and making jBASE integrate with modern technologies like VSCode, Mongo, Docker, and Salesforce. The migration routes from other PICK databases have been vastly simplified, licensing now supports flexible CPU and SaaS-based models, and our in-line operating system approach means our scalability, speed and stability are unmatched.
  • 21
    Knoldus

    Knoldus

    Knoldus

    World's largest team of Functional Programming and Fast Data engineers focused on creating customized high-performance solutions. We move from "thought" to "thing" via rapid prototyping and proof of concept. Activate an ecosystem to deliver at scale with CI/CD to support your requirements. Understanding the strategic intent and stakeholder needs to develop a shared vision. Deploy MVP to launch the product in the most efficient & expedient manner possible. Continuous improvements and enhancements to support new requirements. Building great products and providing unmatched engineering services would not be possible without the knowledge and extensive usage of the latest tools and technology. We help you to capitalize on opportunities, respond to competitive threats, and scale successful investments by reducing organizational friction from your company’s structures, processes, and culture. Knoldus helps clients identify and capture the most value and meaningful insights from data.
  • 22
    WSO2 Enterprise Service Bus
    The WSO2 integration runtime engine is capable of playing multiple roles in your enterprise architecture. It can act as an ESB or a microservices integrator. When deployed as an ESB, it caters to your message routing, transformation, message mediation, service orchestration, as well as service and API hosting needs. Header-based, content-based, rule-based, and priority-based routing. Implement Enterprise Integration Patterns (EIPs), database integration, event stream integrations. Transform messages with XSLT 1.0/2.0, XPath, XQuery, and Smooks. Visual data mapping, CSV, JSON, and XML transformation connectors. Supports any RDBMS, CSV, Excel, ODS, Cassandra, and Google spreadsheets. Supports OData v4 protocol for any RDBMS and Cassandra data sources. Supports MSSQL, DB2, Oracle, OpenEdge, TerraData, MySQL, PostgreSQL/EnterpriseDB, H2, Derby or any database with a JDBC driver. Support for nested queries across data sources.
  • 23
    KeyDB

    KeyDB

    KeyDB

    KeyDB maintains full compatibility with Redis modules, API and protocol. Seamlessly drop in KeyDB and maintain full compatibility with your existing clients, scripts and configurations. Multi-Master mode uses a single replicated dataset across many nodes to serve both read and write operations Nodes can be replicated cross-region to offer submillisecond latencies to local clients. Cluster mode allows unlimited read and write scaling by splitting the dataset across shards. This allows unlimited scaling, and also support high availability through replica nodes. KeyDB offers new community driven commands that enable you to do more with your data. Add your own commands and functionality using JavaScript with the ModJS module. ModJS lets you write functions in javascript that can in turn be called directly by KeyBD. The example to the left shows and example of a javascript function that would be loaded with the module. It can then be called directly from your client.
  • 24
    witboost

    witboost

    Agile Lab

    witboost is a modular, scalable, fast, efficient data management system for your company to truly become data driven, reduce time-to-market, it expenditures and overheads. witboost comprises a series of modules. These are building blocks that can work as standalone solutions to address and solve a single need or problem, or they can be combined to create the perfect data management ecosystem for your company. Each module improves a specific data engineering function and they can be combined to create the perfect solution to answer your specific needs, guaranteeing a blazingly fact and smooth implementation, thus dramatically reducing time-to-market, time-to-value and consequently the TCO of your data engineering infrastructure. Smart Cities need digital twins to predict needs and avoid unforeseen problems, gathering data from thousands of sources and managing ever more complex telematics.
  • 25
    QuestDB

    QuestDB

    QuestDB

    QuestDB is a relational column-oriented database designed for time series and event data. It uses SQL with extensions for time series to assist with real-time analytics. These pages cover core concepts of QuestDB, including setup steps, usage guides, and reference documentation for syntax, APIs and configuration. This section describes the architecture of QuestDB, how it stores and queries data, and introduces features and capabilities unique to the system. Designated timestamp is a core feature that enables time-oriented language capabilities and partitioning. Symbol type makes storing and retrieving repetitive strings efficient. Storage model describes how QuestDB stores records and partitions within tables. Indexes can be used for faster read access on specific columns. Partitions can be used for significant performance benefits on calculations and queries. SQL extensions allow performant time series analysis with a concise syntax.
  • 26
    Cloudera Data Platform
    Unlock the potential of private and public clouds with the only hybrid data platform for modern data architectures with data anywhere. Cloudera is a hybrid data platform designed for unmatched freedom to choose—any cloud, any analytics, any data. Cloudera delivers faster and easier data management and data analytics for data anywhere, with optimal performance, scalability, and security. With Cloudera you get all the advantages of private cloud and public cloud for faster time to value and increased IT control. Cloudera provides the freedom to securely move data, applications, and users bi-directionally between the data center and multiple data clouds, regardless of where your data lives.
  • 27
    Abstracta
    Data abstraction on any data source at your fingertips without any need for deployment. Everything is managed in the cloud with no presence at the data source. Abstracta helps in unifying all data sources. It simplifies and streamlines interaction with data sources. Restify data as a service with our proprietary service builder feature. Empower business users to perform on-the-fly data analysis with self-service capabilities. Accelerate data security initiatives much faster than you would have imagined. Our innovative profiling tags simplify data authorization. Fortify data with row & column-level access control policies. Empower businesses to protect & monitor data access patterns. Let data governance initiatives leap with our innovative flagship product Abstracta™. Protect business from internal & external security threats. Ekahaa believes in scale with simplicity for 10X growth. With its fluid UI design, any data can be exposed as a service in 2 minutes.
  • 28
    Redpanda

    Redpanda

    Redpanda Data

    Breakthrough data streaming capabilities that let you deliver customer experiences never before possible. Kafka API and ecosystem are compatible. Redpanda BulletPredictable low latencies with zero data loss. Redpanda BulletUpto 10x faster than Kafka. Redpanda BulletEnterprise-grade support and hotfixes. Redpanda BulletAutomated backups to S3/GCS. Redpanda Bullet100% freedom from routine Kafka operations. Redpanda BulletSupport for AWS and GCP. Redpanda was designed from the ground up to be easily installed to get streaming up and running quickly. After you see its power, put Redpanda to the test in production. Use the more advanced Redpanda features. We manage provisioning, monitoring, and upgrades. Without any access to your cloud credentials. Sensitive data never leaves your environment. Provisioned, operated, and maintained for you. Configurable instance types. Expand cluster as your needs grow.
  • 29
    Kovair QuickSync

    Kovair QuickSync

    Kovair Software

    Kovair QuickSync is a one stop, cost-effective, wide-range data migration solution for any enterprise across industry. Kovair QuickSync is a Windows-based desktop solution, which can be easily installed and used. Requirement of minimal infrastructure for operation makes it a very cost effective and efficient solution for the industry. It not only helps to migrate data from one source to one target but also helps to migrate data from one source to multiple targets. Its Instinctive UI makes it easily adaptable and adorable to the users. Offers a built-in disaster recovery mechanism and re-migration capability to ensure 100% data migration with zero data loss. Supports template-based migration capability. Once the configuration is done for one project it can be reused for others. Provides on-screen monitoring of migration status providing a real-time update on the health of migration.
  • 30
    Meltano

    Meltano

    Meltano

    Meltano provides the ultimate flexibility in deployment options. Own your data stack, end to end. Ever growing connector library of 300+ connectors have been running in production for years. Run workflows in isolated environments, execute end-to-end tests, and version control everything. Open source gives you the power to build your ideal data stack. Define your entire project as code and collaborate confidently with your team. The Meltano CLI enables you to rapidly create your project, making it easy to start replicating data. Meltano is designed to be the best way to run dbt to manage your transformations. Your entire data stack is defined in your project, making it simple to deploy it to production. Validate your changes in development before moving to CI, and in staging before moving to production.