Best Data Management Software for Apache Kafka - Page 3

Compare the Top Data Management Software that integrates with Apache Kafka as of July 2025 - Page 3

This is a list of Data Management software that integrates with Apache Kafka. Use the filters on the left to narrow the results to products with the integrations you need. View the products that work with Apache Kafka in the table below.

  • 1
    Spotfire

    Cloud Software Group

    Spotfire is the most complete analytics solution on the market, enabling everyone to explore and visualize new discoveries in data through immersive dashboards and advanced analytics. Spotfire analytics delivers capabilities at scale, including predictive analytics, geolocation analytics, and streaming analytics. And with Spotfire Mods, you can build tailored analytic apps rapidly, repeatedly, and to scale. With the Spotfire analytics platform you get a seamless, single-pane-of-glass experience for visual analytics, data discovery, and point-and-click insights. Immerse yourself in both historic and real-time data, interactively. Drill down or across multi-layer, disparate data sources with fully brush-linked, responsive visualizations. Imagine, then rapidly build, scalable tailored analytics apps using the Spotfire Mods framework, to get all the power of Spotfire software in your own fit-for-purpose analytics apps.
    Starting Price: $25 per month
  • 2
    BigID

    BigID

    BigID is data visibility and control for all types of data, everywhere. Reimagine data management for privacy, security, and governance across your entire data landscape. With BigID, you can automatically discover and manage personal and sensitive data – and take action for privacy, protection, and perspective. BigID uses advanced machine learning and data intelligence to help enterprises better manage and protect their customer & sensitive data, meet data privacy and protection regulations, and leverage unmatched coverage for all data across all data stores.
  • 3
    Querona

    YouNeedIT

    We make BI & Big Data analytics work easier and faster. Our goal is to empower business users and make always-busy business users and heavily loaded BI specialists less dependent on each other when solving data-driven business problems. If you have ever experienced a lack of the data you needed, time-consuming report generation, or a long queue to your BI expert, consider Querona. Querona uses a built-in Big Data engine to handle growing data volumes. Repeatable queries can be cached or calculated in advance. Optimization needs less effort as Querona automatically suggests query improvements. Querona empowers business analysts and data scientists by putting self-service in their hands. They can easily discover and prototype data models, add new data sources, experiment with query optimization, and dig into raw data. Less IT is needed. Now users can get live data no matter where it is stored. If databases are too busy to be queried live, Querona will cache the data.
  • 4
    Ataccama ONE
    Ataccama reinvents the way data is managed to create value on an enterprise scale. Unifying Data Governance, Data Quality, and Master Data Management into a single, AI-powered fabric across hybrid and cloud environments, Ataccama gives your business and data teams the ability to innovate with unprecedented speed while maintaining trust, security, and governance of your data.
  • 5
    Utilihive

    Greenbird Integration Technology

    Utilihive is a cloud-native big data integration platform, purpose-built for the digital, data-driven utility and offered as a managed service (SaaS). Utilihive is the leading enterprise integration platform as a service (iPaaS) purpose-built for energy and utility scenarios. Utilihive provides both the technical infrastructure platform (connectivity, integration, data ingestion, data lake, API management) and pre-configured integration content or accelerators (connectors, data flows, orchestrations, utility data model, energy data services, monitoring and reporting dashboards) to speed up the delivery of innovative data-driven services and simplify operations. Utilities play a vital role in achieving the Sustainable Development Goals and now have the opportunity to build universal platforms that facilitate the data economy in a new world of renewable energy. Seamless access to data is crucial to accelerating that digital transformation.
  • 6
    ScyllaDB

    ScyllaDB

    ScyllaDB is the database for data-intensive apps that require high performance and low latency. It enables teams to harness the ever-increasing computing power of modern infrastructures – eliminating barriers to scale as data grows. Unlike any other database, ScyllaDB is a distributed NoSQL database fully compatible with Apache Cassandra and Amazon DynamoDB, yet is built with deep architectural advancements that enable exceptional end-user experiences at radically lower costs. Over 400 game-changing companies like Disney+ Hotstar, Expedia, FireEye, Discord, Zillow, Starbucks, Comcast, and Samsung use ScyllaDB for their toughest database challenges. ScyllaDB is available as free open source software, a fully-supported enterprise product, and a fully managed database-as-a-service (DBaaS) on multiple cloud providers.
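    Because ScyllaDB is CQL/Cassandra-compatible, a standard open-source driver works against it. The snippet below is a minimal sketch using the Python cassandra-driver package; the contact point, keyspace, and table names are illustrative placeholders, not anything prescribed by ScyllaDB.

      # Minimal sketch (assumptions: a ScyllaDB node reachable at 127.0.0.1 and
      # the open-source cassandra-driver package, usable because ScyllaDB speaks CQL).
      from datetime import datetime, timezone
      from cassandra.cluster import Cluster

      cluster = Cluster(["127.0.0.1"])   # placeholder contact point
      session = cluster.connect()

      session.execute("""
          CREATE KEYSPACE IF NOT EXISTS demo
          WITH replication = {'class': 'SimpleStrategy', 'replication_factor': 1}
      """)
      session.execute("""
          CREATE TABLE IF NOT EXISTS demo.events (
              device_id text, ts timestamp, reading double,
              PRIMARY KEY (device_id, ts)
          )
      """)
      session.execute(
          "INSERT INTO demo.events (device_id, ts, reading) VALUES (%s, %s, %s)",
          ("sensor-1", datetime.now(timezone.utc), 21.5),
      )
      for row in session.execute(
          "SELECT * FROM demo.events WHERE device_id = %s", ("sensor-1",)
      ):
          print(row.device_id, row.ts, row.reading)
      cluster.shutdown()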
  • 7
    SQLstream

    Guavus, a Thales company

    SQLstream ranks #1 for IoT stream processing & analytics (ABI Research). Used by Verizon, Walmart, Cisco, & Amazon, our technology powers applications across data centers, the cloud, & the edge. Thanks to sub-ms latency, SQLstream enables live dashboards, time-critical alerts, & real-time action. Smart cities can optimize traffic light timing or reroute ambulances & fire trucks. Security systems can shut down hackers & fraudsters right away. AI / ML models, trained by streaming sensor data, can predict equipment failures. With lightning performance, up to 13M rows / sec / CPU core, companies have drastically reduced their footprint & cost. Our efficient, in-memory processing permits operations at the edge that are otherwise impossible. Acquire, prepare, analyze, & act on data in any format from any source. Create pipelines in minutes not months with StreamLab, our interactive, low-code GUI dev environment. Export SQL scripts & deploy with the flexibility of Kubernetes.
  • 8
    Tengu

    Tengu

    TENGU is a DataOps Orchestration Platform that works as a central workspace for data profiles of all levels. It provides data integration, extraction, transformation, and loading, all within its graph view UI, in which you can intuitively monitor your data environment. By using the platform, business, analytics, and data teams need fewer meetings and service tickets to collect data, and can start right away with the data relevant to furthering the company. The platform offers a unique graph view in which every element is automatically generated with all available info based on metadata, while allowing you to perform all necessary actions from the same workspace. Enhance collaboration and efficiency with the ability to quickly add and share comments, documentation, tags, and groups. The platform enables anyone to get straight to the data with self-service, thanks to its many automations, low-code and no-code functionalities, and built-in assistant.
  • 9
    E-MapReduce
    EMR is an all-in-one enterprise-ready big data platform that provides cluster, job, and data management services based on open-source ecosystems, such as Hadoop, Spark, Kafka, Flink, and Storm. Alibaba Cloud Elastic MapReduce (EMR) is a big data processing solution that runs on the Alibaba Cloud platform. EMR is built on Alibaba Cloud ECS instances and is based on open-source Apache Hadoop and Apache Spark. EMR allows you to use Hadoop and Spark ecosystem components, such as Apache Hive, Apache Kafka, Flink, Druid, and TensorFlow, to analyze and process data. You can use EMR to process data stored on different Alibaba Cloud data storage services, such as Object Storage Service (OSS), Log Service (SLS), and Relational Database Service (RDS). You can quickly create clusters without the need to configure hardware and software. All maintenance operations are completed through its web interface.
  • 10
    Apache Druid
    Apache Druid is an open source distributed data store. Druid’s core design combines ideas from data warehouses, timeseries databases, and search systems to create a high performance real-time analytics database for a broad range of use cases. Druid merges key characteristics of each of these three systems into its ingestion layer, storage format, querying layer, and core architecture. Druid stores and compresses each column individually, and only needs to read the ones needed for a particular query, which supports fast scans, rankings, and groupBys. Druid creates inverted indexes for string values for fast search and filter. Out-of-the-box connectors for Apache Kafka, HDFS, AWS S3, stream processors, and more. Druid intelligently partitions data based on time, so time-based queries are significantly faster than in traditional databases. Scale up or down by just adding or removing servers, and Druid automatically rebalances. Fault-tolerant architecture routes around server failures.
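    As a minimal sketch of the Kafka connector mentioned above, the snippet below submits a Kafka ingestion supervisor spec to a Druid cluster over HTTP. The router host, topic, and column names are placeholders, and the exact spec fields should be verified against the Druid version in use.

      import json
      import requests

      # Assumed placeholders: a Druid router at druid-router:8888 and a Kafka
      # topic named "clickstream" carrying JSON messages.
      supervisor_spec = {
          "type": "kafka",
          "spec": {
              "ioConfig": {
                  "type": "kafka",
                  "consumerProperties": {"bootstrap.servers": "kafka:9092"},
                  "topic": "clickstream",
                  "inputFormat": {"type": "json"},
              },
              "dataSchema": {
                  "dataSource": "clickstream",
                  "timestampSpec": {"column": "ts", "format": "iso"},
                  "dimensionsSpec": {"dimensions": ["user_id", "url"]},
                  "granularitySpec": {"segmentGranularity": "hour",
                                      "queryGranularity": "none"},
              },
              "tuningConfig": {"type": "kafka"},
          },
      }

      resp = requests.post(
          "http://druid-router:8888/druid/indexer/v1/supervisor",  # placeholder host
          data=json.dumps(supervisor_spec),
          headers={"Content-Type": "application/json"},
          timeout=30,
      )
      resp.raise_for_status()
      print(resp.json())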
  • 11
    Lenses

    Lenses.io

    Enable everyone to discover and observe streaming data. Sharing, documenting and cataloging your data can increase productivity by up to 95%. Then from data, build apps for production use cases. Apply a data-centric security model to cover all the gaps of open source technology, and address data privacy. Provide secure and low-code data pipeline capabilities. Eliminate blind spots and offer unparalleled observability into data and apps. Unify your data mesh and data technologies and be confident with open source in production. Lenses is the highest rated product for real-time stream analytics according to independent third party reviews. With feedback from our community and thousands of engineering hours invested, we've built features that ensure you can focus on what drives value from your real-time data. Deploy and run SQL-based real-time applications over any Kafka Connect or Kubernetes infrastructure, including AWS EKS.
    Starting Price: $49 per month
  • 12
    Amazon MSK
    Amazon Managed Streaming for Apache Kafka (Amazon MSK) is a fully managed service that makes it easy for you to build and run applications that use Apache Kafka to process streaming data. Apache Kafka is an open-source platform for building real-time streaming data pipelines and applications. With Amazon MSK, you can use native Apache Kafka APIs to populate data lakes, stream changes to and from databases, and power machine learning and analytics applications. Apache Kafka clusters are challenging to set up, scale, and manage in production. When you run Apache Kafka on your own, you need to provision servers, configure Apache Kafka manually, replace servers when they fail, orchestrate server patches and upgrades, architect the cluster for high availability, ensure data is durably stored and secured, set up monitoring and alarms, and carefully plan scaling events to support load changes.
    Starting Price: $0.0543 per hour
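    Because MSK exposes native Apache Kafka APIs, any standard Kafka client can produce to it. The snippet below is a minimal sketch using the open-source kafka-python package; the bootstrap broker string is a placeholder of the kind copied from the MSK console, and IAM-based authentication would require different SASL settings.

      import json
      from kafka import KafkaProducer

      # Placeholder TLS bootstrap broker from the MSK console.
      producer = KafkaProducer(
          bootstrap_servers="b-1.example.kafka.us-east-1.amazonaws.com:9094",
          security_protocol="SSL",   # MSK TLS listener; IAM auth needs SASL instead
          value_serializer=lambda v: json.dumps(v).encode("utf-8"),
      )

      producer.send("orders", {"order_id": 123, "amount": 49.95})
      producer.flush()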
  • 13
    Apache Ranger

    The Apache Software Foundation

    Apache Ranger™ is a framework to enable, monitor and manage comprehensive data security across the Hadoop platform. The vision with Ranger is to provide comprehensive security across the Apache Hadoop ecosystem. With the advent of Apache YARN, the Hadoop platform can now support a true data lake architecture, and enterprises can run multiple workloads in a multi-tenant environment. Data security within Hadoop needs to evolve to support multiple use cases for data access, while also providing a framework for central administration of security policies and monitoring of user access. Ranger offers centralized security administration to manage all security-related tasks in a central UI or using REST APIs, fine-grained authorization to perform a specific action or operation with a Hadoop component or tool, managed through a central administration tool, a standardized authorization method across all Hadoop components, and enhanced support for different authorization methods, such as role-based access control.
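    The REST APIs mentioned above can be scripted. The snippet below is a minimal sketch that lists and creates policies through Ranger's public v2 REST API; the admin URL, credentials, service name, and resource names are placeholders and should be checked against the Ranger version in use.

      import requests

      RANGER = "http://ranger-admin:6080"     # placeholder admin host
      AUTH = ("admin", "admin")               # placeholder credentials

      # List existing policies for an assumed Hive service named "hive_service".
      resp = requests.get(
          f"{RANGER}/service/public/v2/api/service/hive_service/policy",
          auth=AUTH, timeout=30,
      )
      resp.raise_for_status()
      for policy in resp.json():
          print(policy["id"], policy["name"])

      # Create a fine-grained policy granting SELECT on one table to one group.
      new_policy = {
          "service": "hive_service",
          "name": "analysts-read-sales",
          "resources": {
              "database": {"values": ["sales"]},
              "table": {"values": ["orders"]},
              "column": {"values": ["*"]},
          },
          "policyItems": [
              {"groups": ["analysts"],
               "accesses": [{"type": "select", "isAllowed": True}]}
          ],
      }
      requests.post(f"{RANGER}/service/public/v2/api/policy",
                    json=new_policy, auth=AUTH, timeout=30).raise_for_status()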
  • 14
    Vaultspeed

    VaultSpeed

    Experience faster data warehouse automation. The Vaultspeed automation tool is built on the Data Vault 2.0 standard and a decade of hands-on experience in data integration projects. Get support for all Data Vault 2.0 objects and implementation options. Generate quality code fast for all scenarios in a Data Vault 2.0 integration system. Plug Vaultspeed into your current set-up and leverage your investments in tools and knowledge. Get guaranteed compliance with the latest Data Vault 2.0 standard. We are in continuous interaction with Scalefree, the body of knowledge for the Data Vault 2.0 community. The Data Vault 2.0 modelling approach strips the model components to their bare minimum so they can be loaded through the same loading pattern (repeatable pattern) and have the same database structure. Vaultspeed works with a template system, which understands the structure of the object types, and easy-to-set configuration parameters.
    Starting Price: €600 per user per month
  • 15
    Y42

    Datos-Intelligence GmbH

    Y42 is the first fully managed Modern DataOps Cloud. It is purpose-built to help companies easily design production-ready data pipelines on top of their Google BigQuery or Snowflake cloud data warehouse. Y42 provides native integration of best-of-breed open-source data tools, comprehensive data governance, and better collaboration for data teams. With Y42, organizations enjoy increased accessibility to data and can make data-driven decisions quickly and efficiently.
  • 16
    Actian Avalanche
    Actian Avalanche is a fully managed hybrid cloud data warehouse service designed from the ground up to deliver high performance and scale across all dimensions – data volume, concurrent users, and query complexity – at a fraction of the cost of alternative solutions. It is a true hybrid platform that can be deployed on-premises as well as on multiple clouds, including AWS, Azure, and Google Cloud, enabling you to migrate or offload applications and data to the cloud at your own pace. Actian Avalanche delivers the best price-performance in the industry out-of-the-box, without DBA tuning and optimization techniques. For the same cost as alternative solutions, you can benefit from substantially better performance, or choose the same performance for significantly lower cost. For example, Avalanche provides up to 6x the price-performance advantage over Snowflake as measured by GigaOm’s TPC-H industry standard benchmark, and even more against many of the appliance vendors.
  • 17
    Intel Tiber AI Studio
    Intel® Tiber™ AI Studio is a comprehensive machine learning operating system that unifies and simplifies the AI development process. The platform supports a wide range of AI workloads, providing a hybrid and multi-cloud infrastructure that accelerates ML pipeline development, model training, and deployment. With its native Kubernetes orchestration and meta-scheduler, Tiber™ AI Studio offers complete flexibility in managing on-prem and cloud resources. Its scalable MLOps solution enables data scientists to easily experiment, collaborate, and automate their ML workflows while ensuring efficient and cost-effective utilization of resources.
  • 18
    Lyftrondata

    Lyftrondata

    Whether you want to build a governed delta lake, a data warehouse, or simply migrate from your traditional database to a modern cloud data warehouse, do it all with Lyftrondata. Simply create and manage all of your data workloads on one platform by automatically building your pipeline and warehouse. Analyze it instantly with ANSI SQL and BI/ML tools, and share it without writing any custom code. Boost the productivity of your data professionals and shorten your time to value. Define, categorize, and find all data sets in one place. Share these data sets with other experts with zero coding and drive data-driven insights. This data sharing ability is perfect for companies that want to store their data once, share it with other experts, and use it multiple times, now and in the future. Define datasets, apply SQL transformations, or simply migrate your SQL data processing logic to any cloud data warehouse.
  • 19
    CrateDB

    CrateDB

    The enterprise database for time series, documents, and vectors. Store any type of data and combine the simplicity of SQL with the scalability of NoSQL. CrateDB is an open source distributed database running queries in milliseconds, whatever the complexity, volume and velocity of data.
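    As a minimal sketch of the SQL-plus-NoSQL combination described above, the snippet below uses the official crate Python client against an assumed local CrateDB node; the host, table, and column names are placeholders.

      from crate import client

      conn = client.connect("http://localhost:4200")   # placeholder HTTP endpoint
      cursor = conn.cursor()

      cursor.execute("""
          CREATE TABLE IF NOT EXISTS sensor_readings (
              device_id TEXT,
              ts TIMESTAMP,
              payload OBJECT(DYNAMIC),     -- schemaless, document-style column
              reading DOUBLE PRECISION
          )
      """)
      cursor.execute(
          "INSERT INTO sensor_readings (device_id, ts, payload, reading) "
          "VALUES (?, ?, ?, ?)",
          ("sensor-1", "2025-07-01T12:00:00", {"unit": "celsius"}, 21.5),
      )
      cursor.execute("REFRESH TABLE sensor_readings")  # make the row visible to reads
      cursor.execute(
          "SELECT device_id, reading FROM sensor_readings WHERE device_id = ?",
          ("sensor-1",),
      )
      print(cursor.fetchall())
      conn.close()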
  • 20
    EraSearch

    Era Software

    Purpose-built for cloud-native deployments, EraSearch brings you a dynamic data fabric that leverages decoupled storage & compute, a true zero-schema design, and adaptive indexing to deliver an infinitely-scalable log management experience with an unparalleled reduction in cost and complexity. Lots of log management products are built on top of Elasticsearch; we built EraSearch from scratch to solve its key problems. By embracing a stateless design for all of the core components, managing EraSearch with Kubernetes is a dream. By using a modern, coordination-free ingest design, EraSearch can handle data at a greatly-reduced cost. Running EraSearch is completely hands-off, so you never have to worry about cluster health again.
    Starting Price: 65¢ per GB
  • 21
    GigaSpaces

    GigaSpaces

    Smart DIH is an operational data hub that powers real-time modern applications. It unleashes the power of customers’ data by transforming data silos into assets, turning organizations into data-driven enterprises. Smart DIH consolidates data from multiple heterogeneous systems into a highly performant data layer. Low-code tools empower data professionals to deliver data microservices in hours, shortening development cycles and ensuring data consistency across all digital channels. XAP Skyline is a cloud-native, in-memory data grid (IMDG) and developer framework designed for mission-critical, cloud-native apps. XAP Skyline delivers maximal throughput, microsecond latency, and scale, while maintaining transactional consistency. It provides extreme performance, significantly reducing data access time, which is crucial for real-time decisioning and transactional applications. XAP Skyline is used in financial services, retail, and other industries where speed and scalability are critical.
  • 22
    Mozart Data

    Mozart Data

    Mozart Data is the all-in-one modern data platform that makes it easy to consolidate, organize, and analyze data. Start making data-driven decisions by setting up a modern data stack in an hour - no engineering required.
  • 23
    IRI Voracity

    IRI, The CoSort Company

    Voracity is the only high-performance, all-in-one data management platform accelerating AND consolidating the key activities of data discovery, integration, migration, governance, and analytics. Voracity helps you control your data in every stage of the lifecycle, and extract maximum value from it. Only in Voracity can you: 1) CLASSIFY, profile and diagram enterprise data sources 2) Speed or LEAVE legacy sort and ETL tools 3) MIGRATE data to modernize and WRANGLE data to analyze 4) FIND PII everywhere and consistently MASK it for referential integrity 5) Score re-ID risk and ANONYMIZE quasi-identifiers 6) Create and manage DB subsets or intelligently synthesize TEST data 7) Package, protect and provision BIG data 8) Validate, scrub, enrich and unify data to improve its QUALITY 9) Manage metadata and MASTER data. Use Voracity to comply with data privacy laws, de-muck and govern the data lake, improve the reliability of your analytics, and create safe, smart test data.
  • 24
    Warp 10
    Warp 10 is a modular open source platform that collects, stores, and analyzes data from sensors. Shaped for the IoT with a flexible data model, Warp 10 provides a unique and powerful framework to simplify your processes from data collection to analysis and visualization, with support for geolocated data in its core model (called Geo Time Series). Warp 10 is both a time series database and a powerful analytics environment, allowing you to compute statistics, extract features for training models, filter and clean data, detect patterns and anomalies, synchronize series, and even produce forecasts. The analysis environment can be integrated within a large ecosystem of software components such as Spark, Kafka Streams, Hadoop, Jupyter, Zeppelin and many more. It can also access data stored in many existing solutions, including relational or NoSQL databases, search engines, and S3-type object storage systems.
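    As a minimal sketch of pushing a geolocated data point into Warp 10 over HTTP, the snippet below assumes the standard /api/v0/update endpoint, the plain-text GTS input format, and a write token passed in the X-Warp10-Token header; the host, token, and series name are placeholders.

      import time
      import requests

      WARP10_URL = "http://warp10.local:8080/api/v0/update"   # placeholder host
      WRITE_TOKEN = "<write-token>"                            # placeholder token

      ts_us = int(time.time() * 1_000_000)   # Warp 10 timestamps default to microseconds
      # GTS input format: <timestamp>/<lat:lon>/<elev> <class>{<labels>} <value>
      line = (f"{ts_us}/48.8566:2.3522/ "
              "sensors.temperature{site=paris,unit=celsius} 21.5")

      resp = requests.post(WARP10_URL, data=line,
                           headers={"X-Warp10-Token": WRITE_TOKEN}, timeout=30)
      resp.raise_for_status()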
  • 25
    Oracle Cloud Infrastructure Streaming
    The Streaming service is a real-time, serverless, Apache Kafka-compatible event streaming platform for developers and data scientists. Streaming is tightly integrated with Oracle Cloud Infrastructure (OCI), Database, GoldenGate, and Integration Cloud. The service also provides out-of-the-box integrations for hundreds of third-party products across categories such as DevOps, databases, big data, and SaaS applications. Data engineers can easily set up and operate big data pipelines. Oracle handles all infrastructure and platform management for event streaming, including provisioning, scaling, and security patching. With the help of consumer groups, Streaming can provide state management for thousands of consumers. This helps developers easily build applications at scale.
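    Because the service is Kafka-compatible, a standard Kafka client can consume from it. The snippet below is a minimal sketch using kafka-python with a consumer group; every connection value is a placeholder standing in for the stream pool's Kafka connection settings shown in the OCI console.

      from kafka import KafkaConsumer

      consumer = KafkaConsumer(
          "sensor-events",                                    # placeholder stream name
          bootstrap_servers="cell-1.streaming.us-ashburn-1.oci.oraclecloud.com:9092",
          security_protocol="SASL_SSL",
          sasl_mechanism="PLAIN",
          sasl_plain_username="tenancy/user/ocid1.streampool.oc1..example",  # placeholder
          sasl_plain_password="<auth-token>",                                # placeholder
          group_id="analytics-workers",     # consumer group; offsets managed by the service
          auto_offset_reset="earliest",
      )
      for message in consumer:
          print(message.partition, message.offset, message.value)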
  • 26
    Gravity Data
    Gravity's mission is to make streaming data easy from over 100 sources while only paying for what you use. Gravity removes the reliance on engineering teams to deliver streaming pipelines, with a simple interface to get streaming up and running in minutes from databases, event data, and APIs. Everyone in the data team can now build with simple point and click, so that you can focus on building apps, services, and customer experiences. Full execution traces and detailed error messages allow quick diagnosis and resolution. We have implemented new, feature-rich ways for you to quickly get started, from bulk set-up, default schemas, and data selection to different job modes and statuses. Spend less time wrangling with infrastructure and more time analysing data, while allowing our intelligent engine to keep your pipelines running. Gravity integrates with your systems for notifications and orchestration.
  • 27
    Equalum

    Equalum

    Equalum’s continuous data integration & streaming platform is the only solution that natively supports real-time, batch, and ETL use cases in one unified platform with zero coding required. Make the move to real-time with a fully orchestrated, drag-and-drop, no-code UI. Experience rapid deployment, powerful transformations, and scalable streaming data pipelines in minutes. Multi-modal, robust, and scalable CDC enables real-time streaming and data replication, tuned for best-in-class performance no matter the source. Get the power of open-source big data frameworks without the hassle. Equalum harnesses the scalability of open-source data frameworks such as Apache Spark and Kafka in its platform engine to dramatically improve the performance of streaming and batch data processes. Organizations can increase data volumes while improving performance and minimizing system impact using this best-in-class infrastructure.
  • 28
    CorralData

    CorralData

    CorralData is a full-service, AI-powered platform that radically simplifies unifying and analyzing data across multiple sources. With CorralData, you can access human data experts and industry-leading AI tools to easily and cost-effectively drive remarkable, data-fueled results. No coding, SQL, or engineering resources required. Here’s how CorralData works. First, unify all of your data sources using our 450+ pre-built data connectors. Our U.S.-based support team will guide you along the way. Then, begin instantly and securely unlocking powerful data insights using natural language AI questions. Or work directly with a human data expert to surface the insight you need. The CorralData platform includes everything you need to start using data to drive results in one simple, monthly subscription—stunning data visualization tools, built-in chat, custom reporting, and access to human experts. Schedule a free demo today and see how simple and easy AI-powered data can be.
    Starting Price: $200/month
  • 29
    FairCom EDGE
    FairCom EDGE simplifies the integration of sensor and machine data at the source – whether it’s a factory, water treatment plant, oil platform or wind farm. The world’s first converged IoT/Industrial IoT hub, FairCom EDGE unifies messaging, persistence and analytics with an all-in-one solution – complete with browser-based administration, configuration and monitoring. FairCom EDGE supports MQTT and OPC UA for machine-to-machine (M2M) communication, SQL for interactive analytics and HTTP/REST for real-time monitoring. It continuously retrieves data from sensors and machines with OPC UA support, and receives messages from those with MQTT support. The data is automatically parsed, persisted and made accessible via MQTT and SQL.
    Starting Price: Free
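    Because FairCom EDGE accepts MQTT messages, a sensor or gateway can publish to it with any standard MQTT client. The snippet below is a minimal sketch using the paho-mqtt 1.x client API; the broker address, port, and topic are placeholders.

      import json
      import paho.mqtt.client as mqtt

      client = mqtt.Client()                   # paho-mqtt 1.x style constructor
      client.connect("edge-hub.local", 1883)   # placeholder broker host and port
      client.loop_start()

      payload = {"device_id": "pump-7", "ts": "2025-07-01T12:00:00Z", "flow_rate": 14.2}
      client.publish("plant/line1/pump-7/telemetry", json.dumps(payload), qos=1)

      client.loop_stop()
      client.disconnect()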
  • 30
    Telmai

    Telmai

    A low-code, no-code approach to data quality. SaaS for flexibility, affordability, ease of integration, and efficient support. High standards of encryption, identity management, role-based access control, data governance, and compliance standards. Advanced ML models detect row-value data anomalies, and the models evolve and adapt to users' business and data needs. Add any number of data sources, records, and attributes; Telmai is well-equipped for unpredictable volume spikes and supports both batch and streaming processing. Data is constantly monitored to provide real-time notifications, with zero impact on pipeline performance. Telmai is a platform for data teams to proactively detect and investigate anomalies in real time, with a seamless onboarding, integration, and investigation experience. Onboarding is no-code: connect to your data source and specify alerting channels, and Telmai will automatically learn from your data and alert you when there are unexpected drifts.