Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Add a Product Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Help
Create
Join
Login

Home
Compare Business Software
Data Management Software
Apache Cassandra

Best Data Management Software for Apache Cassandra - Page 3

View:

Open Source Commercial

Clear All Filters

Data Management Features

Data Capture 7
Data Migration 7
Data Analysis 6
Data Integration 6
More...
Data Quality Control 6
Data Security 6
Information Governance 6
Customer Data 5
Match & Merge 5
Master Data Management 4

Deployment

Cloud 71
Windows 33
Mac 28
Linux 26
On-Premises 14
iPad 6
iPhone 6
Android 5
Chromebook 1

Compare the Top Data Management Software that integrates with Apache Cassandra as of December 2025 - Page 3

Sort By:

Sponsored

Apache Cassandra Data Management Clear Filters

This a list of Data Management software that integrates with Apache Cassandra. Use the filters on the left to add additional filters for products that have integrations with Apache Cassandra. View the products that work with Apache Cassandra in the table below.

1

Nightfall

Nightfall

Discover, classify, and protect your sensitive data. Nightfall™ uses machine learning to identify business-critical data, like customer PII, across your SaaS, APIs, and data infrastructure, so you can manage & protect it. Integrate in minutes with cloud services via APIs to monitor data without agents. Machine learning classifies your sensitive data & PII with high accuracy, so nothing gets missed. Setup automated workflows for quarantines, deletions, alerts, and more - saving you time and keeping your business safe. Nightfall integrates directly with all your SaaS, APIs, and data infrastructure. Start building with Nightfall’s APIs for sensitive data classification & protection for free. Via REST API, programmatically get structured results from Nightfall’s deep learning-based detectors for things like credit card numbers, API keys, and more. Integrate with just a few lines of code. Seamlessly add data classification to your applications & workflows using Nightfall's REST API.

View Software
2

Molecula

Molecula

Molecula is an enterprise feature store that simplifies, accelerates, and controls big data access to power machine-scale analytics and AI. Continuously extracting features, reducing the dimensionality of data at the source, and routing real-time feature changes into a central store enables millisecond queries, computation, and feature re-use across formats and locations without copying or moving raw data. The Molecula feature store provides data engineers, data scientists, and application developers a single access point to graduate from reporting and explaining with human-scale data to predicting and prescribing real-time business outcomes with all data. Enterprises spend a lot of money preparing, aggregating, and making numerous copies of their data for every project before they can make decisions with it. Molecula brings an entirely new paradigm for continuous, real-time data analysis to be used for all your mission-critical applications.

View Software
3

JanusGraph

JanusGraph

JanusGraph is a scalable graph database optimized for storing and querying graphs containing hundreds of billions of vertices and edges distributed across a multi-machine cluster. JanusGraph is a project under The Linux Foundation, and includes participants from Expero, Google, GRAKN.AI, Hortonworks, IBM and Amazon. Elastic and linear scalability for a growing data and user base. Data distribution and replication for performance and fault tolerance. Multi-datacenter high availability and hot backups. All functionality is totally free. No need to buy commercial licenses. JanusGraph is fully open source under the Apache 2 license. JanusGraph is a transactional database that can support thousands of concurrent users executing complex graph traversals in real time. Support for ACID and eventual consistency. In addition to online transactional processing (OLTP), JanusGraph supports global graph analytics (OLAP) with its Apache Spark integration.

View Software
4

Secuvy AI

Secuvy

Secuvy is a next-generation cloud platform to automate data security, privacy compliance and governance via AI-driven workflows. Best in class data intelligence especially for unstructured data. Secuvy is a next-generation cloud platform to automate data security, privacy compliance and governance via ai-driven workflows. Best in class data intelligence especially for unstructured data. Automated data discovery, customizable subject access requests, user validations, data maps & workflows for privacy regulations such as ccpa, gdpr, lgpd, pipeda and other global privacy laws. Data intelligence to find sensitive and privacy information across multiple data stores at rest and in motion. In a world where data is growing exponentially, our mission is to help organizations to protect their brand, automate processes, and improve trust with customers. With ever-expanding data sprawls we wish to reduce human efforts, costs & errors for handling Sensitive Data.

View Software
5

WSO2 Enterprise Service Bus

WSO2

The WSO2 integration runtime engine is capable of playing multiple roles in your enterprise architecture. It can act as an ESB or a microservices integrator. When deployed as an ESB, it caters to your message routing, transformation, message mediation, service orchestration, as well as service and API hosting needs. Header-based, content-based, rule-based, and priority-based routing. Implement Enterprise Integration Patterns (EIPs), database integration, event stream integrations. Transform messages with XSLT 1.0/2.0, XPath, XQuery, and Smooks. Visual data mapping, CSV, JSON, and XML transformation connectors. Supports any RDBMS, CSV, Excel, ODS, Cassandra, and Google spreadsheets. Supports OData v4 protocol for any RDBMS and Cassandra data sources. Supports MSSQL, DB2, Oracle, OpenEdge, TerraData, MySQL, PostgreSQL/EnterpriseDB, H2, Derby or any database with a JDBC driver. Support for nested queries across data sources.

View Software
6

Amundsen

Amundsen

Discover & trust data for your analysis and models. Be more productive by breaking silos. Get immediate context into the data and see how others are using it. Search for data within your organization by a simple text search. A PageRank-inspired search algorithm recommends results based on names, descriptions, tags, and querying/viewing activity on the table/dashboard. Build trust in data using automated and curated metadata, descriptions of tables and columns, other frequent users, when the table was last updated, statistics, a preview of the data if permitted, etc. Easy triage by linking the ETL job and code that generated the data. Update tables and columns with descriptions, reduce unnecessary back and forth about which table to use and what a column contains. See what data fellow co-workers frequently use, own or have bookmarked. Learn what most common queries for a table look like by seeing dashboards built on a given table.

View Software
7

witboost

Agile Lab

witboost is a modular, scalable, fast, efficient data management system for your company to truly become data driven, reduce time-to-market, it expenditures and overheads. witboost comprises a series of modules. These are building blocks that can work as standalone solutions to address and solve a single need or problem, or they can be combined to create the perfect data management ecosystem for your company. Each module improves a specific data engineering function and they can be combined to create the perfect solution to answer your specific needs, guaranteeing a blazingly fact and smooth implementation, thus dramatically reducing time-to-market, time-to-value and consequently the TCO of your data engineering infrastructure. Smart Cities need digital twins to predict needs and avoid unforeseen problems, gathering data from thousands of sources and managing ever more complex telematics.

View Software
8

Apache Hudi

Apache Corporation

Hudi is a rich platform to build streaming data lakes with incremental data pipelines on a self-managing database layer, while being optimized for lake engines and regular batch processing. Hudi maintains a timeline of all actions performed on the table at different instants of time that helps provide instantaneous views of the table, while also efficiently supporting retrieval of data in the order of arrival. A Hudi instant consists of the following components. Hudi provides efficient upserts, by mapping a given hoodie key consistently to a file id, via an indexing mechanism. This mapping between record key and file group/file id, never changes once the first version of a record has been written to a file. In short, the mapped file group contains all versions of a group of records.

View Software
9

Blueflood

Blueflood

Blueflood is a high throughput, low latency, multi-tenant distributed metric processing system behind Rackspace Metrics, which is currently used in production by the Rackspace Monitoring team and Rackspace public cloud team to store metrics generated by their systems. In addition to Rackspace metrics, other large scale deployments of Blueflood can be found at community Wiki. Data from Blueflood can be used to construct dashboards, generate reports, graphs or for any other use involving time-series data. It focuses on near-realtime data, with data that is queryable mere milliseconds after ingestion. You send metrics to the ingestion service. You query your metrics from the Query service. And in the background, rollups are batch-processed offline so that queries for large time-periods are returned quickly.

View Software
10

Hawkular Metrics

Hawkular Metrics

Hawkular Metrics is a scalable, asynchronous, multi tenant, long term metrics storage engine that uses Cassandra as the data store and REST as the primary interface. This section provides an overview of some of the key features of Hawkular Metrics. The following sections provide in-depth discussions on these as well as other features. Hawkular Metrics is all about scalability. You can run a single instance backed by a single Cassandra node. You can also scale out Cassandra to multiple nodes to handle increasing loads. The Hawkular Metrics server employs a stateless architecture, which makes it easy to scale out as well. This diagram illustrates the various deployment options made possible with Hawkular Metrics' scalable architecture. The upper left shows the simplest deployment with a single Cassandra node and single Hawkular Metrics node. The bottom right picture shows that it is possible to run more Hawkular Metrics nodes than Cassandra nodes.

View Software
11

Heroic

Heroic

Heroic is an open-source monitoring system originally built at Spotify to address problems faced with large scale gathering and near real-time analysis of metrics. Heroic uses a small set of components which are responsible for very specific things. Indefinite retention, as long as you have the hardware spend. Federation support to connect multiple Heroic clusters into a global interface. Heroic uses a small set of components which are responsible for very specific things. Consumers are the component responsible for consuming metrics. When building Heroic it was quickly realized that navigating hundreds of millions of time series without context is hard. Heroic has support for federating requests, which allows multiple independent Heroic clusters to serve clients through a single global interface. This can be used to reduce the amount of geographical traffic by allowing one cluster to operate completely isolated within its zone.

View Software
12

KairosDB

KairosDB

Data can be pushed in KairosDB via multiple protocols like Telnet, Rest and Graphite. Other mechanisms such as plugins can also be used. KairosDB stores time series in Cassandra, the popular and performant NoSQL datastore. The schema consists of 3 column families. This API provides operations to list existing metric names, list tag names and values, store metric data points, and query for metric data points. With a default install, KairosDB serve up a query page whereby you can query data within the data store. It's designed primarily for development purposes. Aggregators perform an operation on data points and down samples. Standard functions like min, max, sum, count, mean and more are available. Import and export is available on the KairosDB server from the command line. Internal metrics to the data store can monitor the server’s performance.

View Software
13

Varada

Varada

Varada’s dynamic and adaptive big data indexing solution enables to balance performance and cost with zero data-ops. Varada’s unique big data indexing technology serves as a smart acceleration layer on your data lake, which remains the single source of truth, and runs in the customer cloud environment (VPC). Varada enables data teams to democratize data by operationalizing the entire data lake while ensuring interactive performance, without the need to move data, model or manually optimize. Our secret sauce is our ability to automatically and dynamically index relevant data, at the structure and granularity of the source. Varada enables any query to meet continuously evolving performance and concurrency requirements for users and analytics API calls, while keeping costs predictable and under control. The platform seamlessly chooses which queries to accelerate and which data to index. Varada elastically adjusts the cluster to meet demand and optimize cost and performance.

View Software
14

StreamFlux

Fractal

Data is crucial when it comes to building, streamlining and growing your business. However, getting the full value out of data can be a challenge, many organizations are faced with poor access to data, incompatible tools, spiraling costs and slow results. Simply put, leaders who can turn raw data into real results will thrive in today’s landscape. The key to this is empowering everyone across your business to be able to analyze, build and collaborate on end-to-end AI and machine learning solutions in one place, fast. Streamflux is a one-stop shop to meet your data analytics and AI challenges. Our self-serve platform allows you the freedom to build end-to-end data solutions, uses models to answer complex questions and assesses user behaviors. Whether you’re predicting customer churn and future revenue, or generating recommendations, you can go from raw data to genuine business impact in days, not months.

View Software
15

Meltano

Meltano

Meltano provides the ultimate flexibility in deployment options. Own your data stack, end to end. Ever growing connector library of 300+ connectors have been running in production for years. Run workflows in isolated environments, execute end-to-end tests, and version control everything. Open source gives you the power to build your ideal data stack. Define your entire project as code and collaborate confidently with your team. The Meltano CLI enables you to rapidly create your project, making it easy to start replicating data. Meltano is designed to be the best way to run dbt to manage your transformations. Your entire data stack is defined in your project, making it simple to deploy it to production. Validate your changes in development before moving to CI, and in staging before moving to production.

View Software
16

Semarchy xDI

Semarchy

Experience Semarchy’s flexible unified data platform to empower better business decisions enterprise-wide. Integrate all your data with xDI, the high-performance, agile, and extensible data integration for all styles and use cases. Its single technology federates all forms of data integration, and mapping converts business rules into deployable code. xDI has extensible and open architecture supporting on-premise, cloud, hybrid, and multi-cloud environments.

View Software
17

Luna for Apache Cassandra

DataStax

Luna is a subscription to the Apache Cassandra support and expertise at DataStax. It allows you to enjoy all the benefits of open-source Cassandra, with the peace of mind knowing you have direct access to the team that authored the majority of the code and supported some of the largest deployments in the world. Best practices, advice, and SLA-based support to keep your Cassandra deployment in top shape. Scale without compromising on performance or latency to seamlessly manage the most demanding real-time workloads. Create real-time and highly-interactive customer experiences with blisteringly fast read and writes. Luna provides assistance with resolving issues and following best practices with Cassandra clusters. Services provide help through the full application life cycle, with a deeper integration in your team working together on implementation.

View Software
18

QueryPie

QueryPie

QueryPie is a centralized platform to manage scattered data sources and security policies all in one place. Put your company on the fast track to success without changing the existing data environment. Data governance is vital to today's data-driven world. Ensure you're on the right side of data governance standards while giving many users access to growing amounts of critical information. Establish data access policies by including key attributes such as IP address and access time. Privilege types can be created based on SQL commands classified as DML, DCL, and DDL to secure data analysis and editing. Manage details of SQL events at a glance and discover user behavior and potential security concerns by browsing logs based on permissions. All histories can be exported as a file and used for reporting purposes.

View Software
19

Kestra

Kestra

Kestra is an open-source, event-driven orchestrator that simplifies data operations and improves collaboration between engineers and business users. By bringing Infrastructure as Code best practices to data pipelines, Kestra allows you to build reliable workflows and manage them with confidence. Thanks to the declarative YAML interface for defining orchestration logic, everyone who benefits from analytics can participate in the data pipeline creation process. The UI automatically adjusts the YAML definition any time you make changes to a workflow from the UI or via an API call. Therefore, the orchestration logic is defined declaratively in code, even if some workflow components are modified in other ways.

View Software
20

Salesforce Data Cloud

Salesforce

Salesforce Data Cloud is a real-time data platform designed to unify and manage customer data from multiple sources across an organization, enabling a single, comprehensive view of each customer. It allows businesses to collect, harmonize, and analyze data in real time, creating a 360-degree customer profile that can be leveraged across Salesforce’s various applications, such as Marketing Cloud, Sales Cloud, and Service Cloud. This platform enables faster, more personalized customer interactions by integrating data from online and offline channels, including CRM data, transactional data, and third-party data sources. Salesforce Data Cloud also offers advanced AI gents and analytics capabilities, helping organizations gain deeper insights into customer behavior and predict future needs. By centralizing and refining data for actionable use, Salesforce Data Cloud supports enhanced customer experiences, targeted marketing, and efficient, data-driven decision-making across departments.

View Software
21

Borneo

Borneo

Borneo is a real-time data security and privacy observability platform designed to help organizations discover, remediate, and govern data risks while ensuring privacy and compliance. It enables users to discover where health data, financial data, and PII are stored across unstructured data, SaaS apps, and public cloud environments. Borneo's risk correlation engine identifies data that violates security frameworks and privacy regulations, prompting immediate action. It offers automatic remediation through data masking, access changes, and encryption, and continuously monitors changes across the data landscape to maintain compliance and eliminate regulatory risk. Built by security practitioners from Uber, Facebook, and Yahoo, Borneo is crafted to handle data at scale. It features a powerful connector framework to integrate across diverse data landscapes, supports flexible and modular deployment, and ensures that data never leaves the user's cloud environment.

View Software
22

HyperCube

BearingPoint

Whatever your business need, discover hidden insights quickly and easily using HyperCube, the platform designed for the way data scientists work. Put your business data to work. Unlock understanding, discover unrealized opportunities, generate predictions and avoid risks before they happen. HyperCube takes huge volumes of data and turns it into actionable insights. Whether a beginner in analytics or a machine learning expert, HyperCube is designed with you in mind. It is the Swiss Army knife of data science, combining proprietary and open source code to deliver a wide range of data analysis features straight out of the box or as business apps, customized just for you. We are constantly updating and perfecting our technology so we can deliver the most innovative, intuitive and adaptable results Choose from apps, data-as-a-services (DaaS) and vertical market solutions.

View Software
23

DataStax

DataStax

The Open, Multi-Cloud Stack for Modern Data Apps. Built on open-source Apache Cassandra™. Global-scale and 100% uptime without vendor lock-in. Deploy on multi-cloud, on-prem, open-source, and Kubernetes. Elastic and pay-as-you-go for improved TCO. Start building faster with Stargate APIs for NoSQL, real-time, reactive, JSON, REST, and GraphQL. Skip the complexity of multiple OSS projects and APIs that don’t scale. Ideal for commerce, mobile, AI/ML, IoT, microservices, social, gaming, and richly interactive applications that must scale-up and scale-down with demand. Get building modern data applications with Astra, a database-as-a-service powered by Apache Cassandra™. Use REST, GraphQL, JSON with your favorite full-stack framework Richly interactive apps that are elastic and viral-ready from Day 1. Pay-as-you-go Apache Cassandra DBaaS that scales effortlessly and affordably.

View Software
24

CData Python Connectors

CData Software

CData Python Connectors simplify the way that Python users connect to SaaS, Big Data, NoSQL, and relational data sources. Our Python Connectors offer simple Python database interfaces (DB-API), making it easy to connect with popular tooling like Jupyter Notebook, SQLAlchemy, pandas, Dash, Apache Airflow, petl, and more. CData Python Connectors create a SQL wrapper around APIs and data protocols, simplifying data access from within Python and enabling Python users to easily connect more than 150 SaaS, Big Data, NoSQL, and relational data sources with advanced Python processing. The CData Python Connectors fill a critical gap in Python tooling by providing consistent connectivity with data-centric interfaces to hundreds of different SaaS/Cloud, NoSQL, and Big Data sources. Download a 30-day free trial or learn more at: https://www.cdata.com/python/

View Software
25

OctoData

SoyHuCe

OctoData is deployed at a lower cost, in Cloud hosting and includes personalized support from the definition of your needs to the use of the solution. OctoData is based on innovative open-source technologies and knows how to adapt to open up to future possibilities. Its Supervisor offers a management interface that allows you to quickly capture, store and exploit a growing quantity and variety of data. With OctoData, prototype and industrialize your massive data recovery solutions in the same environment, including in real time. Thanks to the exploitation of your data, obtain precise reports, explore new possibilities, increase your productivity and gain in profitability.

View Software
26

SSIS Integration Toolkit

KingswaySoft

Jump right to our product page to see our full range of data integration software, including solutions for SharePoint and Active Directory. With over 300 individual data integration tools for connectivity and productivity, our data integration solutions allow developers to take advantage of the flexibility and power of the SSIS ETL engine to integrate virtually any application or data source. You don't have to write a single line of code to make data integration happen so your development can be done in a matter of minutes. We make the most flexible integration solution on the market. Our software offers intuitive user interfaces that are flexible and easy to use. With a streamlined development experience and an extremely simple licensing model, our solution offers the best value for your investment. Our software offers many specifically designed features that help you achieve the best possible performance without having to hijack your budget.

View Software
27

BigBI

BigBI

BigBI enables data specialists to build their own powerful big data pipelines interactively & efficiently, without any coding! BigBI unleashes the power of Apache Spark enabling: Scalable processing of real Big Data (up to 100X faster) Integration of traditional data (SQL, batch files) with modern data sources including semi-structured (JSON, NoSQL DBs, Elastic, Hadoop), and unstructured (Text, Audio, video), Integration of streaming data, cloud data, AI/ML & graphs

View Software
28

CipherTrust Tokenization

Thales Cloud Security

CipherTrust Tokenization dramatically reduces the cost and effort required to comply with security policies and regulatory mandates like PCI DSS while also making it simple to protect other sensitive data including personally identifiable information (PII). While there are no tokenization standards in the industry, most tokenization solutions fall into one of two architectures: vaultless- or vaulted tokenization Both secure and anonymize sensitive assets. Tokenization software can reside in the data center, big data environments or the cloud. Tokenization can remove card holder data from PCI DSS scope with minimal cost and effort, enabling organizations to save on costs associated with compliance with the industry standard. Modern IT architectures require both use and protection of personally identifiable information (PII). With CipherTrust tokenization, PII protection is gained without encryption key management required by the software developer.

View Software
29

Data Sentinel

Data Sentinel

As a business leader, you need to trust your data and be 100% certain that it’s well-governed, compliant, and accurate. Including all data, in all sources, and in all locations, without limitations. Understand your data assets. Audit for risk, compliance, and quality in support of your project. Catalog a complete data inventory across all sources and data types, creating a shared understanding of your data assets. Run a one-time, fast, affordable, and accurate audit of your data. PCI, PII, and PHI audits are fast, accurate, and complete. As a service, with no software to purchase. Measure and audit data quality and data duplication across all of your enterprise data assets, cloud-native and on-premises. Comply with global data privacy regulations at scale. Discover, classify, track, trace and audit privacy compliance. Monitor PII/PCI/PHI data propagation and automate DSAR compliance processes.

View Software
30

CData Sync

CData Software

CData Sync is a universal data pipeline that delivers automated continuous replication between hundreds of SaaS applications & cloud data sources and any major database or data warehouse, on-premise or in the cloud. Replicate data from hundreds of cloud data sources to popular database destinations, such as SQL Server, Redshift, S3, Snowflake, BigQuery, and more. Configuring replication is easy: login, select the data tables to replicate, and select a replication interval. Done. CData Sync extracts data iteratively, causing minimal impact on operational systems by only querying and updating data that has been added or changed since the last update. CData Sync offers the utmost flexibility across full and partial replication scenarios and ensures that critical data is stored safely in your database of choice. Download a 30-day free trial of the Sync application or request more information at www.cdata.com/sync

View Software

Previous
1
2
You're on page 3
4
Next

Related Categories

DataOps Data Warehouse Master Data Management Data Entry Data Fabric Data Lake Data Pipeline Data Orchestration Data Ingestion Data Observability Data Exchange Database Management Systems (DBMS)

SourceForge

Open Source Software
Business Software
Add Your Software
Business Software Advertising

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support / Documentation
Site Status
SourceForge Reviews

Terms Privacy Opt Out Advertise

Thanks for helping keep SourceForge clean.

You seem to have CSS turned off. Please don't fill out this field.

Briefly describe the problem (required):

Upload screenshot of ad (required):

Select a file, or drag & drop file here.

✔

✘

Screenshot instructions:

Click URL instructions:
Right-click on the ad, choose "Copy Link", then paste here →
(This may not be possible with some types of ads)

More information about our ad policies

Ad destination/click URL:

Best Data Management Software for Apache Cassandra - Page 3

Compare the Top Data Management Software that integrates with Apache Cassandra as of December 2025 - Page 3

Nightfall

Molecula

JanusGraph

Secuvy AI

WSO2 Enterprise Service Bus

Amundsen

witboost

Apache Hudi

Blueflood

Hawkular Metrics

Heroic

KairosDB

Varada

StreamFlux

Meltano

Semarchy xDI

Luna for Apache Cassandra

QueryPie

Kestra

Salesforce Data Cloud

Borneo

HyperCube

DataStax

CData Python Connectors

OctoData

SSIS Integration Toolkit

BigBI

CipherTrust Tokenization

Data Sentinel

CData Sync

Related Categories