Business Software for Hadoop

Top Software that integrates with Hadoop as of April 2026

Compare business software, products, and services to find the best solution for your business or organization. Use the filters on the left to drill down by category, pricing, features, organization size, organization type, region, user reviews, integrations, and more. View and sort the products and solutions that match your needs in the results below.

  • 1
    Couchbase

    Couchbase

    Couchbase’s operational data platform for AI is a scalable foundation for enterprise operational, analytical, mobile and AI workloads that replaces legacy infrastructure and data services. Bring your data to life in new ways with Couchbase’s enterprise data partnership: launch game-changing customer experiences, explore the infinite possibilities of AI, scale your global operations, and move your data from the cloud to the edge, and beyond. Couchbase’s operational data platform for AI eliminates fragmented tech stacks, so teams can stay innovative and agile, with less risk and lower cost of ownership. With enterprise partnership and scalable, AI-ready technology, Couchbase turns your data into the foundation for your next breakthrough.
  • 2
    ActiveBatch Workload Automation

    ActiveBatch by Redwood

    ActiveBatch by Redwood makes setting up and launching automation easy with no custom scripting required. With a low-code Super REST API adapter, over 100 pre-built job steps and a user-friendly drag-and-drop workflow designer, you can integrate across any system, application and data source, on-prem, in the cloud or in hybrid environments. Maintain complete control and visibility and meet SLAs with monitoring of all automation from a single pane of glass and get custom alerts via emails or SMS. Managed Smart Queues dynamically scale resources for high-volume workloads, reducing process times while the self-service portal enables business users to run and monitor workflows independently. ActiveBatch meets security and compliance standards, with ISO 27001 and SOC 2, Type II certifications, encrypted connections and regular third-party tests, always keeping security at the forefront. Along with ongoing product advancements, get the added benefit of 24x7 support and on-site training.
  • 3
    AnalyticsCreator

    AnalyticsCreator

    AnalyticsCreator is a metadata-driven data warehouse automation application for teams working in the Microsoft data ecosystem. It enables data engineers to design, generate, and maintain production-ready data products across Microsoft SQL Server, Azure Data Factory, and Microsoft Fabric. By using centralized metadata, AnalyticsCreator generates ELT pipelines, dimensional models, historization logic, and analytical models in a consistent, version-controlled way. This reduces manual implementation effort and tool sprawl while ensuring transparency through built-in lineage tracking and clear visibility into data dependencies and change impact. With CI/CD integration via Azure DevOps and GitHub, plus support for custom SQL, AnalyticsCreator helps data teams scale delivery, enforce standards, and maintain control as complexity grows.
  • 4
    Pandora FMS

    Pandora FMS

    With more than 50,000 customer installations across five continents, Pandora FMS is an all-in-one monitoring solution that covers the traditional monitoring silos: servers, networks, applications, logs, synthetic/transactional, remote control, inventory, etc. Pandora FMS gives you the agility to find and solve problems quickly, and it scales to collect data from any source, whether on-premises, multi-cloud, or a mix of both. Now you have that capability across your entire IT stack, with analytics to find any problem, even the ones that are hard to find. With more than 500 plugins available, you can control and manage any application and technology, from SAP, Oracle, Lotus, Citrix or JBoss to VMware, AWS, SQL Server, Red Hat, WebSphere, etc.
    Starting Price: €90/month
  • 5
    Composable DataOps Platform

    Composable Analytics

    Composable is an enterprise-grade DataOps platform built for business users that want to architect data intelligence solutions and deliver operational data-driven products leveraging disparate data sources, live feeds, and event data regardless of the format or structure of the data. With a modern, intuitive dataflow visual designer, built-in services to facilitate data engineering, and a composable architecture that enables abstraction and integration of any software or analytical approach, Composable is the leading integrated development environment to discover, manage, transform and analyze enterprise data.
    Starting Price: $8/hr - pay-as-you-go
  • 6
    Peekdata

    Peekdata

    Consume data from any database, organize it into consistent metrics, and use it with every app. Build your Data and Reporting APIs faster with automated SQL generation, query optimization, access control, consistent metrics definitions, and API design. It takes only days to wrap any data source with a single reference Data API and simplify access to reporting and analytics data across your teams. Make it easy for data engineers and application developers to access the data from any source in a streamlined manner.
    - The single schema-less Data API endpoint
    - Review and configure metrics and dimensions in one place via UI
    - Data model visualization to make faster decisions
    - Data Export management scheduling AP
    Ready-to-use Report Builder and JavaScript components for charting libraries (Highcharts, BizCharts, Chart.js, etc.) make it easy to embed data-rich functionality into your products. And you will not have to make custom report queries anymore!
    Starting Price: $349 per month
  • 7
    Zuar Runner

    Zuar, Inc.

    Utilizing the data that's spread across your organization shouldn't be so difficult! With Zuar Runner you can automate the flow of data from hundreds of potential sources into a single destination. Collect, transform, model, warehouse, report, monitor and distribute: it's all managed by Zuar Runner. Pull data from Amazon/AWS products, Google products, Microsoft products, Avionte, Backblaze, BioTrackTHC, Box, Centro, Citrix, Coupa, DigitalOcean, Dropbox, CSV, Eventbrite, Facebook Ads, FTP, Firebase, Fullstory, GitHub, Hadoop, Hubic, Hubspot, IMAP, Jenzabar, Jira, JSON, Koofr, LeafLogix, Mailchimp, MariaDB, Marketo, MEGA, Metrc, OneDrive, MongoDB, MySQL, Netsuite, OpenDrive, Oracle, Paycom, pCloud, Pipedrive, PostgreSQL, put.io, Quickbooks, RingCentral, Salesforce, Seafile, Shopify, Skybox, Snowflake, Sugar CRM, SugarSync, Tableau, Tamarac, Tardigrade, Treez, Wurk, XML Tables, Yandex Disk, Zendesk, Zoho, and more!
  • 8
    SCIKIQ

    SCIKIQ

    We help make AI possible for enterprises. SCIKIQ is a unified AI and data platform designed to move enterprises from fragmented data to production-ready AI. By combining a Unified Data Layer with a powerful Data Hub & AI Co-pilot, SCIKIQ eliminates data silos and provides a "single version of truth" across your entire organization. SCIKIQ brings together everything an enterprise needs to scale AI: integrations, clean data, trusted governance, semantic context, real-time orchestration, and intelligent agents, all in one platform. Recognized Leader: Named a Top 34 AI Platform by Forrester and a Tech30 company by YourStory. Global Validation: Selected by AWS for showcase at MWC and re:Invent for its product innovation. The companies we work with are leaders in their categories, including leading banks, financial organisations, and companies in retail, manufacturing, supply chain and other industries. A no-code, cloud-agnostic Platform-as-a-Service with 30-90 day installation and fastest ROI.
  • 9
    Kyvos Semantic Layer

    Kyvos Insights

    Kyvos is a semantic layer for AI and BI. It gives organizations a single, consistent, business-friendly view of their entire data estate. By standardizing how metrics, KPIs, and business logic are defined across tools, Kyvos eliminates metric drift across BI tools and ensures that LLMs and AI agents work with governed business semantics rather than raw tables. Kyvos also delivers sub-second analytics at massive scale and high concurrency, including granular multidimensional analysis on the cloud, without the sluggish query times and escalating cloud costs that typically come with it.
  • 10
    Netdata

    Netdata, Inc.

    The open-source observability platform everyone needs! Netdata collects metrics per second and presents them in beautiful low-latency dashboards. It is designed to run on all of your physical and virtual servers, cloud deployments, Kubernetes clusters, and edge/IoT devices, to monitor your systems, containers, and applications. It scales nicely from just a single server to thousands of servers, even in complex multi/mixed/hybrid cloud environments, and given enough disk space it can keep your metrics for years. KEY FEATURES: 💥 Collects metrics from 800+ integrations 💪 Real-Time, Low-Latency, High-Resolution 😶‍🌫️ Unsupervised Anomaly Detection 🔥 Powerful Visualization 🔔 Out of box Alerts 📖 systemd Journal Logs Explorer 😎 Low Maintenance ⭐ Open and Extensible Try Netdata today and feel the pulse of your infrastructure, with high-resolution metrics, journal logs and real-time visualizations.
    Starting Price: Free
  • 11
    MongoDB

    MongoDB

    MongoDB is a general purpose, document-based, distributed database built for modern application developers and for the cloud era. No database is more productive to use. Ship and iterate 3–5x faster with our flexible document data model and a unified query interface for any use case. Whether it’s your first customer or 20 million users around the world, meet your performance SLAs in any environment. Easily ensure high availability, protect data integrity, and meet the security and compliance standards for your mission-critical workloads. An integrated suite of cloud database services that allow you to address a wide variety of use cases, from transactional to analytical, from search to data visualizations. Launch secure mobile apps with native, edge-to-cloud sync and automatic conflict resolution. Run MongoDB anywhere, from your laptop to your data center.
    Starting Price: Free
  • 12
    Flex83

    IoT83

    Re-imagine IoT innovation with the Flex83 Application Enablement Platform! Build compelling & powerful IoT solutions up to 80% faster & at a fraction of the cost.
    - Use no-code workflows to build professional-grade connect/monitor/analyze/manage solutions fast.
    - Use low-code tools to connect to virtually anything, add custom business logic, build analytics, custom dashboards, and launch multiple applications.
    - Use the hassle-free SaaS model to build & prove your solution, and then scale using a "pay as you grow" model!
    You can create sophisticated IoT applications, literally in a day, with tools & workflows that give you the agility to build what your business or customers need without worrying about long development cycles, underlying complexity, or huge budgets. Iteratively enhance your solution to broaden your capabilities and drive more customer value. And, proven to 65M devices, you know the Flex83 platform is reliable! Give Flex83 a try today!
    Starting Price: $200 per month
  • 13
    Jupyter Notebook

    Project Jupyter

    The Jupyter Notebook is an open-source web application that allows you to create and share documents that contain live code, equations, visualizations and narrative text. Uses include: data cleaning and transformation, numerical simulation, statistical modeling, data visualization, machine learning, and much more.
  • 14
    Pentaho

    Hitachi Vantara

    With an integrated product suite providing data integration, analytics, cataloging, optimization and quality, Pentaho+ enables seamless data management, driving innovation and informed decision-making. Pentaho+ has helped customers achieve a 3x improvement in data trust, a 7x increase in impactful business results and, most importantly, a 70% increase in productivity.
  • 15
    Apache Cassandra

    Apache Software Foundation

    The Apache Cassandra database is the right choice when you need scalability and high availability without compromising performance. Linear scalability and proven fault-tolerance on commodity hardware or cloud infrastructure make it the perfect platform for mission-critical data. Cassandra's support for replicating across multiple datacenters is best-in-class, providing lower latency for your users and the peace of mind of knowing that you can survive regional outages.
  • 16
    SingleStore

    SingleStore

    SingleStore (formerly MemSQL) is a distributed, highly scalable SQL database that can run anywhere. We deliver maximum performance for transactional and analytical workloads with familiar relational models. SingleStore is a scalable SQL database that ingests data continuously to perform operational analytics for the front lines of your business. Ingest millions of events per second with ACID transactions while simultaneously analyzing billions of rows of data in relational SQL, JSON, geospatial, and full-text search formats. SingleStore delivers ultimate data ingestion performance at scale and supports built-in batch loading and real-time data pipelines. SingleStore lets you achieve ultra-fast query response across both live and historical data using familiar ANSI SQL. Perform ad hoc analysis with business intelligence tools, run machine learning algorithms for real-time scoring, and perform geoanalytic queries in real time.
    Starting Price: $0.69 per hour
  • 17
    Cleo Integration Cloud
    Cleo Integration Cloud (CIC) is award-winning EDI software that enables the best B2B integration, visibility and control. CIC accelerates EDI automation, expedites partner onboarding, and easily tackles EDI issue resolution. It brings end-to-end integration visibility across EDI, non-EDI, and API integrations, enabling you to grow your revenue-generating business processes better and faster. CIC is optimizing thousands of supply chains for logistics providers, manufacturers, and wholesalers. Encompassing seamless ERP integration, WMS integration, TMS integration and more, our cloud-based B2B integration platform transforms costly, complicated processes into truly efficient, agile, and scalable operations. Our ecosystem integration approach offers the best B2B capabilities so you can automate EDI and API transactions, rapidly onboard partners, and gain competitive control.
  • 18
    IBM DevOps Deploy
    Continuous delivery of any application to any environment. IBM DevOps Deploy (formerly IBM UrbanCode Deploy) is an application-release solution that combines continuous delivery and deployment automation with robust visibility, traceability and auditing capabilities. Increase frequency of software delivery through automated, repeatable deployment processes across development, testing and production. Simplify the deployment of multichannel applications to all environments, whether on premises or in the cloud, with consistency and repeatability. Use a single centralized server to manage tens of thousands of endpoints to any number of clouds, data centers or mainframes. Make processes more robust and easier to design by using tested integrations with dozens of tools and technologies, including Jira, Jenkins, Kubernetes, Microsoft, ServiceNow and WebSphere.
  • 19
    Qlik Cloud Analytics
    The modern analytics era truly began with the launch of QlikView, our first analytics solution, and the game-changing associative engine it is built on. It revolutionized the way organizations use data with intuitive visual discovery that put business intelligence in the hands of more people than ever. And we continue to lead the way with Qlik Cloud® Analytics for a cloud-based SaaS deployment and Qlik Sense® for an on-premises solution. Both options augment and enhance human intuition with AI-powered insights, and help your team move from passive to active analytics for real-time collaboration and action. Take advantage of analytics in the cloud and on-premises. You get maximum choice and deployment flexibility when deciding where to store, transform, and analyze your data.
  • 20
    ER/Studio Enterprise Edition
    ER/Studio is an enterprise data modeling and architecture platform that enables organizations to design, manage, and govern data assets across complex, distributed environments, including data warehouses, lakehouses, data mesh frameworks, and data vault architectures. It connects business requirements to technical implementation through conceptual, logical, and physical models, providing clarity from strategy through deployment. By establishing a consistent modeling foundation, ER/Studio creates a reliable, shared view of enterprise data that supports analytics, AI initiatives, modernization, compliance, and operational systems. Design data models and keep teams aligned with ER/Studio’s multi-user shared repository and web-based collaboration portal, Team Server. The repository supports version control, role-based access, parallel development, and change tracking so modelers can work simultaneously without conflict, preserving integrity and full history.
    Starting Price: $2,687 per user
  • 21
    StarTree

    StarTree

    StarTree, powered by Apache Pinot™, is a fully managed real-time analytics platform built for customer-facing applications that demand instant insights on the freshest data. Unlike traditional data warehouses or OLTP databases, which are optimized for back-office reporting or transactions, StarTree is engineered for real-time OLAP at true scale, meaning:
    - Data Volume: query performance sustained at petabyte scale
    - Ingest Rates: millions of events per second, continuously indexed for freshness
    - Concurrency: thousands to millions of simultaneous users served with sub-second latency
    With StarTree, businesses deliver always-fresh insights at interactive speed, enabling applications that personalize, monitor, and act in real time.
    Starting Price: Free
  • 22
    Trino

    Trino

    Trino is a query engine that runs at ludicrous speed: a fast, distributed SQL query engine for big data analytics that helps you explore your data universe. Trino is a highly parallel and distributed query engine that is built from the ground up for efficient, low-latency analytics. The largest organizations in the world use Trino to query exabyte-scale data lakes and massive data warehouses alike. It supports diverse use cases: ad hoc analytics at interactive speeds, massive multi-hour batch queries, and high-volume apps that perform sub-second queries. Trino is an ANSI SQL-compliant query engine that works with BI tools such as R, Tableau, Power BI, Superset, and many others. You can natively query data in Hadoop, S3, Cassandra, MySQL, and many others, without the need for complex, slow, and error-prone processes for copying the data. Access data from multiple systems within a single query.
    Starting Price: Free
  • 23
    Style Intelligence
    Style Intelligence by InetSoft is a complete business intelligence (BI) software platform that empowers companies to explore, analyze, monitor, report, and collaborate on critical business and operational data from disparate sources in real time. Its top features include a real-time data mashup Data Block architecture, professional atomic data block modeling tool, and database write-back option. Robust and easy to use, Style Intelligence is also fully scalable and offers granular security, multi-tenancy support, and multiple integrations. InetSoft's cloud flexible business intelligence solution delivers the benefit of cloud computing and software-as-a-service while giving you the maximum level of control. In terms of software-as-a-service, BI software is unique because it inherently depends on the data not being embedded in the application. InetSoft provides free expert fast-start mentoring that delivers the expertise even when no in-house dedicated BI expert is available.
    Starting Price: $165/month
  • 24
    DreamFactory

    DreamFactory Software

    DreamFactory Software is the fastest way to build secure, internal REST APIs. Instantly generate APIs from any database with built-in enterprise security controls that operate on-premises, air-gapped, or in the cloud. Develop 4x faster, save 70% on new projects, remove project management uncertainty, focus talent on truly critical issues, win more clients, and integrate with newer & legacy technologies instantly as needed. DreamFactory is the easiest and fastest way to automatically generate, publish, manage, and secure REST APIs, convert SOAP to REST, and aggregate disparate data sources through a single API platform. See why companies like Disney, Bosch, Netgear, T-Mobile, Intel, and many more are embracing DreamFactory's innovative platform to get a competitive edge. Start a hosted trial or talk to our engineers to get access to an on-prem environment!
    Starting Price: $1500/month
  • 25
    Toucan

    Toucan

    Toucan is a customer-facing analytics platform that empowers organizations to drive engagement with the best end-user experience. From data connections to the distribution of insights anywhere they're needed, Toucan makes it easy. As a result, Toucan analytics are used 3x more than the industry average. Users can connect to any data, cloud-based or other, streaming or stored, with hundreds of connectors. Preparation of data is equally simple, with data readiness features that let business people perform tasks that would ordinarily require an expert. Visualization takes the form of “data storytelling” where every chart is accompanied by context, collaboration, and annotation so that users understand the “why” and not just the “what” of their data. Finally, deployment and management are made easy with one-touch deployment from staging to production, easy embedding, and publishing to any device.
  • 26
    Bacula Enterprise

    Bacula Systems

    Bacula Enterprise delivers Physical, Virtual, Container and Hybrid Cloud Backup & Recovery software for the Modern Data Center, all from a single platform. Designed for medium and large organizations, Bacula Enterprise backup and recovery software brings unique innovation, modern architecture, business value benefits and low cost of ownership. The Bacula Enterprise corporate data backup software solution uses exclusive technologies that increase the interoperability, power, flexibility and functionality of Bacula Enterprise across a wide range of IT environments such as enterprise data centers, managed service providers, software vendors or cloud providers. Thousands of organizations worldwide use Bacula Enterprise in mission-critical environments, including NASA, Texas A&M University, Unicredit, Swisscom, Sky, and many more. Bacula provides additional security features over other vendors and offers advanced, hybrid Cloud connectivity to Amazon S3, Google, Oracle and many more.
  • 27
    IBM StreamSets
    IBM® StreamSets enables users to create and manage smart streaming data pipelines through an intuitive graphical interface, facilitating seamless data integration across hybrid and multicloud environments. This is why leading global companies rely on IBM StreamSets to support millions of data pipelines for modern analytics, intelligent applications and hybrid integration. Decrease data staleness and enable real-time data at scale—handling millions of records of data, across thousands of pipelines within seconds. Insulate data pipelines from change and unexpected shifts with drag-and-drop, prebuilt processors designed to automatically identify and adapt to data drift. Create streaming pipelines to ingest structured, semistructured or unstructured data and deliver it to a wide range of destinations.
    Starting Price: $1000 per month
  • 28
    Prometheus

    Prometheus

    Power your metrics and alerting with a leading open-source monitoring solution. Prometheus fundamentally stores all data as time series: streams of timestamped values belonging to the same metric and the same set of labeled dimensions. Besides stored time series, Prometheus may generate temporary derived time series as the result of queries. Prometheus provides a functional query language called PromQL (Prometheus Query Language) that lets the user select and aggregate time series data in real time. The result of an expression can either be shown as a graph, viewed as tabular data in Prometheus's expression browser, or consumed by external systems via the HTTP API. Prometheus is configured via command-line flags and a configuration file. The command-line flags configure immutable system parameters (such as storage locations, amount of data to keep on disk and in memory, etc.). Download: https://sourceforge.net/projects/prometheus.mirror/
    Starting Price: Free
  • 29
    Enterprise Recon

    Ground Labs

    With Enterprise Recon by Ground Labs, organizations can find and remediate sensitive information across the broadest range of structured and unstructured data, whether it’s stored on your servers, on your employees’ devices, or in the cloud. Enterprise Recon enables organizations worldwide to seamlessly discover all data and comply with 50+ country regulations, including GDPR, PCI DSS, CCPA, HIPAA, Australian Privacy and other data security standards that require the ability to locate and secure PII data as well as information on gender, ethnicity and health… or even non-PII financial data. Enterprise Recon is powered by GLASS™, Ground Labs' proprietary technology that enables the quickest and most accurate data discovery across the broadest set of platforms available. Enterprise Recon natively supports sensitive data discovery on Windows, macOS, Linux, FreeBSD, Solaris, HP-UX and IBM AIX using agent and agentless options. Additional remote options also enable almost any network data stored.
  • 30
    IRI DMaaS

    IRI, The CoSort Company

    Data may be the most important asset, and risk, that your company holds. It describes customers, products, transaction histories, and everything else that you use and plan in business. This data can be in databases, files, spreadsheets, Hadoop, cloud platforms or apps. If you don't have the time or expertise to find and de-identify the personally identifiable information (PII) in those sources yourself, IRI Data Masking as a Service (DMaaS) can help. With IRI DMaaS, you can minimize risk and cost because you only pay for the data you need protected. IRI can do all, or some, of the work to classify, find, and mask that data. IRI can also provide your auditors with the logs and targets that verify that your sensitive data was protected and now complies with privacy laws. To facilitate the service, you can transfer unprotected data to a secure on-premise or cloud-based staging area, or provide IRI with remote, supervised access to the data source(s) at issue under a strict NDA.
    Starting Price: $1000 per day