Business Software for Hadoop - Page 3

Top Software that integrates with Hadoop as of August 2025 - Page 3

  • 1
    Oracle Big Data Service
    Oracle Big Data Service makes it easy for customers to deploy Hadoop clusters of all sizes, with VM shapes ranging from 1 OCPU to a dedicated bare metal environment. Customers choose between high-performance NVMe storage or cost-effective block storage, and can grow or shrink their clusters. Quickly create Hadoop-based data lakes to extend or complement customer data warehouses, and ensure that all data is both accessible and managed cost-effectively. Query, visualize and transform data so data scientists can build machine learning models using the included notebook with its R, Python and SQL support. Move customer-managed Hadoop clusters to a fully-managed cloud-based service, reducing management costs and improving resource utilization.
    Starting Price: $0.1344 per hour
  • 2
    IBM Spectrum Symphony
    IBM Spectrum Symphony® software delivers powerful enterprise-class management for running compute-intensive and data-intensive distributed applications on a scalable, shared grid. It accelerates dozens of parallel applications for faster results and better utilization of all available resources. With IBM Spectrum Symphony, you can improve IT performance, reduce infrastructure costs and quickly meet business demands. Get faster throughput and performance for compute-intensive and data-intensive analytics applications to accelerate time-to-results. Achieve higher levels of resource utilization by controlling and optimizing the massive compute power available in your technical computing systems. Reduce infrastructure, application development, deployment and management costs by gaining control of large-scale jobs.
  • 3
    AdvancedMiner

    Algolytics Technologies

    Algolytics provides software solutions and consulting services in the area of predictive analytics, risk management, data quality, social network analysis and advanced analysis of large data sets. Try one tool for data processing, analysis and modeling! The user-friendly workflow interface helps you explore all your data & more. Extract and save data from/to different database systems and files, and apply data transformations. Perform a wide range of operations on data, such as sampling, joining datasets, and dividing. AdvancedMiner offers limitless functionalities for advanced users that can be easily created and/or extended within the application. It supports the SQL language (including analytical functions).
  • 4
    IRI Voracity

    IRI, The CoSort Company

    Voracity is the only high-performance, all-in-one data management platform accelerating AND consolidating the key activities of data discovery, integration, migration, governance, and analytics. Voracity helps you control your data in every stage of the lifecycle, and extract maximum value from it. Only in Voracity can you: 1) CLASSIFY, profile and diagram enterprise data sources 2) Speed or LEAVE legacy sort and ETL tools 3) MIGRATE data to modernize and WRANGLE data to analyze 4) FIND PII everywhere and consistently MASK it for referential integrity 5) Score re-ID risk and ANONYMIZE quasi-identifiers 6) Create and manage DB subsets or intelligently synthesize TEST data 7) Package, protect and provision BIG data 8) Validate, scrub, enrich and unify data to improve its QUALITY 9) Manage metadata and MASTER data. Use Voracity to comply with data privacy laws, de-muck and govern the data lake, improve the reliability of your analytics, and create safe, smart test data.
  • 5
    Datatron

    Datatron

    Datatron offers tools and features built from scratch, specifically to make machine learning in production work for you. Most teams discover that there’s more to it than just deploying models, which is already a very manual and time-consuming task. Datatron offers a single model governance and management platform for all of your ML, AI, and Data Science models in production. We help you automate, optimize, and accelerate your ML models to ensure that they are running smoothly and efficiently in production. Data Scientists use a variety of frameworks to build the best models. We support anything you’d build a model with (e.g. TensorFlow, H2O, Scikit-Learn, and SAS). Explore models built and uploaded by your data science team, all from one centralized repository. Create a scalable model deployment in just a few clicks. Deploy models built using any language or framework. Make better decisions based on your model performance.
  • 6
    Xtendlabs

    Xtendlabs

    Installing and configuring today’s complex software technology platforms takes an extraordinary investment in time and resources. Not with Xtendlabs. Xtendlabs Emerging Technology Platform-as-a-Service provides immediate access to emerging Big Data, Data Sciences, and Database technology platforms online, from any device and location, 24/7. Xtendlabs are available on-demand, any time, from any location, including home, office or the road. Xtendlabs scale to meet your needs on-demand, so you can focus on your business problem and learning rather than struggling to find and set up infrastructure. Just sign in to get immediate access to your virtual lab environment. Xtendlabs requires no virtual machine installation, system setup or configuration, saving valuable time and resources. Pay as you go monthly. With Xtendlabs there are no upfront investments in software or hardware.
  • 7
    Warp 10
    Warp 10 is a modular open source platform that collects, stores, and analyzes data from sensors. Shaped for the IoT with a flexible data model, Warp 10 provides a unique and powerful framework to simplify your processes from data collection to analysis and visualization, with the support of geolocated data in its core model (called Geo Time Series). Warp 10 is both a time series database and a powerful analytics environment, supporting statistics, feature extraction for training models, data filtering and cleaning, pattern and anomaly detection, synchronization, and even forecasting. The analysis environment can be implemented within a large ecosystem of software components such as Spark, Kafka Streams, Hadoop, Jupyter, Zeppelin and many more. It can also access data stored in many existing solutions, such as relational or NoSQL databases, search engines and S3-type object storage systems.
  • 8
    Promethium

    Promethium

    Promethium helps data and analytics teams work smarter so they can stay ahead of growing data volumes and business needs. Simply connecting to a data warehouse or data lake to get access to raw data is not enough. Datasets require a lot of hard work from data teams! Data teams aren't growing as fast as data volumes or business demand for data. Promethium helps overloaded data teams work smarter so they can deliver faster. Rely less on ETL by accessing data on demand where it lives. Moving less data saves time and money. With Promethium one person can do in minutes what typically takes a team months using 6 or more tools. With a few clicks and without writing code, connect and catalog data sources and create and query cross-source datasets. Less custom code and ETL. Validate that data is correct in real time, not after months of work and ETL. Instantly share work so that it is reused, instead of recreated.
  • 9
    Hosting UK

    Hosting UK

    We make domain names easy: just search, buy and use. Secure your domain name today and get free web and email forwarding with full DNS management from a great control panel. Linux or Windows, beginner or expert, we have a plan for you. Fast, affordable, solid web hosting with ASP.NET, ASP Classic, and PHP on Windows Server 2019 with SQL Server 2016, or Linux hosting with PHP, MySQL and Ruby. Screamingly fast SSD-based VPS servers. Choose between Windows or a choice of Linux operating systems and control panels including Plesk and cPanel on our reliable and self-healing cloud. Full administrator or root access for when you need full control and a solution fast. Fast and reliable Dell dedicated servers connected to the fastest network. With a range of managed or un-managed servers giving you a rock-solid platform all backed up with great UK-based support for total peace of mind.
    Starting Price: $3.91 per month
  • 10
    SAS Federation Server
    Create federated source data names to enable users to access multiple data sources via the same connection. Use the web-based administrative console for simplified maintenance of user access, privileges and authorizations. Apply data quality functions such as match-code generation, parsing and other tasks inside the view. Improved performance with in-memory data caches & scheduling. Secured information with data masking & encryption. Lets you keep application queries current and available to users, and reduce loads on operational systems. Enables you to define access permissions for a user or group at the catalog, schema, table, column and row levels. Advanced data masking and encryption capabilities let you determine not only who’s authorized to view your data, but also what they see on an extremely granular level. It all helps ensure sensitive data doesn’t fall into the wrong hands.
  • 11
    IBM Db2 Big SQL
    A hybrid SQL-on-Hadoop engine delivering advanced, security-rich data query across enterprise big data sources, including Hadoop, object storage and data warehouses. IBM Db2 Big SQL is an enterprise-grade, hybrid ANSI-compliant SQL-on-Hadoop engine, delivering massively parallel processing (MPP) and advanced data query. Db2 Big SQL offers a single database connection or query for disparate sources such as Hadoop HDFS and WebHDFS, RDBMS, NoSQL databases, and object stores. Benefit from low latency, high performance, data security, SQL compatibility, and federation capabilities to do ad hoc and complex queries. Db2 Big SQL is now available in 2 variations. It can be integrated with Cloudera Data Platform, or accessed as a cloud-native service on the IBM Cloud Pak® for Data platform. Access and analyze data and perform queries on batch and real-time data across sources, like Hadoop, object stores and data warehouses.
  • 12
    Oracle Big Data SQL Cloud Service
    Oracle Big Data SQL Cloud Service enables organizations to immediately analyze data across Apache Hadoop, NoSQL and Oracle Database leveraging their existing SQL skills, security policies and applications with extreme performance. From simplifying data science efforts to unlocking data lakes, Big Data SQL makes the benefits of Big Data available to the largest group of end users possible. Big Data SQL gives users a single location to catalog and secure data in Hadoop, NoSQL systems, and Oracle Database. It provides seamless metadata integration and queries that join data from Oracle Database with data from Hadoop and NoSQL databases. Utilities and conversion routines support automatic mappings from metadata stored in HCatalog (or the Hive Metastore) to Oracle Tables. Enhanced access parameters give administrators the flexibility to control column mapping and data access behavior. Multiple cluster support enables one Oracle Database to query multiple Hadoop clusters and/or NoSQL systems.
  • 13
    ThinkData Works

    ThinkData Works

    Data is the backbone of effective decision-making. However, employees spend more time managing it than using it. ThinkData Works provides a robust catalog platform for discovering, managing, and sharing data from both internal and external sources. Enrichment solutions combine partner data with your existing datasets to produce uniquely valuable assets that can be shared across your entire organization. Unlock the value of your data investment by making data teams more efficient, improving project outcomes, replacing multiple existing tech solutions, and providing you with a competitive advantage.
  • 14
    Huawei Cloud Data Lake Governance Center
    Simplify big data operations and build intelligent knowledge libraries with Data Lake Governance Center (DGC), a one-stop data lake operations platform that manages data design, development, integration, quality, and assets. Build an enterprise-class data lake governance platform with an easy-to-use visual interface. Streamline data lifecycle processes, utilize metrics and analytics, and ensure good governance across your enterprise. Define and monitor data standards, and get real-time alerts. Build data lakes quicker by easily setting up data integrations, models, and cleaning rules, to enable the discovery of new reliable data sources. Maximize the business value of data. With DGC, end-to-end data operations solutions can be designed for scenarios such as smart government, smart taxation, and smart campus. Gain new insights into sensitive data across your entire organization. DGC allows enterprises to define business catalogs, classifications, and terms.
    Starting Price: $428 one-time payment
  • 15
    WEBDEV

    Windev

    With responsive web design, WEBDEV allows you to easily develop Internet and Intranet sites and applications (WEB & SaaS) to manage data and processes. WEBDEV also generates PHP. WINDEV supports all databases. WEBDEV also supports all the databases that use ODBC drivers or OLEDB providers. The WINDEV, WEBDEV and WINDEV Mobile environments are compatible and share project elements. It has never been easier to build multi-target applications. The developer can focus on key business requirements, and not on the code, so applications can finally meet your needs. Up to 20 times less code, develop applications in no time! Shorter time to market allows you to gain market share. Software is easier to develop and more reliable. Complete application RAD generator for PC, web, and mobile, template creation (patterns, inheritance & MVP). The ease of use and speed allow you to develop and realize even your most ambitious projects.
    Starting Price: $1,703 one-time payment
  • 16
    jethro

    jethro

    Data-driven decision-making has unleashed a surge of business data and a rise in user demand to analyze it. This trend drives IT departments to migrate off expensive Enterprise Data Warehouses (EDW) toward cost-effective Big Data platforms like Hadoop or AWS. These new platforms come with a Total Cost of Ownership (TCO) that is about 10 times lower. They are not ideal for interactive BI applications, however, as they fail to match the high performance and user concurrency of legacy EDWs. For this exact reason, we developed Jethro. Customers use Jethro for interactive BI on Big Data. Jethro is a transparent middle tier that requires no changes to existing apps or data. It is self-driving with no maintenance required. Jethro is compatible with BI tools like Tableau, Qlik, and MicroStrategy and is data source agnostic. Jethro delivers on the demands of business users, allowing thousands of concurrent users to run complicated queries over billions of records.
  • 17
    FairCom EDGE
    FairCom EDGE simplifies the integration of sensor and machine data at the source – whether it’s a factory, water treatment plant, oil platform or wind farm. The world’s first converged IoT/Industrial IoT hub, FairCom EDGE unifies messaging, persistence and analytics with an all-in-one solution – complete with browser-based administration, configuration and monitoring. FairCom EDGE supports MQTT and OPC UA for machine-to-machine (M2M) communication, SQL for interactive analytics and HTTP/REST for real-time monitoring. It continuously retrieves data from sensors and machines with OPC UA support, and receives messages from those with MQTT support. The data is automatically parsed, persisted and made accessible via MQTT and SQL.
    Starting Price: Free
  • 18
    NXLog

    NXLog

    Achieve complete security observability with powerful insights from your log data. Improve your infrastructure visibility and enhance threat prevention with a versatile multi-platform tool. With support for over 100 operating system versions and more than 120 configurable modules, gain comprehensive insights and increased security. Cut the cost of your SIEM solution by reducing noisy and unnecessary log data. Filter events, truncate unused fields, and remove duplicates to increase the quality of your logs. Collect and aggregate logs from systems across the entire breadth of your organization with a single tool. Reduce complexity in managing security-related events and decrease detection and response times. Empower your organization to meet compliance requirements by centralizing some logs in an SIEM and archiving others in your long-term storage. NXLog Platform is an on-premises solution for centralized log management, with versatile processing.
  • 19
    IBM watsonx.data
    Put your data to work, wherever it resides, with the open, hybrid data lakehouse for AI and analytics. Connect your data from anywhere, in any format, and access through a single point of entry with a shared metadata layer. Optimize workloads for price and performance by pairing the right workloads with the right query engine. Embed natural-language semantic search without the need for SQL, so you can unlock generative AI insights faster. Manage and prepare trusted data to improve the relevance and precision of your AI applications. Use all your data, everywhere. With the speed of a data warehouse, the flexibility of a data lake, and special features to support AI, watsonx.data can help you scale AI and analytics across your business. Choose the right engines for your workloads. Flexibly manage cost, performance, and capability with access to multiple open engines including Presto, Presto C++, Spark, Milvus, and more.
  • 20
    eQube®-DaaS

    eQ Technologic

    Our platform establishes a data fabric with a connected network of integrated data, applications, and devices that puts the power of analytics in the hands of end users leading to actionable insight. Data from any source can be aggregated using eQube's data virtualization layer and exposed as a web service, REST service, OData service, or API. Efficiently and rapidly integrate many legacy systems and new COTS (Commercial off-the-shelf) systems. Responsibly retire legacy systems in an orderly manner without disrupting the business. Provide on-demand 'visibility' across the business processes with analytics and business intelligence (A/BI) capabilities. eQube®-MI-based application integration infrastructure can be readily extended for secure, scalable, and robust information collaboration across networks, partners, suppliers, and customers that are geographically dispersed.
  • 21
    Alibaba Cloud Data Integration
    Alibaba Cloud Data Integration is a comprehensive data synchronization platform that facilitates both real-time and offline data exchange across various data sources, networks, and locations. It supports data synchronization between more than 400 pairs of disparate data sources, including RDS databases, semi-structured storage, non-structured storage (such as audio, video, and images), NoSQL databases, and big data storage. The platform also enables real-time data reading and writing between data sources such as Oracle, MySQL, and DataHub. Data Integration allows users to schedule offline tasks by setting specific trigger times, including year, month, day, hour, and minute, simplifying the configuration of periodic incremental data extraction. It integrates seamlessly with DataWorks data modeling, providing an operations and maintenance integrated workflow. The platform leverages the computing capability of Hadoop clusters to synchronize HDFS data to MaxCompute.
  • 22
    Qlik Sense
    Empower people at all skill levels to make data-driven decisions and take action when it matters most. Deeper interactivity. Broader context. Lightning fast. No one else compares. Qlik’s one-of-a-kind Associative technology brings unmatched power to the core of our industry-leading analytics experience. Empower all your users to explore freely at the speed of thought with hyperfast calculations, always in context, at scale. Yeah, it’s a big deal. And it’s why Qlik Sense takes you way beyond the limits of query-based analytics and dashboards our competitors offer. Insight Advisor in Qlik Sense uses AI to help your users understand and use data more effectively, minimizing cognitive bias, amplifying discovery, and elevating data literacy. Organizations need a dynamic relationship with information that reflects the current moment. Traditional, passive BI falls short.
  • 23
    Couchbase

    Couchbase

    Unlike other NoSQL databases, Couchbase provides an enterprise-class, multicloud to edge database that offers the robust capabilities required for business-critical applications on a highly scalable and available platform. As a distributed cloud-native database, Couchbase runs in modern dynamic environments and on any cloud, either customer-managed or fully managed as-a-service. Couchbase is built on open standards, combining the best of NoSQL with the power and familiarity of SQL, to simplify the transition from mainframe and relational databases. Couchbase Server is a multipurpose, distributed database that fuses the strengths of relational databases such as SQL and ACID transactions with JSON’s versatility, with a foundation that is extremely fast and scalable. It’s used across industries for things like user profiles, dynamic product catalogs, GenAI apps, vector search, high-speed caching, and much more.
  • 24
    Hyper Historian
    ICONICS’ Hyper Historian™ is an advanced 64-bit high-speed, reliable, and robust historian. Designed for the most mission-critical applications, Hyper Historian's advanced high compression algorithm delivers unparalleled performance with very efficient use of resources. Hyper Historian integrates with our ISA-95-compliant asset database and the latest big data technologies, including Azure SQL, Microsoft Data Lakes, Kafka, and Hadoop. This makes Hyper Historian the most efficient and secure real-time plant historian for any Microsoft operating system. Hyper Historian includes a module for automatic or manual insertion of data, empowering users to import historical or log data from databases, other historians, or intermittently connected field devices and equipment. This also provides for greatly increased reliability in capturing all data, even when network disruptions occur. Leverage rapid collection for enterprise-wide storage.
  • 25
    Mage Sensitive Data Discovery
    Uncover hidden sensitive data locations within your enterprise through Mage's patented Sensitive Data Discovery module. Find data hidden in all types of data stores in the most obscure locations, be it structured, unstructured, Big Data, or on the Cloud. Leverage the power of Artificial Intelligence and Natural Language Processing to uncover data in the most complex of locations. Ensure efficient identification of sensitive data with minimal false positives with a patented approach to data discovery. Configure any additional data classifications over and above the 70+ out of the box data classifications covering all popular PII and PHI data. Schedule sample, full, or even incremental scans through a simplified discovery process.
  • 26
    Deep.BI

    Deep BI

    Deep.BI enables Media, Insurance, E-commerce and Banking enterprises to effectively increase revenues by anticipating specific user behaviors then automating actions to convert these users to paying customers and retaining them. Predictive customer data platform with real-time user scoring, based on Deep.BI's next-gen, enterprise data warehouse. We help digital businesses and platforms improve their products, content and distribution. Deep.BI's platform collects extensive data about product usage and content consumption and provides real-time, actionable insights. Real-time, actionable insights are generated within seconds through the Deep.Conveyor data pipeline, available for analysis in the Deep.Explorer business intelligence platform, augmented through the Deep.Score event scoring engine built with custom AI algorithms for your use case, and are ready for automation using the Deep.Conductor high-speed API and AI model serving platform.
  • 27
    Oracle Big Data Discovery
    Oracle Big Data Discovery is a stunningly visual, intuitive product that leverages the power of Hadoop to transform raw data into business insight in minutes, without the need to learn complex tools or rely only on highly specialized resources. With Oracle Big Data Discovery, customers can easily find relevant data sets in Hadoop, explore the data and quickly understand its potential, transform and enrich data to make it better, analyze the data to discover new insights, share results and publish back to Hadoop for use across the enterprise. In your organization, use BDD as the center of your data lab, as a unified environment for navigating and exploring all of your data sources in Hadoop, and to create projects and BDD applications. In BDD, a wider range of people can work with big data, compared with traditional analytics tools. You spend less time on data loading and updates, and can focus on the actual analysis of big data.
  • 28
    Informatica MDM

    Informatica

    Our market-leading, multidomain solution supports any master data domain, implementation style, and use case, in the cloud or on premises. Integrates best-in-class data integration, data quality, business process management, and data privacy. Tackle complex issues head-on with trusted views of business-critical master data. Automatically link master, transaction, and interaction data relationships across master data domains. Increase accuracy of data records with contact data verification, B2B, and B2C enrichment services. Update multiple master data records, dynamic data models, and collaborative workflows with one click. Reduce maintenance costs and speed deployment with AI-powered match tuning and rule recommendations. Increase productivity using search and pre-configured, highly granular charts and dashboards. Create high-quality data that helps you improve business outcomes with trusted, relevant information.
  • 29
    Apache Drill

    The Apache Software Foundation

    Schema-free SQL Query Engine for Hadoop, NoSQL and Cloud Storage
  • 30
    HEAVY.AI

    HEAVY.AI

    HEAVY.AI is the pioneer in accelerated analytics. The HEAVY.AI platform is used in business and government to find insights in data beyond the limits of mainstream analytics tools. Harnessing the massive parallelism of modern CPU and GPU hardware, the platform is available in the cloud and on-premises. HEAVY.AI originated from research at Harvard and MIT Computer Science and Artificial Intelligence Laboratory (CSAIL). Expand beyond the limitations of traditional BI and GIS by leveraging the full power of modern GPU and CPU hardware so you can extract decision-quality information from your massive datasets without lag. Unify and explore your largest geospatial and time-series datasets to get the complete picture of the what, when, and where. Combine interactive visual analytics, hardware-accelerated SQL, and an advanced analytics & data science framework to find opportunity and risk hidden in your enterprise when you need it most.