Best Data Management Software for Cloud - Page 75

Compare the Top Data Management Software for Cloud as of June 2026 - Page 75

  • 1
    Presto

    Presto

    Presto Foundation

    Presto is an open source distributed SQL query engine for running interactive analytic queries against data sources of all sizes ranging from gigabytes to petabytes. For data engineers who struggle with managing multiple query languages and interfaces to siloed databases and storage, Presto is the fast and reliable engine that provides one simple ANSI SQL interface for all your data analytics and your open lakehouse. Different engines for different workloads means you will have to re-platform down the road. With Presto, you get 1 familar ANSI SQL language and 1 engine for your data analytics so you don't need to graduate to another lakehouse engine. Presto can be used for interactive and batch workloads, small and large amounts of data, and scales from a few to thousands of users. Presto gives you one simple ANSI SQL interface for all of your data in various siloed data systems, helping you join your data ecosystem together.
  • 2
    Infobright DB

    Infobright DB

    IgniteTech

    Infobright DB is a high-performance enterprise database leveraging a columnar storage engine to enable business analysts to dissect data efficiently and more quickly obtain reports. InfoBright DB can be deployed on-premise or in the cloud. Store & analyze big data for interactive business intelligence and complex queries. Improve query performance, reduce storage cost and increase overall efficiency in business analytics and reporting. Easily store up to several hundred TB of data — traditionally not achievable with conventional databases. Run big data applications and eliminate indexing and partitioning — with zero administrative overhead. With the volumes of machine data exploding, IgniteTech’s Infobright DB is specifically designed to achieve high performance for large volumes of machine-generated data. Manage a complex ad hoc analytic environments without the database administration required by other products.
  • 3
    Broadcom IDMS
    Broadcom IDMS™ is a high-performance and scalable mainframe relational database management system designed to support mission-critical enterprise workloads. The platform has been trusted by organizations for more than 40 years to deliver reliable and secure database processing across industries such as finance, healthcare, manufacturing, and government. IDMS enables businesses to modernize operations by supporting open access and automated API generation for cloud, mobile, web, and modern mainframe applications. The solution is built to leverage the latest hardware and software technologies, helping teams efficiently build, maintain, and manage enterprise applications. IDMS provides strong scalability and cost-effective performance to handle growing business demands without compromising reliability. By combining modernization capabilities with enterprise-grade database management, Broadcom helps organizations maximize the long-term value of their mainframe investments.
  • 4
    Altibase

    Altibase

    Altibase

    Altibase is an enterprise-grade, high-performance and relational open source database. A single database that delivers high-intensity data processing through an in-memory database portion and large storage capacity through an on-disk database portion. 10 times faster than conventional on-disk databases. Clients have consistently chosen Altibase over Oracle, IBM, Microsoft, and others. Altibase has replaced many traditional on-disk databases in various industries that require real time solutions since 1999. Altibase now has over 650 global enterprise clients including 8 Fortune Global 500 companies with thousands of mission-critical deployments worldwide. Product maturity rich with function and feature. Altibase is open source which includes its cutting-edge scale-out technology, sharding. No license costs with flexible and competitive subscription fees. 20 years’ accumulated know-how of dealing with over 6,000 mission-critical use cases.
  • 5
    HEAVY.AI

    HEAVY.AI

    HEAVY.AI

    HEAVY.AI is the pioneer in accelerated analytics. The HEAVY.AI platform is used in business and government to find insights in data beyond the limits of mainstream analytics tools. Harnessing the massive parallelism of modern CPU and GPU hardware, the platform is available in the cloud and on-premise. HEAVY.AI originated from research at Harvard and MIT Computer Science and Artificial Intelligence Laboratory (CSAIL). Expand beyond the limitations of traditional BI and GIS by leveraging the full power of modern GPU and CPU hardware so you can extract decision-quality information from your massive datasets without lag. Unify and explore your largest geospatial and time-series datasets to get the complete picture of the what, when, and where. Combine interactive visual analytics, hardware-accelerated SQL, and an advanced analytics & data science framework to find opportunity and risk hidden in your enterprise when you need to most.
  • 6
    Dell EMC Avamar
    Dell EMC Avamar enables fast, efficient backup and recovery through its integrated variable-length deduplication technology. Avamar is optimized for fast, daily full backups of physical and virtual environments, NAS servers, enterprise applications, remote offices and desktops/laptops. Avamar is available as a virtual edition or as a component of Dell EMC Data Protection Suite, which offers you a complete suite of data protection software options. Backup and recovery optimized for virtual environments. Enables application-consistent recovery of enterprise applications. Uses variable-length deduplication for high performance and lower cost. Provides intuitive centralized management and encryption for data security. Dell Technologies On Demand delivers the industry's broadest end-to-end portfolio of consumption-based and as-a-service solutions ideally suited for the way on-premises infrastructure and services are consumed in the on-demand economy.
  • 7
    OneView

    OneView

    OneView

    Working exclusively with real data creates significant challenges for machine learning model training. Synthetic data enables limitless machine learning model training, addressing the drawbacks and challenges of real data. Boost the performance of your geospatial analytics by creating the imagery you need. Customizable satellite, drone, and aerial imagery. Create scenarios, change object ratios, and adjust imaging parameters quickly and iteratively. Any rare objects or occurrences can be created. The resulting datasets are fully-annotated, error-free, and ready for training. The OneView simulation engine creates 3D worlds as the base for synthetic satellite and aerial images, layered with multiple randomization factors, filters, and variation parameters. The synthetic images replace real data for remote sensing systems in machine learning model training. They achieve superior interpretation results, especially in cases with limited coverage or poor-quality data.
  • 8
    Confluent

    Confluent

    Confluent

    Infinite retention for Apache Kafka® with Confluent. Be infrastructure-enabled, not infrastructure-restricted Legacy technologies require you to choose between being real-time or highly-scalable. Event streaming enables you to innovate and win - by being both real-time and highly-scalable. Ever wonder how your rideshare app analyzes massive amounts of data from multiple sources to calculate real-time ETA? Ever wonder how your credit card company analyzes millions of credit card transactions across the globe and sends fraud notifications in real-time? The answer is event streaming. Move to microservices. Enable your hybrid strategy through a persistent bridge to cloud. Break down silos to demonstrate compliance. Gain real-time, persistent event transport. The list is endless.
  • 9
    Hadoop

    Hadoop

    Apache Software Foundation

    The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures. A wide variety of companies and organizations use Hadoop for both research and production. Users are encouraged to add themselves to the Hadoop PoweredBy wiki page. Apache Hadoop 3.3.4 incorporates a number of significant enhancements over the previous major release line (hadoop-3.2).
  • 10
    Aircloak Insights
    Aircloak Insights is a transparent proxy sitting between analysts and the sensitive data they need to work with. Analysts query the system like normal, using SQL or dashboards like Tableau. Aircloak Insights intercepts the query and tailors it to the data backend which may be SQL or a NoSQL big data store. Results are returned via the proxy which ensures they are aggregated and fully anonymized. Aircloak Insights integrates directly in your existing workflow. You can query your sensitive datasets using the query editor in our easy-to-use web interface, Insights Air, or connect using business intelligence tools like Tableau or any other tools or dashboards that know how to communicate using the Postgres Message Protocol. Aircloak Insights also allows you to run queries programmatically using a RESTful API.
  • 11
    Apache Spark

    Apache Spark

    Apache Software Foundation

    Apache Spark™ is a unified analytics engine for large-scale data processing. Apache Spark achieves high performance for both batch and streaming data, using a state-of-the-art DAG scheduler, a query optimizer, and a physical execution engine. Spark offers over 80 high-level operators that make it easy to build parallel apps. And you can use it interactively from the Scala, Python, R, and SQL shells. Spark powers a stack of libraries including SQL and DataFrames, MLlib for machine learning, GraphX, and Spark Streaming. You can combine these libraries seamlessly in the same application. Spark runs on Hadoop, Apache Mesos, Kubernetes, standalone, or in the cloud. It can access diverse data sources. You can run Spark using its standalone cluster mode, on EC2, on Hadoop YARN, on Mesos, or on Kubernetes. Access data in HDFS, Alluxio, Apache Cassandra, Apache HBase, Apache Hive, and hundreds of other data sources.
  • 12
    CData Query Federation Drivers
    The Query Federation Drivers provide a universal data access layer that simplifies application development and data access. The drivers make it easy to query data across systems with SQL through a common driver interface. The Query Federation Drivers enable users to embed Logical Data Warehousing capabilities into any application or process. A Logical Data Warehouse is an architectural layer that enables access to multiple data sources on-demand, without relocating or transforming data in advance. Essentially the Query Federation Drivers give users simple, SQL-based access to all of your databases, data warehouses, and cloud applications through a single interface. Developers can pick multiple data processing systems and access all of them with a single SQL-based interface.
  • 13
    Amazon Kinesis
    Easily collect, process, and analyze video and data streams in real time. Amazon Kinesis makes it easy to collect, process, and analyze real-time, streaming data so you can get timely insights and react quickly to new information. Amazon Kinesis offers key capabilities to cost-effectively process streaming data at any scale, along with the flexibility to choose the tools that best suit the requirements of your application. With Amazon Kinesis, you can ingest real-time data such as video, audio, application logs, website clickstreams, and IoT telemetry data for machine learning, analytics, and other applications. Amazon Kinesis enables you to process and analyze data as it arrives and respond instantly instead of having to wait until all your data is collected before the processing can begin. Amazon Kinesis enables you to ingest, buffer, and process streaming data in real-time, so you can derive insights in seconds or minutes instead of hours or days.
  • 14
    Kibana

    Kibana

    Elastic

    Kibana is a free and open user interface that lets you visualize your Elasticsearch data and navigate the Elastic Stack. Do anything from tracking query load to understanding the way requests flow through your apps. Kibana gives you the freedom to select the way you give shape to your data. With its interactive visualizations, start with one question and see where it leads you. Kibana core ships with the classics: histograms, line graphs, pie charts, sunbursts, and more. And, of course, you can search across all of your documents. Leverage Elastic Maps to explore location data, or get creative and visualize custom layers and vector shapes. Perform advanced time series analysis on your Elasticsearch data with our curated time series UIs. Describe queries, transformations, and visualizations with powerful, easy-to-learn expressions.
  • 15
    Incorta

    Incorta

    Incorta

    Direct is the shortest path from data to insight. Incorta empowers everyone in your business with a true self-service data experience and breakthrough performance for better decisions and incredible results. What if you could bypass fragile ETL and expensive data warehouses, and deliver data projects in days, instead of weeks or months? Our direct approach to analytics delivers true self-service in the cloud or on-premises with agility and performance. Incorta is used by the world’s largest brands to succeed where other analytics solutions fail. Across multiple industries and lines of business, we boast connectors and pre-built solutions for your enterprise applications and technologies. Game-changing innovation and customer success happen through Incorta’s partners including Microsoft, AWS, eCapital, and Wipro. Explore or join our thriving partner ecosystem.
  • 16
    VeriAS

    VeriAS

    Verias

    Our unique software systems enable SMS routing and delivery, Data Management and Analytics, and Email Scoring. This empowers our clients to reach customers with the highest propensity to engage and convert.
  • 17
    tye.io

    tye.io

    tye GmbH

    tye is a Software-as-a-Service (SaaS) personal assistant that helps companies keep the contact information of their customers up-to-date.
  • 18
    Phynd

    Phynd

    Phynd Technologies

    A single platform to operationalize and optimize provider data enterprise-wide. Phynd 360 is an innovative provider data platform, serving as health systems’ central hub for all provider data. Phynd optimizes provider data – people, places and services – for use in EHR, Marketing and Claims systems via platform tools which offer provider enrollment, management, outreach and search across the enterprise. Manage data specific to employed, referring and affiliated providers. Maintain profiles of your care locations for consumers, providers and care coordinators. Match providers with the right specialty, subspecialty, clinical and consumer-friendly terms. Track provider and location participation in commercial and health system plans, ACOs, and narrow networks.
  • 19
    Kaggle

    Kaggle

    Google

    Kaggle is a global AI and machine learning platform that brings together developers, researchers, organizations, and data science enthusiasts to build, evaluate, and improve artificial intelligence technologies. The platform offers access to AI competitions, benchmarks, hackathons, datasets, notebooks, pre-trained models, and educational courses that help users develop real-world machine learning skills. Kaggle enables organizations and researchers to host competitions, crowdsource evaluations, publish benchmarks, and discover top AI talent through its large global community of over 31 million users. Users can access free GPU and TPU-powered notebook environments, collaborate on public datasets, explore pre-trained AI models, and participate in large-scale AI research initiatives. The platform also provides learning resources including hands-on courses, solution write-ups, and reproducible notebooks that support both beginners and advanced machine learning practitioners.
  • 20
    Cloud Dataprep
    Cloud Dataprep by Trifacta is an intelligent data service for visually exploring, cleaning, and preparing structured and unstructured data for analysis, reporting, and machine learning. Because Cloud Dataprep is serverless and works at any scale, there is no infrastructure to deploy or manage. Your next ideal data transformation is suggested and predicted with each UI input, so you don’t have to write code. Cloud Dataprep is an integrated partner service operated by Trifacta and based on their industry-leading data preparation solution. Google works closely with Trifacta to provide a seamless user experience that removes the need for up-front software installation, separate licensing costs, or ongoing operational overhead. Cloud Dataprep is fully managed and scales on demand to meet your growing data preparation needs so you can stay focused on analysis.
  • 21
    Amazon EMR
    Amazon EMR is the industry-leading cloud big data platform for processing vast amounts of data using open-source tools such as Apache Spark, Apache Hive, Apache HBase, Apache Flink, Apache Hudi, and Presto. With EMR you can run Petabyte-scale analysis at less than half of the cost of traditional on-premises solutions and over 3x faster than standard Apache Spark. For short-running jobs, you can spin up and spin down clusters and pay per second for the instances used. For long-running workloads, you can create highly available clusters that automatically scale to meet demand. If you have existing on-premises deployments of open-source tools such as Apache Spark and Apache Hive, you can also run EMR clusters on AWS Outposts. Analyze data using open-source ML frameworks such as Apache Spark MLlib, TensorFlow, and Apache MXNet. Connect to Amazon SageMaker Studio for large-scale model training, analysis, and reporting.
  • 22
    Kogni

    Kogni

    Kogni

    Kogni's Discover feature enables enterprises to locate and detect all sensitive and critical information. Discover sensitive data from any source, in any format and in any type. Employ Kogni’s expert sensitive data discovery software to automate data discovery and classification. Our ease of implementation allows for seamless integration with your enterprise’s data warehouse. Accelerate compliance with international data regulations and industry standards with Kogni’s sensitive data discovery tool. Minimize the risk of data leak and the cost of non-compliance with data protection and privacy regulations like HIPAA, GDPR, CCPA, PCI, and PII amongst others. Scans and pin-points sensitive data from 10+ data sources. Produces a comprehensive sensitive information dashboard with an array of special features. Custom-build your sensitive data classification groups as per your company’s needs. Supports a wide range of data types and formats.
  • 23
    Sightcorp

    Sightcorp

    Sightcorp

    Get real-time shopper insights. Measure customer satisfaction. Get to know who your converted customers are. Our Face Analysis Technology is empowering brick-and-mortar businesses with real-time, anonymous shoppers’ insights to help you optimize customer experiences and day-to-day retail operations. Check out our recommended products. Create smarter in-store experiences that maximize engagement and customer satisfaction. We all want to boost customer engagement because an engaged customer is a return customer. Our software provides you real-time insights into your shoppers’ demographic profile, interest, and behavior. Get to know who your converted customers are, optimize everything, from shelf-level displays to store layout based on customers’ behavior. Are you reaching your target audience? Our software can help you get answers to these questions.
  • 24
    Google Cloud Bigtable
    Google Cloud Bigtable is a fully managed, scalable NoSQL database service for large analytical and operational workloads. Fast and performant: Use Cloud Bigtable as the storage engine that grows with you from your first gigabyte to petabyte-scale for low-latency applications as well as high-throughput data processing and analytics. Seamless scaling and replication: Start with a single node per cluster, and seamlessly scale to hundreds of nodes dynamically supporting peak demand. Replication also adds high availability and workload isolation for live serving apps. Simple and integrated: Fully managed service that integrates easily with big data tools like Hadoop, Dataflow, and Dataproc. Plus, support for the open source HBase API standard makes it easy for development teams to get started.
  • 25
    Altinity

    Altinity

    Altinity

    Altinity's expert engineering team can implement everything from core ClickHouse features to Kubernetes operator behavior to client library improvements. A flexible docker-based GUI manager for ClickHouse that can do the following: Install ClickHouse clusters; Add, delete, and replace nodes; Monitor cluster status; Help with troubleshooting and diagnostics. 3rd party tools and software integrations: Ingest: Kafka, ClickTail; APIs: Python, Golang, ODBC, Java; Kubernetes; UI tools: Grafana, Superset, Tabix, Graphite; Databases: MySQL, PostgreSQL; BI tools: Tableau and many more. Altinity.Cloud incorporates lessons from helping hundreds of customers operate ClickHouse-based analytics. Altinity.Cloud has a Kubernetes-based architecture that delivers portability and user choice of where to operate. Designed from the beginning to run anywhere without lock-in. Cost management is critical for SaaS businesses.
  • 26
    Oracle Advertising
    Discover the hidden potential in your digital strategy. Oracle Data Cloud provides award-winning solutions made for every stage of the marketing journey. Reduce waste and protect your ad spend, reach your ideal buyers and prospects, and ensure you measure the metrics that matter to quantify the impact of your online advertising campaigns. Make the biggest impact with your advertising by understanding your customers and most valuable prospects on a whole new level. Discover what makes your audience take action and where to engage them with best-in-class audience and contextual intelligence solutions from Oracle Data Cloud. Protect against fraud while ensuring your ads are in-view and appearing alongside safe, relevant content. Drive campaign success with solutions for viewability, invalid traffic (IVT), and brand safety.
  • 27
    Appen

    Appen

    Appen

    The Appen platform combines human intelligence from over one million people all over the world with cutting-edge models to create the highest-quality training data for your ML projects. Upload your data to our platform and we provide the annotations, judgments, and labels you need to create accurate ground truth for your models. High-quality data annotation is key for training any AI/ML model successfully. After all, this is how your model learns what judgments it should be making. Our platform combines human intelligence at scale with cutting-edge models to annotate all sorts of raw data, from text, to video, to images, to audio, to create the accurate ground truth needed for your models. Create and launch data annotation jobs easily through our plug and play graphical user interface, or programmatically through our API.
  • 28
    Digital Twin Streaming Service
    ScaleOut Digital Twin Streaming Service™ Easily build and deploy real-time digital twins for streaming analytics Connect to many data sources with Azure & AWS IoT hubs, Kafka, and more Maximize situational awareness with live, aggregate analytics. Introducing a breakthrough cloud service that simultaneously tracks telemetry from millions of data sources with “real-time” digital twins — enabling immediate, deep introspection with state-tracking and highly targeted, real-time feedback for thousands of devices. A powerful UI simplifies deployment and displays aggregate analytics in real time to maximize situational awareness. Ideal for a wide range of applications, including the Internet of Things (IoT), real-time intelligent monitoring, logistics, and financial services. Simplified pricing makes getting started fast and easy. Combined with the ScaleOut Digital Twin Builder software toolkit, the ScaleOut Digital Twin Streaming Service enables the next generation in stream processing.
  • 29
    Experian Aperture Data Studio
    Whether you’re preparing for a data migration, looking to achieve reliable customer insight, or complying with regulation, you can rely on our data quality management solutions. With Experian, it means powerful data profiling, data discovery, data cleansing and enrichment, process orchestration, and the ability to run full-volume analyses, among other things. Getting insight into your business’s data is now easier and faster than ever before. Our solutions allow you to seamlessly connect to hundreds of data sources to remove duplicates, correct errors, and standardize formats. With improved data quality, comes a more comprehensive view of your customers, business operations, and more.
  • 30
    Nightfall

    Nightfall

    Nightfall AI

    Discover, classify, and protect your sensitive data. Nightfall™ uses machine learning to identify business-critical data, like customer PII, across your SaaS, APIs, and data infrastructure, so you can manage & protect it. Integrate in minutes with cloud services via APIs to monitor data without agents. Machine learning classifies your sensitive data & PII with high accuracy, so nothing gets missed. Setup automated workflows for quarantines, deletions, alerts, and more - saving you time and keeping your business safe. Nightfall integrates directly with all your SaaS, APIs, and data infrastructure. Start building with Nightfall’s APIs for sensitive data classification & protection for free. Via REST API, programmatically get structured results from Nightfall’s deep learning-based detectors for things like credit card numbers, API keys, and more. Integrate with just a few lines of code. Seamlessly add data classification to your applications & workflows using Nightfall's REST API.
Auth0 Logo