Best Big Data Platforms - Page 3

Compare the Top Big Data Platforms as of August 2025 - Page 3

  • 1
    PolyAnalyst

    Megaputer Intelligence

    PolyAnalyst is data analysis software used by large organizations across several industries (insurance, manufacturing, finance, etc.). Notable capabilities include a visual composer for complex data analysis modeling in place of coding. It combines structured and poly-structured data (e.g., multiple-choice questions and open-ended responses) for unified analysis, and it can process text data in more than 16 languages. PolyAnalyst covers comprehensive data analysis needs: loading data, cleansing and preparing it for analysis, deploying machine learning and supervised analysis techniques, and building reports that non-analysts can use to uncover insights.
  • 2
    MANTA

    Manta

    Manta is the world-class automated approach to visualize, optimize, and modernize how data moves through your organization through code-level lineage. By automatically scanning your data environment with the power of 50+ out-of-the-box scanners, Manta builds a powerful map of all data pipelines to drive efficiency and productivity. Visit manta.io to learn more. With the Manta platform, you can make your data a truly enterprise-wide asset, bridge the understanding gap, enable self-service, and easily:
      • Increase productivity
      • Accelerate development
      • Shorten time-to-market
      • Reduce costs and manual effort
      • Run instant and accurate root cause and impact analyses
      • Scope and perform effective cloud migrations
      • Improve data governance and regulatory compliance (GDPR, CCPA, HIPAA, and more)
      • Increase data quality
      • Enhance data privacy and data security
  • 3
    Stata

    StataCorp LLC

    Stata delivers everything you need for reproducible data analysis—powerful statistics, visualization, data manipulation, and automated reporting—all in one intuitive platform. Stata is fast and accurate. It is easy to learn through the extensive graphical interface yet completely programmable. With Stata's menus and dialogs, you get the best of both worlds. You can easily point and click or drag and drop your way to all of Stata's statistical, graphical, and data management features. Use Stata's intuitive command syntax to quickly execute commands. Whether you enter commands directly or use the menus and dialogs, you can create a log of all actions and their results to ensure the reproducibility and integrity of your analysis. Stata also has complete command-line scripting and programming facilities, including a full matrix programming language. You have access to everything you need to script your analysis or even to create new Stata commands.
    Starting Price: $48.00/6-month/student
  • 4
    Centrifuge Analytics

    Culmen International LLC

    Centrifuge Analytics™ is a big data discovery technology that provides the power and flexibility to connect, visualize and collaborate without complex data integration, costly services or a data science degree. It combines sophisticated link-analysis, interactive visualizations and discovery features to dramatically simplify data pattern and connection recognition.
      - A fully integrated solution that empowers analysts to work with no IT support
      - Sophisticated link-analysis features such as pattern identification, intelligent bundling and various unique interactive visual features
      - 100% browser footprint means no client-side data retention, which simplifies security and client administration
      - Patent-pending server-side rendering engine enables highly scalable network graphs
      - Agile data integration: no need to stage, warehouse or apply a fixed ontology
      - Model-based analytics: set up once and reuse, building upon the experience of more seasoned analysts
    Starting Price: Call
  • 5
    Indicative

    Indicative

    Marketers, product managers, and business analysts use Indicative to optimize customer conversion, engagement, and retention. Indicative connects to all your customer data sources, synthesizes them into a complete view of behavior, and gives you the actionable insights you need to grow your customer base and build great products. Indicative's free plan offers up to 1 billion user actions per month and complete access to the behavioral analytics platform.
    Starting Price: $0.00
  • 6
    Immuta

    Immuta

    Immuta is the market leader in secure Data Access, providing data teams one universal platform to control access to analytical data sets in the cloud. Only Immuta can automate access to data by discovering, securing, and monitoring data. Data-driven organizations around the world trust Immuta to speed time to data, safely share more data with more users, and mitigate the risk of data leaks and breaches. Founded in 2015, Immuta is headquartered in Boston, MA. Immuta is the fastest way for algorithm-driven enterprises to accelerate the development and control of machine learning and advanced analytics. The company's hyperscale data management platform provides data scientists with rapid, personalized data access to dramatically improve the creation, deployment and auditability of machine learning and AI.
  • 7
    Instaclustr

    Instaclustr

    Instaclustr is the Open Source-as-a-Service company, delivering reliability at scale. We operate an automated, proven, and trusted managed environment, providing database, analytics, search, and messaging. We enable companies to focus internal development and operational resources on building cutting edge customer-facing applications. Instaclustr works with cloud providers including AWS, Heroku, Azure, IBM Cloud, and Google Cloud Platform. The company has SOC 2 certification and provides 24/7 customer support.
    Starting Price: $20 per node per month
  • 8
    Keen

    Keen.io

    Keen is the fully managed event streaming platform. Built upon trusted Apache Kafka, we make it easier than ever for you to collect massive volumes of event data with our real-time data pipeline. Use Keen’s powerful REST API and SDKs to collect event data from anything connected to the internet. Our platform stores your data securely, decreasing your operational and delivery risk. With storage infrastructure powered by Apache Cassandra, data is secured in transit via HTTPS and TLS, then stored with multi-layer AES encryption. Once data is securely stored, use our Access Keys to present data in arbitrary ways without having to re-architect your security or data model. Or take advantage of Role-based Access Control (RBAC), which allows completely customizable permission tiers, down to specific data points or queries.
    Starting Price: $149 per month
  • 9
    Hopsworks

    Logical Clocks

    Hopsworks is an open-source enterprise platform for the development and operation of Machine Learning (ML) pipelines at scale, based around the industry’s first Feature Store for ML. You can easily progress from data exploration and model development in Python using Jupyter notebooks and conda to running production-quality end-to-end ML pipelines, without having to learn how to manage a Kubernetes cluster. Hopsworks can ingest data from the data sources you use, whether they are in the cloud, on-premises, in IoT networks, or part of your Industry 4.0 solution. Deploy on-premises on your own hardware or at your preferred cloud provider. Hopsworks provides the same user experience in the cloud or in the most secure of air-gapped deployments. Learn how to set up customized alerts in Hopsworks for different events that are triggered as part of the ingestion pipeline.
    Starting Price: $1 per month
  • 10
    Qrvey

    Qrvey

    Qrvey is the only solution for embedded analytics with a built-in data lake. Qrvey saves engineering teams time and money with a turnkey solution connecting your data warehouse to your SaaS application. Qrvey’s full-stack solution includes the necessary components so that your engineering team can build less.
    Qrvey’s multi-tenant data lake includes:
      - Elasticsearch as the analytics engine
      - A unified data pipeline for ingestion and transformation
      - A complete semantic layer for simple user and data security integration
    Qrvey’s embedded visualizations support everything from:
      - standard dashboards and templates
      - self-service reporting
      - user-level personalization
      - individual dataset creation
      - data-driven workflow automation
    Qrvey delivers this as a self-hosted package for cloud environments. This offers the best security as your data never leaves your environment while offering a better analytics experience to users. Less time and money on analytics
  • 11
    ChaosSearch

    ChaosSearch

    Log analytics should not break the bank. Because most logging solutions use an Elasticsearch database, a Lucene index, or both, their cost of operation is unreasonably high. ChaosSearch takes a revolutionary approach. We reinvented indexing, which allows us to pass along substantial cost savings to our customers. See for yourself with this price comparison calculator. ChaosSearch is a fully managed SaaS platform that allows you to focus on search and analytics in AWS S3 rather than spend time managing and tuning databases. Leverage your existing AWS S3 infrastructure and let us do the rest. Watch this short video to learn how our unique approach and architecture allow ChaosSearch to address the challenges of today’s data and analytics requirements. ChaosSearch indexes your data as-is, for log, SQL and ML analytics, without transformation, while auto-detecting native schemas. ChaosSearch is an ideal replacement for the commonly deployed Elasticsearch solutions.
    Starting Price: $750 per month
  • 12
    tgndata

    tgndata

    With tgndata, you gain access to a comprehensive overview of your competitors' product prices and availability status, conveniently presented in your customized dashboard. tgndata is also known for its expertise in offering a diverse range of dynamic pricing rules and strategies to cater to your specific requirements. For Brands, tgndata offers a comprehensive summary of their resellers enabling them to assess their performance, particularly concerning MAP & MSRP.
    Starting Price: 299€/month
  • 13
    Powerslide

    Datarocks

    Powerslide is a brand-new data storytelling and data visualization solution that helps business users put their data to use simply and efficiently. It is an intuitive, interactive, and collaborative tool for data analysis, visualization, and presentation, with a simple, practical, and well-designed interface. You can create your KPIs and data visualizations in just a few clicks, then stage them in a report, a dashboard, or an infographic to make them easier to understand. Powerslide offers:
      - An intuitive interface designed for business
      - A wide choice of data visualizations
      - A collaborative mode
      - Automated updates
      - Several connectors: CSV, Excel, Denodo, Snowflake, Google Sheets, REST API, Zapier, Oracle, SQL Server
    Starting Price: Free
  • 14
    Rinalogy Search
    Almost any search query applied to Big Data returns a very large number of results that are often practically impossible to review. Every user has specific needs, and finding information based only on a user query and general data statistics does not produce useful results. eDiscovery, healthcare, financial services, crime, consulting, academia and other fields need to be able to quickly find accurate information. Rinalogy Search is a next-generation search tool that uses machine learning to interactively learn from each user and return personalized results based on the user’s feedback in real time. Rinalogy Search returns relevancy scores for individual documents in the results for each query. Rinalogy Search can be deployed in the client’s IT infrastructure, close to the data and behind the firewall. Rinalogy allows users to define the level of importance of search concepts by assigning weights to them, which helps you find the results you are looking for.
    Starting Price: $50 per month
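
The idea of weighting search concepts and returning a relevancy score per document can be illustrated with a minimal sketch. This is not Rinalogy's algorithm or API; the functions `relevancy_score` and `rank`, the sample documents, and the weights are all hypothetical, and the scoring is plain weighted term frequency normalised by document length.

```python
from collections import Counter


def relevancy_score(document: str, concept_weights: dict[str, float]) -> float:
    """Weighted term frequency, normalised by document length (hypothetical scoring)."""
    tokens = document.lower().split()
    if not tokens:
        return 0.0
    counts = Counter(tokens)
    weighted_hits = sum(
        weight * counts[concept.lower()]
        for concept, weight in concept_weights.items()
    )
    return weighted_hits / len(tokens)


def rank(documents: list[str], concept_weights: dict[str, float]) -> list[tuple[float, str]]:
    """Return (score, document) pairs, highest score first."""
    scored = [(relevancy_score(doc, concept_weights), doc) for doc in documents]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


# Illustrative data: the user marks "breach" as three times as important as "contract".
documents = [
    "contract renewal notice",
    "weather report",
    "contract breach damages",
]
weights = {"breach": 3.0, "contract": 1.0}

for score, doc in rank(documents, weights):
    print(f"{score:.2f}  {doc}")
```

Raising the weight of a concept pushes documents that mention it higher in the ranking, which is the behaviour the description attributes to user-assigned weights.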
  • 15
    Dataleyk

    Dataleyk

    Dataleyk is the secure, fully-managed cloud data platform for SMBs. Our mission is to make Big Data analytics easy and accessible to all. Dataleyk is the missing link in reaching your data-driven goals. Our platform makes it quick and easy to have a stable, flexible and reliable cloud data lake with near-zero technical knowledge. Bring in all of your company data from every source, explore it with SQL, and visualize it with your favorite BI tool or our advanced built-in graphs. Modernize your data warehousing with Dataleyk. Our state-of-the-art cloud data platform is ready to handle your structured and unstructured data at scale. Data is an asset. Dataleyk is a secure cloud data platform that encrypts all of your data and offers on-demand data warehousing. Zero maintenance, as an objective, may not be easy to achieve. But as an initiative, it can be a driver for significant delivery improvements and transformational results.
    Starting Price: €0.1 per GB
  • 16
    Tugger

    Tugger

    Tugger swiftly and securely copies your data out of your business system(s) and into the data analytics tools Microsoft Power BI or Tableau for first-rate business reporting. Once your data is transferred, Tugger also gets you set up with key business reports for a complete end-to-end solution; no other ETL tool offers this complete package. Tugger makes your life easier by removing the need for any manual API integrations and reducing the risk of skewed data. No technical knowledge is required and all users get access to Tugger's popular support. Data sources that Tugger integrates with include HubSpot, Harvest, Microsoft Teams, JIRA, GitHub and more.
    Starting Price: £75 per month
  • 17
    Azure Data Share
    Share data, in any format and any size, from multiple sources with other organizations. Easily control what you share, who receives your data, and the terms of use. Data Share provides full visibility into your data-sharing relationships with a user-friendly interface. Share data in just a few clicks, or build your own application using the REST API. Serverless code-free data-sharing service that requires no infrastructure setup or management. Intuitive interface to govern all your data-sharing relationships. Automated data-sharing processes for productivity and predictability. Secure data-sharing service that uses underlying Azure security measures. Share structured and unstructured data from multiple Azure data stores with other organizations in just a few clicks. There’s no infrastructure to set up or manage, no SAS keys are required, and sharing is all code-free. You control data access and set terms of use aligned with your enterprise policies.
    Starting Price: $0.05 per dataset-snapshot
  • 18
    Indexima Data Hub
    Reshape your perception of time in data analytics. Access your business’s data instantly and work directly on your dashboard without going back and forth with the IT team. Meet Indexima DataHub, a new space-time where operational and functional users gain instant access to their data. With a combination of its unique indexing engine and machine learning, Indexima allows businesses to access all their data to simplify and speed up analytics. Robust and scalable, the solution allows organizations to query all their data directly at the source, in volumes of tens of billions of rows, in just a few milliseconds. Our Indexima platform allows users to implement instant analytics on all their data in just one click. Thanks to Indexima’s new ROI and TCO calculator, find out the ROI of your data platform in 30 seconds. Reduce infrastructure costs, project deployment time, and data engineering costs, while boosting your analytical performance.
    Starting Price: $3,290 per month
  • 19
    Hydrolix

    Hydrolix

    Hydrolix is a streaming data lake that combines decoupled storage, indexed search, and stream processing to deliver real-time query performance at terabyte-scale for a radically lower cost. CFOs love the 4x reduction in data retention costs. Product teams love 4x more data to work with. Spin up resources when you need them and scale to zero when you don’t. Fine-tune resource consumption and performance by workload to control costs. Imagine what you can build when you don’t have to sacrifice data because of budget. Ingest, enrich, and transform log data from multiple sources including Kafka, Kinesis, and HTTP. Return just the data you need, no matter how big your data is. Reduce latency and costs, and eliminate timeouts and brute-force queries. Storage is decoupled from ingest and query, allowing each to scale independently to meet performance and budget targets. Hydrolix’s high-density compression (HDX) typically reduces 1TB of stored data to 55GB.
    Starting Price: $2,237 per month
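
To put the quoted compression figure in more familiar terms, the short calculation below converts "1TB to 55GB" into a compression ratio and a percentage saving. It assumes decimal units (1 TB = 1000 GB), which the listing does not specify; the function names are illustrative only.

```python
# Assumption: decimal units, so 1 TB = 1000 GB (the listing does not say which unit system is used).
ORIGINAL_GB = 1000.0
COMPRESSED_GB = 55.0


def compression_ratio(original_gb: float, compressed_gb: float) -> float:
    """How many times smaller the stored data is after compression."""
    return original_gb / compressed_gb


def space_saving(original_gb: float, compressed_gb: float) -> float:
    """Fraction of the original storage that is no longer needed."""
    return 1.0 - compressed_gb / original_gb


ratio = compression_ratio(ORIGINAL_GB, COMPRESSED_GB)
saving = space_saving(ORIGINAL_GB, COMPRESSED_GB)
print(f"compression ratio: about {ratio:.1f}:1")
print(f"space saving: about {saving:.1%}")
```

Under that assumption the figure works out to roughly an 18:1 ratio, or about a 94.5% reduction in stored bytes.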
  • 20
    DoubleCloud

    DoubleCloud

    Save time & costs by streamlining data pipelines with zero-maintenance open source solutions. From ingestion to visualization, all are integrated, fully managed, and highly reliable, so your engineers will love working with data. You choose whether to use any of DoubleCloud’s managed open source services or leverage the full power of the platform, including data storage, orchestration, ELT, and real-time visualization. We provide leading open source services like ClickHouse, Kafka, and Airflow, with deployment on Amazon Web Services or Google Cloud. Our no-code ELT tool allows real-time data syncing between systems, fast, serverless, and seamlessly integrated with your existing infrastructure. With our managed open-source data visualization you can simply visualize your data in real time by building charts and dashboards. We’ve designed our platform to make the day-to-day life of engineers more convenient.
    Starting Price: $0.024 per 1 GB per month
  • 21
    WarpStream

    WarpStream

    WarpStream is an Apache Kafka-compatible data streaming platform built directly on top of object storage, with no inter-AZ networking costs, no disks to manage, and infinite scalability, all within your VPC. WarpStream is deployed as a stateless, auto-scaling agent binary in your VPC with no local disks to manage. Agents stream data directly to and from object storage with no buffering on local disks and no data tiering. Create new “virtual clusters” in our control plane instantly. Support different environments, teams, or projects without managing any dedicated infrastructure. WarpStream is protocol compatible with Apache Kafka, so you can keep using all your favorite tools and software. No need to rewrite your application or use a proprietary SDK. Just change the URL in your favorite Kafka client library and start streaming. Never again have to choose between reliability and your budget.
    Starting Price: $2,987 per month
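
The "just change the URL" claim can be sketched as a configuration change. The example below only builds configuration dictionaries and does not connect to anything; `bootstrap.servers`, `group.id`, and `auto.offset.reset` are standard Kafka client property names, while the helper `point_client_at`, the hostnames, and the port are hypothetical placeholders.

```python
def point_client_at(config: dict, bootstrap_servers: str) -> dict:
    """Return a copy of a Kafka client config with only the broker address changed."""
    updated = dict(config)  # copy so the original config is left untouched
    updated["bootstrap.servers"] = bootstrap_servers
    return updated


# Hypothetical existing consumer configuration pointing at a regular Kafka cluster.
existing_config = {
    "bootstrap.servers": "kafka-broker.internal.example:9092",
    "group.id": "orders-consumer",
    "auto.offset.reset": "earliest",
}

# Hypothetical agent address; every other setting is carried over unchanged.
warpstream_config = point_client_at(
    existing_config, "warpstream-agent.internal.example:9092"
)

for key, value in warpstream_config.items():
    print(f"{key} = {value}")
```

The resulting dictionary would be passed to whichever Kafka client library the application already uses; application code that produces or consumes messages stays the same.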
  • 22
    5X

    5X

    5X is an all-in-one data platform that provides everything you need to centralize, clean, model, and analyze your data. Designed to simplify data management, 5X offers seamless integration with over 500 data sources, ensuring uninterrupted data movement across all your systems with pre-built and custom connectors. The platform encompasses ingestion, warehousing, modeling, orchestration, and business intelligence, all rendered in an easy-to-use interface. 5X supports various data movements, including SaaS apps, databases, ERPs, and files, automatically and securely transferring data to data warehouses and lakes. With enterprise-grade security, 5X encrypts data at the source, identifying personally identifiable information and encrypting data at a column level. The platform is designed to reduce the total cost of ownership by 30% compared to building your own platform, enhancing productivity with a single interface to build end-to-end data pipelines.
    Starting Price: $350 per month
  • 23
    Etleap

    Etleap

    Etleap was built from the ground up on AWS to support Redshift and Snowflake data warehouses and S3/Glue data lakes. Their solution simplifies and automates ETL by offering fully-managed ETL-as-a-service. Etleap's data wrangler and modeling tools let users control how data is transformed for analysis, without writing any code. Etleap monitors and maintains data pipelines for availability and completeness, eliminating the need for constant maintenance, and centralizes data from 50+ disparate sources and silos into your data warehouse or data lake.
  • 24
    Adverity

    Adverity GmbH

    Adverity is the fully-integrated data platform for automating the connectivity, transformation, governance and utilization of data at scale. The platform enables businesses to blend disparate datasets such as sales, finance, marketing, and advertising, to create a single source of truth over business performance. Through automated connectivity to hundreds of data sources and destinations, unrivaled data transformation options, and powerful data governance features, Adverity is the easiest way to get your data how you want it, where you want it, and when you need it. Adverity was founded in 2015 and is headquartered in Vienna with offices in London and New York, and currently works with leading brands and agencies including Unilever, Bosch, IKEA, Forbes, GroupM, Publicis, and Dentsu.
  • 25
    AnswerRocket

    AnswerRocket

    AnswerRocket, an American software company, has been innovating in search-based data discovery and analytics via natural language since 2013. Their solution provides businesses with the intelligence and analytics needed to run a data-driven organization in today's economy. Their elegantly engineered platform offers a more in-depth look at how data is analyzed and distributed throughout an organization, giving a business an unfair advantage against the competition.
  • 26
    Anodot

    Anodot

    Anodot applies AI to deliver autonomous analytics in real time, across all data types, at enterprise scale. Unlike traditional Business Intelligence, with its manual limitations, we give analysts mastery over their business with a self-service AI platform that runs continuously to eliminate blind spots, alert on incidents, and investigate root causes. Our platform uses patented machine learning algorithms to isolate issues and correlate them across multiple parameters. This helps eliminate business insight latency and supports smart, rapid business decision-making. Anodot has nearly 100 customers in digital transformation industries like eCommerce, FinTech, AdTech, Telco, and Gaming, including Microsoft, Lyft, Waze, and King. Founded in 2014, Anodot is headquartered in Silicon Valley and Israel, with sales offices worldwide.
  • 27
    TAMI

    TAMI

    TAMI provides the most complete and accurate picture of your market and gives you direct access to 130M companies and over 532M verified business contacts. Fully GDPR compliant. We specialise in European data and have patented technology in Europe. Streamline your sales strategy and improve the efficiency of your teams' prospecting efforts:
      • Uncover new companies and decision makers that fit your ideal customer profile and use target-specific filters to reach and convert these prospects into net new business
      • Enrich existing inbound records from incomplete lead capture forms
      • Export data directly to your CRM with custom field mapping tailored to campaign and reporting requirements
      • Enjoy the most qualified and accurate data in your CRM with real-time alerts that inform you when a prospect changes job role/company
  • 28
    Inventale

    Inventale

    With 20+ years of programming background, Inventale specializes in high-quality software engineering projects. Our expertise lies in forecasting and recommendation systems built on unstructured data, Big Data processing and analytics, video recognition, geolocation, and audience analysis in different spheres, including online advertising, logistics, finance, medicine, biology, HR, law, and many others. We have also developed a first-class platform for publishers and media companies and successfully promoted it to the global market. In 2021, the product was acquired by BURT Intelligence to complement their platform. Inventale has:
      - extensive experience working with major global companies, market leaders, small businesses, and ambitious startups from the USA, the UK, Europe, and the MENA region;
      - 20+ clients worldwide;
      - 40+ enthusiastic professionals, ready to bring your ideas to life.
    Starting Price: $25,000
  • 29
    Alooma

    Google

    Alooma enables data teams to have visibility and control. It brings data from your various data silos together into BigQuery, all in real time. Set up and flow data in minutes, or customize, enrich, and transform data on the stream before it even hits the data warehouse. Never lose an event. Alooma's built-in safety nets ensure easy error handling without pausing your pipeline. For any number of data sources, from low to high volume, Alooma’s infrastructure scales to your needs.
  • 30
    Empolis

    Empolis

    Empolis Smart Cloud: We’re your partner with extensive experience in Smart Information Management. Based on Empolis Smart Cloud, we support you in developing your own service application, regardless of whether you wish to utilize the application in your own company or by licensing it to your customers. Empolis Service Express: Your company’s knowledge is distributed across different systems and lodged in your employees’ heads. That makes searching for answers difficult and time-consuming. Service Express compiles your company’s entire knowledge and establishes your central knowledge management, providing you with a single information source for all your questions.
    Starting Price: $16.50 per user per month