Compare the Top Big Data Software in Brazil as of November 2024 - Page 5

  • 1
    Qubole

    Qubole

    Qubole

    Qubole is a simple, open, and secure Data Lake Platform for machine learning, streaming, and ad-hoc analytics. Our platform provides end-to-end services that reduce the time and effort required to run Data pipelines, Streaming Analytics, and Machine Learning workloads on any cloud. No other platform offers the openness and data workload flexibility of Qubole while lowering cloud data lake costs by over 50 percent. Qubole delivers faster access to petabytes of secure, reliable and trusted datasets of structured and unstructured data for Analytics and Machine Learning. Users conduct ETL, analytics, and AI/ML workloads efficiently in end-to-end fashion across best-of-breed open source engines, multiple formats, libraries, and languages adapted to data volume, variety, SLAs and organizational policies.
  • 2
    Bizintel360
    AI powered self-service advanced analytics platform. Connect data sources and derive visualizations without any programming. Cloud native advanced analytics platform that provides high-quality data supply and intelligent real-time analysis across the enterprise without any code. Connect different data sources of different formats. Enables identification of root cause problems. Reduce cycle time: source to target. Analytics without programming knowledge. Real time data refresh on the go. Connect data source of any format, stream data in real time or defined frequency to data lake and visualize them in advanced interactive search engine-based dashboards. Descriptive, predictive and prescriptive analytics in a single platform with the power of search engine and advanced visualization. No traditional technology required to see data in various visualization formats. Roll up, slice and dice data with various mathematical computation right inside Bizintel360 visualization.
  • 3
    Quicksight

    Quicksight

    Amazon Web Services

    Amazon QuickSight allows everyone in your organization to understand your data by asking questions in natural language, exploring through interactive dashboards, or automatically looking for patterns and outliers powered by machine learning. QuickSight powers millions of dashboard views weekly for customers such as the NFL, Expedia, Volvo, Thomson Reuters, Best Western and Comcast, allowing their end-users to make better data-driven decisions. Ask conversational questions of your data and use Q’s ML-powered engine to receive relevant visualizations without the time-consuming data preparation from authors and admins. Discover hidden insights from your data, perform accurate forecasting and what-if analysis, or add easy-to-understand natural language narratives to dashboards by leveraging AWS' expertise in machine learning. Easily embed interactive visualizations and dashboards, sophisticated dashboard authoring, or natural language query capabilities in your applications.
  • 4
    Trendalyze

    Trendalyze

    Trendalyze

    Decisions can't wait. Compress machine learning projects from months to minutes. Like Google, our AI search engine brings you insights instantly. Inaccuracy costs money. Patterns reveal what KPIs and averages miss. TRND uncovers the patterns that provide the early warning signs missing from the KPIs. Empower the decision maker. Trends are most relevant to decision-makers who want to know whether a threat or an opportunity is bubbling up. In the digital economy knowledge is money. TRND enables creation of sharable pattern libraries that facilitate fast learning and deployment for business improvement. If you can't monitor all, you monetize none. TRND doesn't just find needles in haystacks; it constantly monitors all needles for relevant information. If you can't afford it, you can't do it. It used to be that scale broke the bank. Our search-based approach makes micro monitoring at scale affordable.
  • 5
    Arundo Enterprise
    Arundo Enterprise is a modular, flexible software suite to create data products for people. We connect live data to machine learning and other analytical models, and model outputs to business decisions. Arundo Edge Agent enables industrial connectivity and analytics in rugged, remote, or disconnected environments. Arundo Composer allows data scientists to quickly and easily deploy desktop-based analytical models into the Arundo Fabric cloud environment with a single command. Composer also enables companies to create and manage live data streams and integrate such streams with deployed data models. Arundo Fabric is the cloud-based hub for deployed machine learning models, data streams, edge agent management, and quick navigation to extended applications. Arundo offers a portfolio of high ROI SaaS products. Each of these solutions comes with a core out-of-the-box functional capability that leverages the core strengths of Arundo Enterprise.
  • 6
    Peak DSP

    Peak DSP

    Peak DSP (by Edge 226)

    Edge 226 is a global provider of data-driven tech solutions, focused on providing its clients with smart tools for quality and transparent user acquisition. Edge’s leading product is Peak DSP, a Performance-Driven DSP that enables programmatic buying for quality user acquisition and re-engagement. Peak DSP offers: • An AI-driven algorithm optimizing and predicting install & post-install events: registrations, subscriptions, purchases or any other action • Data-based targeting with Lookalike Audiences, External User Data and Audience Match • Direct integrations: ­ Owned & operated and direct apps ­ Mobile device manufacturers & carrier-based supply ­ Over 35 of the world’s top SSPs • All verticals and environments: Gaming, shopping, utilities, sports (etc.) campaigns across in-app, mobile web and desktop • Multiple creative types: ­ Rewarded video ­ Playable ads ­ Banners, native ads & text ads ­ HTML/Rich Media ­ JavaScript tags
  • 7
    Sigma

    Sigma

    Sigma Computing

    Sigma is a modern business intelligence (BI) and analytics application built for the cloud. Trusted by data-first companies, Sigma provides live access to cloud data warehouses using an intuitive spreadsheet interface empowering business experts to ask more of their data without writing a single line of code. With the full power of SQL, the cloud, and a familiar interface, business users have the freedom to analyze data in real time without limits. Sigma is self-service analytics as it was meant to be.
  • 8
    Atlan

    Atlan

    Atlan

    The modern data workspace. Make all your data assets from data tables to BI reports, instantly discoverable. Our powerful search algorithms combined with easy browsing experience, make finding the right asset, a breeze. Atlan auto-generates data quality profiles which make detecting bad data, dead easy. From automatic variable type detection & frequency distribution to missing values and outlier detection, we’ve got you covered. Atlan takes the pain away from governing and managing your data ecosystem! Atlan’s bots parse through SQL query history to auto construct data lineage and auto-detect PII data, allowing you to create dynamic access policies & best in class governance. Even non-technical users can directly query across multiple data lakes, warehouses & DBs using our excel-like query builder. Native integrations with tools like Tableau and Jupyter makes data collaboration come alive.
  • 9
    Gravwell

    Gravwell

    Gravwell

    Gravwell is an all-you-can-ingest data fusion analytics platform that enables complete context and root cause analytics for security and business data. Gravwell was founded to bring the benefits of usable machine data to all customers: large or small, text or binary, security or operational. When experienced hackers and big data experts team-up you get an analytics platform capable of things never seen before. Gravwell enables security analytics that go well beyond log data into industrial processes, vehicle fleets, IT infrastructure, or everything combined. Need to hunt down a suspected access breach? Gravwell can correlate building access logs and run facial recognition machine learning against camera data to isolate multiple subjects entering a facility with a single badge-in. We exist to provide analytics capabilities to people who need more than just text log searching and need it sooner rather than later at a price they can afford.
  • 10
    Intelligent Artifacts

    Intelligent Artifacts

    Intelligent Artifacts

    A new category of AI. Most current AI solutions are engineered through a statistical and purely mathematical lens. We took a different approach. With discoveries in information theory, the team at Intelligent Artifacts has built a new category of AI: a true AGI that eliminates current machine intelligence shortcomings. Our framework keeps the data and application layers separate from the intelligence layer allowing it to learn in real-time, and enabling it to explain predictions down to root cause. A true AGI demands a truly integrated platform. With Intelligent Artifacts, you'll model information, not data — predictions and decisions are real-time and transparent, and can be deployed across various domains without the need to rewrite code. And by combining specialized AI consultants with our dynamic platform, you'll get a customized solution that rapidly offers deep insights and greater outcomes from your data.
  • 11
    HEAVY.AI

    HEAVY.AI

    HEAVY.AI

    HEAVY.AI is the pioneer in accelerated analytics. The HEAVY.AI platform is used in business and government to find insights in data beyond the limits of mainstream analytics tools. Harnessing the massive parallelism of modern CPU and GPU hardware, the platform is available in the cloud and on-premise. HEAVY.AI originated from research at Harvard and MIT Computer Science and Artificial Intelligence Laboratory (CSAIL). Expand beyond the limitations of traditional BI and GIS by leveraging the full power of modern GPU and CPU hardware so you can extract decision-quality information from your massive datasets without lag. Unify and explore your largest geospatial and time-series datasets to get the complete picture of the what, when, and where. Combine interactive visual analytics, hardware-accelerated SQL, and an advanced analytics & data science framework to find opportunity and risk hidden in your enterprise when you need to most.
  • 12
    Incorta

    Incorta

    Incorta

    Direct is the shortest path from data to insight. Incorta empowers everyone in your business with a true self-service data experience and breakthrough performance for better decisions and incredible results. What if you could bypass fragile ETL and expensive data warehouses, and deliver data projects in days, instead of weeks or months? Our direct approach to analytics delivers true self-service in the cloud or on-premises with agility and performance. Incorta is used by the world’s largest brands to succeed where other analytics solutions fail. Across multiple industries and lines of business, we boast connectors and pre-built solutions for your enterprise applications and technologies. Game-changing innovation and customer success happen through Incorta’s partners including Microsoft, AWS, eCapital, and Wipro. Explore or join our thriving partner ecosystem.
  • 13
    TIBCO Clarity
    TIBCO Clarity is a data preparation tool that offers you on-demand software services from the web in the form of Software-as-a-Service. You can use TIBCO Clarity to discover, profile, cleanse, and standardize raw data collated from disparate sources and provide good quality data for accurate analysis and intelligent decision-making. You can collect your raw data from disparate sources in variety of data formats. The supported data sources are disk drives, databases, tables, and spreadsheets, both cloud and on-premise. TIBCO Clarity detects data patterns and data types for auto-metadata generation. You can profile row and column data for completeness, uniqueness, and variation. Predefined facets categorize data based on text occurrences and text patterns. You can use the numeric distributions to identify variations and outliers in the data.
  • 14
    EC2 Spot

    EC2 Spot

    Amazon

    Amazon EC2 Spot Instances let you take advantage of unused EC2 capacity in the AWS cloud. Spot Instances are available at up to a 90% discount compared to On-Demand prices. You can use Spot Instances for various stateless, fault-tolerant, or flexible applications such as big data, containerized workloads, CI/CD, web servers, high-performance computing (HPC), and test & development workloads. Because Spot Instances are tightly integrated with AWS services such as Auto Scaling, EMR, ECS, CloudFormation, Data Pipeline and AWS Batch, you can choose how to launch and maintain your applications running on Spot Instances. Moreover, you can easily combine Spot Instances with On-Demand, RIs and Savings Plans Instances to further optimize workload cost with performance. Due to the operating scale of AWS, Spot Instances can offer the scale and cost savings to run hyper-scale workloads.
    Starting Price: $0.01 per user, one-time payment,
  • 15
    Starburst Enterprise

    Starburst Enterprise

    Starburst Data

    Starburst helps you make better decisions with fast access to all your data; Without the complexity of data movement and copies. Your company has more data than ever before, but your data teams are stuck waiting to analyze it. Starburst unlocks access to data where it lives, no data movement required, giving your teams fast & accurate access to more data for analysis. Starburst Enterprise is a fully supported, production-tested and enterprise-grade distribution of open source Trino (formerly Presto® SQL). It improves performance and security while making it easy to deploy, connect, and manage your Trino environment. Through connecting to any source of data – whether it’s located on-premise, in the cloud, or across a hybrid cloud environment – Starburst lets your team use the analytics tools they already know & love while accessing data that lives anywhere.
  • 16
    E-MapReduce
    EMR is an all-in-one enterprise-ready big data platform that provides cluster, job, and data management services based on open-source ecosystems, such as Hadoop, Spark, Kafka, Flink, and Storm. Alibaba Cloud Elastic MapReduce (EMR) is a big data processing solution that runs on the Alibaba Cloud platform. EMR is built on Alibaba Cloud ECS instances and is based on open-source Apache Hadoop and Apache Spark. EMR allows you to use the Hadoop and Spark ecosystem components, such as Apache Hive, Apache Kafka, Flink, Druid, and TensorFlow, to analyze and process data. You can use EMR to process data stored on different Alibaba Cloud data storage service, such as Object Storage Service (OSS), Log Service (SLS), and Relational Database Service (RDS). You can quickly create clusters without the need to configure hardware and software. All maintenance operations are completed on its Web interface.
  • 17
    TiMi

    TiMi

    TIMi

    With TIMi, companies can capitalize on their corporate data to develop new ideas and make critical business decisions faster and easier than ever before. The heart of TIMi’s Integrated Platform. TIMi’s ultimate real-time AUTO-ML engine. 3D VR segmentation and visualization. Unlimited self service business Intelligence. TIMi is several orders of magnitude faster than any other solution to do the 2 most important analytical tasks: the handling of datasets (data cleaning, feature engineering, creation of KPIs) and predictive modeling. TIMi is an “ethical solution”: no “lock-in” situation, just excellence. We guarantee you a work in all serenity and without unexpected extra costs. Thanks to an original & unique software infrastructure, TIMi is optimized to offer you the greatest flexibility for the exploration phase and the highest reliability during the production phase. TIMi is the ultimate “playground” that allows your analysts to test the craziest ideas!
  • 18
    Apache Druid
    Apache Druid is an open source distributed data store. Druid’s core design combines ideas from data warehouses, timeseries databases, and search systems to create a high performance real-time analytics database for a broad range of use cases. Druid merges key characteristics of each of the 3 systems into its ingestion layer, storage format, querying layer, and core architecture. Druid stores and compresses each column individually, and only needs to read the ones needed for a particular query, which supports fast scans, rankings, and groupBys. Druid creates inverted indexes for string values for fast search and filter. Out-of-the-box connectors for Apache Kafka, HDFS, AWS S3, stream processors, and more. Druid intelligently partitions data based on time and time-based queries are significantly faster than traditional databases. Scale up or down by just adding or removing servers, and Druid automatically rebalances. Fault-tolerant architecture routes around server failures.
  • 19
    Privacera

    Privacera

    Privacera

    At the intersection of data governance, privacy, and security, Privacera’s unified data access governance platform maximizes the value of data by providing secure data access control and governance across hybrid- and multi-cloud environments. The hybrid platform centralizes access and natively enforces policies across multiple cloud services—AWS, Azure, Google Cloud, Databricks, Snowflake, Starburst and more—to democratize trusted data enterprise-wide without compromising compliance with regulations such as GDPR, CCPA, LGPD, or HIPAA. Trusted by Fortune 500 customers across finance, insurance, retail, healthcare, media, public and the federal sector, Privacera is the industry’s leading data access governance platform that delivers unmatched scalability, elasticity, and performance. Headquartered in Fremont, California, Privacera was founded in 2016 to manage cloud data privacy and security by the creators of Apache Ranger™ and Apache Atlas™.
  • 20
    TIBCO Data Science

    TIBCO Data Science

    TIBCO Software

    Democratize, collaborate, and operationalize, machine learning across your organization. Data science is a team sport. Data scientists, citizen data scientists, data engineers, business users, and developers need flexible and extensible tools that promote collaboration, automation, and reuse of analytic workflows. But algorithms are only one piece of the advanced analytic puzzle. To deliver predictive insights, companies need to increase focus on the deployment, management, and monitoring of analytic models. Smart businesses rely on platforms that support the end-to-end analytics lifecycle while providing enterprise security and governance. TIBCO® Data Science software helps organizations innovate and solve complex problems faster to ensure predictive findings quickly turn into optimal outcomes. TIBCO Data Science allows organizations to expand data science deployments across the organization by providing flexible authoring and deployment capabilities.
  • 21
    Oracle Big Data Preparation
    Oracle Big Data Preparation Cloud Service is a managed Platform as a Service (PaaS) cloud-based offering that enables you to rapidly ingest, repair, enrich, and publish large data sets with end-to-end visibility in an interactive environment. You can integrate your data with other Oracle Cloud Services, such as Oracle Business Intelligence Cloud Service, for down-stream analysis. Profile metrics and visualizations are important features of Oracle Big Data Preparation Cloud Service. When a data set is ingested, you have visual access to the profile results and summary of each column that was profiled, and the results of duplicate entity analysis completed on your entire data set. Visualize governance tasks on the service Home page with easily understood runtime metrics, data health reports, and alerts. Keep track of your transforms and ensure that files are processed correctly. See the entire data pipeline, from ingestion to enrichment and publishing.
  • 22
    Oracle Big Data Service
    Oracle Big Data Service makes it easy for customers to deploy Hadoop clusters of all sizes, with VM shapes ranging from 1 OCPU to a dedicated bare metal environment. Customers choose between high-performance NVmE storage or cost-effective block storage, and can grow or shrink their clusters. Quickly create Hadoop-based data lakes to extend or complement customer data warehouses, and ensure that all data is both accessible and managed cost-effectively. Query, visualize and transform data so data scientists can build machine learning models using the included notebook with its R, Python and SQL support. Move customer-managed Hadoop clusters to a fully-managed cloud-based service, reducing management costs and improving resource utilization.
    Starting Price: $0.1344 per hour
  • 23
    Obviously AI

    Obviously AI

    Obviously AI

    The entire process of building machine learning algorithms and predicting outcomes, packed in one single click. Not all data is built to be ready for ML, use the Data Dialog to seamlessly shape your dataset without wrangling your files. Share your prediction reports with your team or make them public. Allow anyone to start making predictions on your model. Bring dynamic ML predictions into your own app using our low-code API. Predict willingness to pay, score leads and much more in real-time. Obviously AI puts the world’s most cutting-edge algorithms in your hands, without compromising on performance. Forecast revenue, optimize supply chain, personalize marketing. You can now know what happens next. Add a CSV file OR integrate with your favorite data sources in minutes. Pick your prediction column from a dropdown, we'll auto build the AI. Beautifully visualize predicted results, top drivers and simulate "what-if" scenarios.
    Starting Price: $75 per month
  • 24
    IBM Cloud Pak for Data
    The biggest challenge to scaling AI-powered decision-making is unused data. IBM Cloud Pak® for Data is a unified platform that delivers a data fabric to connect and access siloed data on-premises or across multiple clouds without moving it. Simplify access to data by automatically discovering and curating it to deliver actionable knowledge assets to your users, while automating policy enforcement to safeguard use. Further accelerate insights with an integrated modern cloud data warehouse. Universally safeguard data usage with privacy and usage policy enforcement across all data. Use a modern, high-performance cloud data warehouse to achieve faster insights. Empower data scientists, developers and analysts with an integrated experience to build, deploy and manage trustworthy AI models on any cloud. Supercharge analytics with Netezza, a high-performance data warehouse.
    Starting Price: $699 per month
  • 25
    Bodo.ai

    Bodo.ai

    Bodo.ai

    Bodo’s powerful compute engine and parallel computing approach provides efficient execution and effective scalability even for 10,000+ cores and PBs of data. Bodo enables faster development and easier maintenance for data science, data engineering and ML workloads with standard Python APIs like Pandas. Avoid frequent failures with bare-metal native code execution and catch errors before they appear in production with end-to-end compilation. Experiment faster with large datasets on your laptop with the simplicity that only Python can provide. Write production-ready code without the hassle of refactoring for scaling on large infrastructure!
  • 26
    Oracle Cloud Infrastructure Data Flow
    Oracle Cloud Infrastructure (OCI) Data Flow is a fully managed Apache Spark service to perform processing tasks on extremely large data sets without infrastructure to deploy or manage. This enables rapid application delivery because developers can focus on app development, not infrastructure management. OCI Data Flow handles infrastructure provisioning, network setup, and teardown when Spark jobs are complete. Storage and security are also managed, which means less work is required for creating and managing Spark applications for big data analysis. With OCI Data Flow, there are no clusters to install, patch, or upgrade, which saves time and operational costs for projects. OCI Data Flow runs each Spark job in private dedicated resources, eliminating the need for upfront capacity planning. With OCI Data Flow, IT only needs to pay for the infrastructure resources that Spark jobs use while they are running.
    Starting Price: $0.0085 per GB per hour
  • 27
    Azure Data Lake Analytics
    Easily develop and run massively parallel data transformation and processing programs in U-SQL, R, Python, and .NET over petabytes of data. With no infrastructure to manage, you can process data on demand, scale instantly, and only pay per job. Process big data jobs in seconds with Azure Data Lake Analytics. There is no infrastructure to worry about because there are no servers, virtual machines, or clusters to wait for, manage, or tune. Instantly scale the processing power, measured in Azure Data Lake Analytics Units (AU), from one to thousands for each job. You only pay for the processing that you use per job. Act on all of your data with optimized data virtualization of your relational sources such as Azure SQL Database and Azure Synapse Analytics. Your queries are automatically optimized by moving processing close to the source data without data movement, which maximizes performance and minimizes latency.
    Starting Price: $2 per hour
  • 28
    IBM Db2 Big SQL
    A hybrid SQL-on-Hadoop engine delivering advanced, security-rich data query across enterprise big data sources, including Hadoop, object storage and data warehouses. IBM Db2 Big SQL is an enterprise-grade, hybrid ANSI-compliant SQL-on-Hadoop engine, delivering massively parallel processing (MPP) and advanced data query. Db2 Big SQL offers a single database connection or query for disparate sources such as Hadoop HDFS and WebHDFS, RDMS, NoSQL databases, and object stores. Benefit from low latency, high performance, data security, SQL compatibility, and federation capabilities to do ad hoc and complex queries. Db2 Big SQL is now available in 2 variations. It can be integrated with Cloudera Data Platform, or accessed as a cloud-native service on the IBM Cloud Pak® for Data platform. Access and analyze data and perform queries on batch and real-time data across sources, like Hadoop, object stores and data warehouses.
  • 29
    MOSTLY AI

    MOSTLY AI

    MOSTLY AI

    As physical customer interactions shift into digital, we can no longer rely on real-life conversations. Customers express their intents, share their needs through data. Understanding customers and testing our assumptions about them also happens through data. And privacy regulations such as GDPR and CCPA make a deep understanding even harder. The MOSTLY AI synthetic data platform bridges this ever-growing gap in customer understanding. A reliable, high-quality synthetic data generator can serve businesses in various use cases. Providing privacy-safe data alternatives is just the beginning of the story. In terms of versatility, MOSTLY AI's synthetic data platform goes further than any other synthetic data generator. MOSTLY AI's versatility and use case flexibility make it a must-have AI tool and a game-changing solution for software development and testing. From AI training to explainability, bias mitigation and governance to realistic test data with subsetting, referential integrity.
  • 30
    DataWorks

    DataWorks

    Alibaba Cloud

    DataWorks is a Big Data platform product launched by Alibaba Cloud. It provides one-stop Big Data development, data permission management, offline job scheduling, and other features. DataWorks works straight ‘out-the-box’ without the need to worry about complex underlying cluster establishment and operations & management. You can drag and drop nodes to create a workflow. You can also edit and debug your code online, and ask other developers to join you. Supports data integration, MaxCompute SQL, MaxCompute MR, machine learning, and shell tasks. Supports task monitoring and sends alarms when errors occur to avoid service interruptions. Runs millions of tasks concurrently and supports hourly, daily, weekly, and monthly schedules. DataWorks is the best platform for building big data warehouses and provides comprehensive data warehousing services. DataWorks provides a full solution for data aggregation, data processing, data governance, and data services.