+
+

Related Products

  • StarTree
    25 Ratings
    Visit Website
  • MongoDB Atlas
    1,632 Ratings
    Visit Website
  • DataBuck
    6 Ratings
    Visit Website
  • AnalyticsCreator
    46 Ratings
    Visit Website
  • Device42
    173 Ratings
    Visit Website
  • groundcover
    32 Ratings
    Visit Website
  • Kasm Workspaces
    123 Ratings
    Visit Website
  • Stonebranch
    129 Ratings
    Visit Website
  • JS7 JobScheduler
    1 Rating
    Visit Website
  • Cycloid
    5 Ratings
    Visit Website

About

IBM® StreamSets enables users to create and manage smart streaming data pipelines through an intuitive graphical interface, facilitating seamless data integration across hybrid and multicloud environments. This is why leading global companies rely on IBM StreamSets to support millions of data pipelines for modern analytics, intelligent applications and hybrid integration. Decrease data staleness and enable real-time data at scale—handling millions of records of data, across thousands of pipelines within seconds. Insulate data pipelines from change and unexpected shifts with drag-and-drop, prebuilt processors designed to automatically identify and adapt to data drift. Create streaming pipelines to ingest structured, semistructured or unstructured data and deliver it to a wide range of destinations.

About

You select the size of the cluster, node capacity, and a set of services, and Yandex Data Proc automatically creates and configures Spark and Hadoop clusters and other components. Collaborate by using Zeppelin notebooks and other web apps via a UI proxy. You get full control of your cluster with root permissions for each VM. Install your own applications and libraries on running clusters without having to restart them. Yandex Data Proc uses instance groups to automatically increase or decrease computing resources of compute subclusters based on CPU usage indicators. Data Proc allows you to create managed Hive clusters, which can reduce the probability of failures and losses caused by metadata unavailability. Save time on building ETL pipelines and pipelines for training and developing models, as well as describing other iterative tasks. The Data Proc operator is already built into Apache Airflow.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

DevOps teams

Audience

Anyone interested in a solution for processing multi-terabyte data arrays

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

$1000 per month
Free Version
Free Trial

Pricing

$0.19 per hour
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

IBM
Founded: 1911
United States
www.ibm.com/products/streamsets

Company Information

Yandex
Founded: 1997
Russia
cloud.yandex.com/en/services/data-proc

Alternatives

Alternatives

Amazon MWAA

Amazon MWAA

Amazon
Apache Airflow

Apache Airflow

The Apache Software Foundation
Astro

Astro

Astronomer

Categories

Categories

DevOps Features

Approval Workflow
Dashboard
KPIs
Policy Management
Portfolio Management
Prioritization
Release Management
Timeline Management
Troubleshooting Reports

Streaming Analytics Features

Data Enrichment
Data Wrangling / Data Prep
Multiple Data Source Support
Process Automation
Real-time Analysis / Reporting
Visualization Dashboards

Integrations

Hadoop
Amazon Redshift
Amazon S3
Apache Cassandra
Apache HBase
Apache Hive
Apache Spark
Apache Zeppelin
Azure Data Lake Storage
Azure Industrial IoT
Couchbase
HPE Ezmeral Data Fabric
Matplotlib
MongoDB
MySQL
Redis
TensorFlow
Yandex DataSphere
pandas
scikit-image

Integrations

Hadoop
Amazon Redshift
Amazon S3
Apache Cassandra
Apache HBase
Apache Hive
Apache Spark
Apache Zeppelin
Azure Data Lake Storage
Azure Industrial IoT
Couchbase
HPE Ezmeral Data Fabric
Matplotlib
MongoDB
MySQL
Redis
TensorFlow
Yandex DataSphere
pandas
scikit-image
Claim IBM StreamSets and update features and information
Claim IBM StreamSets and update features and information
Claim Yandex Data Proc and update features and information
Claim Yandex Data Proc and update features and information