Apache Gobblin

Apache Gobblin

Apache Software Foundation
MLlib

MLlib

Apache Software Foundation
+
+

Related Products

  • Google Cloud Platform
    60,933 Ratings
    Visit Website
  • RaimaDB
    12 Ratings
    Visit Website
  • MongoDB Atlas
    1,652 Ratings
    Visit Website
  • Microsoft Power BI
    8 Ratings
    Visit Website
  • Teradata VantageCloud
    1,107 Ratings
    Visit Website
  • OpenMetal
    39 Ratings
    Visit Website
  • PeerGFS
    28 Ratings
    Visit Website
  • Epicor Kinetic
    512 Ratings
    Visit Website
  • DataBuck
    6 Ratings
    Visit Website
  • Monitask
    368 Ratings
    Visit Website

About

A distributed data integration framework that simplifies common aspects of Big Data integration such as data ingestion, replication, organization, and lifecycle management for both streaming and batch data ecosystems. Runs as a standalone application on a single box. Also supports embedded mode. Runs as an mapreduce application on multiple Hadoop versions. Also supports Azkaban for launching mapreduce jobs. Runs as a standalone cluster with primary and worker nodes. This mode supports high availability and can run on bare metals as well. Runs as an elastic cluster on public cloud. This mode supports high availability. Gobblin as it exists today is a framework that can be used to build different data integration applications like ingest, replication, etc. Each of these applications is typically configured as a separate job and executed through a scheduler like Azkaban.

About

​Apache Spark's MLlib is a scalable machine learning library that integrates seamlessly with Spark's APIs, supporting Java, Scala, Python, and R. It offers a comprehensive suite of algorithms and utilities, including classification, regression, clustering, collaborative filtering, and tools for constructing machine learning pipelines. MLlib's high-quality algorithms leverage Spark's iterative computation capabilities, delivering performance up to 100 times faster than traditional MapReduce implementations. It is designed to operate across diverse environments, running on Hadoop, Apache Mesos, Kubernetes, standalone clusters, or in the cloud, and accessing various data sources such as HDFS, HBase, and local files. This flexibility makes MLlib a robust solution for scalable and efficient machine learning tasks within the Apache Spark ecosystem. ​

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Anyone seeking a solution to simplify data integration for their streaming and batch data ecosystems

Audience

Data scientists and engineers wanting a machine learning solution for efficient data processing and analysis within the Apache Spark framework

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

No information available.
Free Version
Free Trial

Pricing

No information available.
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Apache Software Foundation
United States
gobblin.apache.org

Company Information

Apache Software Foundation
Founded: 1995
United States
spark.apache.org/mllib/

Alternatives

E-MapReduce

E-MapReduce

Alibaba

Alternatives

Apache Spark

Apache Spark

Apache Software Foundation
Apache Spark

Apache Spark

Apache Software Foundation
MLlib

MLlib

Apache Software Foundation
Apache Mahout

Apache Mahout

Apache Software Foundation
Amazon EMR

Amazon EMR

Amazon

Categories

Categories

Integrations

Hadoop
Amazon EC2
Apache Cassandra
Apache HBase
Apache Hive
Apache Mesos
Apache Spark
Java
Kubernetes
MapReduce
Python
R
Scala

Integrations

Hadoop
Amazon EC2
Apache Cassandra
Apache HBase
Apache Hive
Apache Mesos
Apache Spark
Java
Kubernetes
MapReduce
Python
R
Scala
Claim Apache Gobblin and update features and information
Claim Apache Gobblin and update features and information
Claim MLlib and update features and information
Claim MLlib and update features and information