17 Integrations with Hortonworks Data Platform

View a list of Hortonworks Data Platform integrations and software that integrates with Hortonworks Data Platform below. Compare the best Hortonworks Data Platform integrations as well as features, ratings, user reviews, and pricing of software that integrates with Hortonworks Data Platform. Here are the current Hortonworks Data Platform integrations in 2024:

  • 1
    TensorFlow

    TensorFlow

    TensorFlow

    An end-to-end open source machine learning platform. TensorFlow is an end-to-end open source platform for machine learning. It has a comprehensive, flexible ecosystem of tools, libraries and community resources that lets researchers push the state-of-the-art in ML and developers easily build and deploy ML powered applications. Build and train ML models easily using intuitive high-level APIs like Keras with eager execution, which makes for immediate model iteration and easy debugging. Easily train and deploy models in the cloud, on-prem, in the browser, or on-device no matter what language you use. A simple and flexible architecture to take new ideas from concept to code, to state-of-the-art models, and to publication faster. Build, deploy, and experiment easily with TensorFlow.
    Starting Price: Free
  • 2
    Docker

    Docker

    Docker

    Docker takes away repetitive, mundane configuration tasks and is used throughout the development lifecycle for fast, easy and portable application development, desktop and cloud. Docker’s comprehensive end-to-end platform includes UIs, CLIs, APIs and security that are engineered to work together across the entire application delivery lifecycle. Get a head start on your coding by leveraging Docker images to efficiently develop your own unique applications on Windows and Mac. Create your multi-container application using Docker Compose. Integrate with your favorite tools throughout your development pipeline, Docker works with all development tools you use including VS Code, CircleCI and GitHub. Package applications as portable container images to run in any environment consistently from on-premises Kubernetes to AWS ECS, Azure ACI, Google GKE and more. Leverage Docker Trusted Content, including Docker Official Images and images from Docker Verified Publishers.
    Starting Price: $7 per month
  • 3
    Amazon S3

    Amazon S3

    Amazon

    Amazon Simple Storage Service (Amazon S3) is an object storage service that offers industry-leading scalability, data availability, security, and performance. This means customers of all sizes and industries can use it to store and protect any amount of data for a range of use cases, such as data lakes, websites, mobile applications, backup and restore, archive, enterprise applications, IoT devices, and big data analytics. Amazon S3 provides easy-to-use management features so you can organize your data and configure finely-tuned access controls to meet your specific business, organizational, and compliance requirements. Amazon S3 is designed for 99.999999999% (11 9's) of durability, and stores data for millions of applications for companies all around the world. Scale your storage resources up and down to meet fluctuating demands, without upfront investments or resource procurement cycles. Amazon S3 is designed for 99.999999999% (11 9’s) of data durability.
  • 4
    Google Cloud Storage
    Object storage for companies of all sizes. Store any amount of data. Retrieve it as often as you’d like. Configure your data with Object Lifecycle Management (OLM) to automatically transition to lower-cost storage classes when it meets the criteria you specify, such as when it reaches a certain age or when you’ve stored a newer version of the data. Cloud Storage has an ever-growing list of storage bucket locations where you can store your data with multiple automatic redundancy options. Whether you are optimizing for split-second response time, or creating a robust disaster recovery plan, customize where and how you store your data. Storage Transfer Service and Transfer Service for on-premises data offer two highly performant, online pathways to Cloud Storage—both with the scalability and speed you need to simplify the data transfer process. For offline data transfer our Transfer Appliance is a shippable storage server.
  • 5
    Apache Hive

    Apache Hive

    Apache Software Foundation

    The Apache Hive data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage using SQL. Structure can be projected onto data already in storage. A command line tool and JDBC driver are provided to connect users to Hive. Apache Hive is an open source project run by volunteers at the Apache Software Foundation. Previously it was a subproject of Apache® Hadoop®, but has now graduated to become a top-level project of its own. We encourage you to learn about the project and contribute your expertise. Traditional SQL queries must be implemented in the MapReduce Java API to execute SQL applications and queries over distributed data. Hive provides the necessary SQL abstraction to integrate SQL-like queries (HiveQL) into the underlying Java without the need to implement queries in the low-level Java API.
  • 6
    Protegrity

    Protegrity

    Protegrity

    Our platform allows businesses to use data—including its application in advanced analytics, machine learning, and AI—to do great things without worrying about putting customers, employees, or intellectual property at risk. The Protegrity Data Protection Platform doesn't just secure data—it simultaneously classifies and discovers data while protecting it. You can't protect what you don't know you have. Our platform first classifies data, allowing users to categorize the type of data that can mostly be in the public domain. With those classifications established, the platform then leverages machine learning algorithms to discover that type of data. Classification and discovery finds the data that needs to be protected. Whether encrypting, tokenizing, or applying privacy methods, the platform secures the data behind the many operational systems that drive the day-to-day functions of business, as well as the analytical systems behind decision-making.
  • 7
    Querona

    Querona

    YouNeedIT

    We make BI & Big Data analytics work easier and faster. Our goal is to empower business users and make always-busy business and heavily loaded BI specialists less dependent on each other when solving data-driven business problems. If you have ever experienced a lack of data you needed, time to consuming report generation or long queue to your BI expert, consider Querona. Querona uses a built-in Big Data engine to handle growing data volumes. Repeatable queries can be cached or calculated in advance. Optimization needs less effort as Querona automatically suggests query improvements. Querona empowers business analysts and data scientists by putting self-service in their hands. They can easily discover and prototype data models, add new data sources, experiment with query optimization and dig in raw data. Less IT is needed. Now users can get live data no matter where it is stored. If databases are too busy to be queried live, Querona will cache the data.
  • 8
    Progress DataDirect
    Empowering applications with enterprise data is our passion here at Progress DataDirect. We offer cloud and on-premises data connectivity solutions across relational, NoSQL, Big Data, and SaaS data sources. Performance, reliability, and security are at the heart of everything we design for thousands of enterprises and the leading vendors in analytics, BI, and data management. Minimize your development costs with our portfolio of high-value connectors for a variety of data sources. Enjoy 24/7 world-class support and security for greater peace of mind. Connect with affordable, easy-to-use, and time-saving drivers for faster SQL access to your data. As a leader in data connectivity, keeping up with the evolving trends in space is our mission. But if we haven’t built the connector you need yet, reach out and we’ll help you develop the right solution. Embed connectivity in an application or service.
  • 9
    jethro

    jethro

    jethro

    Data-driven decision-making has unleashed a surge of business data and a rise in user demand to analyze it. This trend drives IT departments to migrate off expensive Enterprise Data Warehouses (EDW) toward cost-effective Big Data platforms like Hadoop or AWS. These new platforms come with a Total Cost of Ownership (TCO) that is about 10 times lower. They are not ideal for interactive BI applications, however, as they fail to match the high performance and user concurrency of legacy EDWs. For this exact reason, we developed Jethro. Customers use Jethro for interactive BI on Big Data. Jethro is a transparent middle tier that requires no changes to existing apps or data. It is self-driving with no maintenance required. Jethro is compatible with BI tools like Tableau, Qlik, and Microstrategy and is data source agnostic. Jethro delivers on the demands of business users allowing for thousands of concurrent users to run complicated queries over billions of records.
  • 10
    Hadoop

    Hadoop

    Apache Software Foundation

    The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures. A wide variety of companies and organizations use Hadoop for both research and production. Users are encouraged to add themselves to the Hadoop PoweredBy wiki page. Apache Hadoop 3.3.4 incorporates a number of significant enhancements over the previous major release line (hadoop-3.2).
  • 11
    IBM Netezza Performance Server
    100% compatible with Netezza. Single command-line upgrade path. Available on premises, on cloud or hybrid. IBM® Netezza® Performance Server for IBM Cloud Pak® for Data is an advanced data warehouse and analytics platform available both on premises and on cloud. With enhancements to in-database analytics capabilities, this next generation of Netezza enables you to do data science and machine learning with data volumes scaling into the petabytes. Failure detection and fast failure recovery. Single command-line upgrade to existing systems. Ability to query many systems as one. Choose the data center or availability zone closest to you, set the number of compute units and amount of storage required to run, and go. IBM® Netezza® Performance Server for IBM Cloud Pak® for Data is available on IBM Cloud®, Amazon Web Services (AWS) and Microsoft Azure. Deployable on a private cloud, Netezza is powered by IBM Cloud Pak for Data System.
  • 12
    Value Innovation Labs Marketing Automation Platform
    Track your user behavior with power analytics. Segment users based on their behavior. Create engagement strategies with powerful AI. OS/Device level restrictions by certain handset makers restrict push notification delivery. With our product, you can bypass those restrictions to reach and engage an additional 20% of users. We ensure higher inbox reach with email consultants and industry experts to help you with the best practices. Avoid sending blast messages that end up in spam, or taint your domain and brand reputation. Localize the communication based on language, seamlessly. Our platform supports multilingual architecture and you can reach out to your customers in the local language for a local touch. Target users with acquisition source, uninstall data and more. Segment users just the way you want. Initiate conversation, reduce churn and do much more with powerful insights.
  • 13
    Value Innovation Labs Enterprise HRMS
    Assign, track, execute tasks, track productivity with powerful insight. Automate over 100+ tasks related and amplify human interactions with bots, group chat and more. Actionable insights that help Line Managers, HR Professionals & CXO achieve more. Define organizational structure, assign roles & permissions, grant access rights. Manage your employee life cycle from onboarding to exit, publish letters. Run error-free payroll, manage loans & reimbursements, meet statutory norms. Real-time attendance for managing attendance, holiday calendar, shifts and integration. Meet organizational goals & improve performance with 360-degree feedback. Boost employee morale & improve employee engagement using engagement tools. Real-time attendance for managing attendance, holiday calendar, shifts and integration. Meet organizational goals & improve performance with 360-degree feedback. Boost employee morale & improve employee engagement using engagement tools.
  • 14
    doolytic

    doolytic

    doolytic

    doolytic is leading the way in big data discovery, the convergence of data discovery, advanced analytics, and big data. doolytic is rallying expert BI users to the revolution in self-service exploration of big data, revealing the data scientist in all of us. doolytic is an enterprise software solution for native discovery on big data. doolytic is based on best-of-breed, scalable, open-source technologies. Lightening performance on billions of records and petabytes of data. Structured, unstructured and real-time data from any source. Sophisticated advanced query capabilities for expert users, Integration with R for advanced and predictive applications. Search, analyze, and visualize data from any format, any source in real-time with the flexibility of Elastic. Leverage the power of Hadoop data lakes with no latency and concurrency issues. doolytic solves common BI problems and enables big data discovery without clumsy and inefficient workarounds.
  • 15
    Amadea

    Amadea

    ISoft

    Amadea technology relies on the fastest real-time calculation and modeling engine on the market. Speed up the creation, deployment and automation of your analytics projects within the same integrated environment. Data quality is the key to analytical projects. Thanks to the ISoft real-time calculation engine, the fastest on the market, Amadea allows companies to prepare and use massive and/or complex data in real-time, regardless of the volume. ISoft started from a simple observation, successful analytical projects must involve the business users at every stage. Founded on a no-code interface, accessible to all types of users, Amadea allows everyone involved in analytical projects to take part. As Amadea has the fastest real-time calculation engine on the market, it lets you specify, prototype and build your data applications simultaneously. Amadea incorporates the fastest real-time data analysis engine on the market, 10 million lines per second & per core for standard calculations.
  • 16
    R Systems

    R Systems

    R Systems

    R Systems’ aim is to enable organizations to not only identify hindrances in the customer journey but also mitigate them to improve loyalty & long-term profitability by harnessing the power of AI, data analytics, Natural Language Processing (NLP) & Deep Neural Networks (DNN). Understanding the essence of customer experience helps you augment the way people respond to your brand in terms of engagement, loyalty, and retention. Organizations need accurate data and metrics to discern, gather, and analyze customer data for gaining actionable insights and making educated decisions to retain and acquire customers. Our data analytics framework will help you improve First Contact Resolution (FCR), reduce customer effort, optimize self-service options, and handle seasonal spikes in demand. With R Systems on your side, your customer experience will get better with every single interaction. Our services enable you to gather data from customer interactions to gain insights into behavior.
  • 17
    Apache Hadoop YARN

    Apache Hadoop YARN

    Apache Software Foundation

    The fundamental idea of YARN is to split up the functionalities of resource management and job scheduling/monitoring into separate daemons. The idea is to have a global ResourceManager (RM) and per-application ApplicationMaster (AM). An application is either a single job or a DAG of jobs. The ResourceManager and the NodeManager form the data-computation framework. The ResourceManager is the ultimate authority that arbitrates resources among all the applications in the system. The NodeManager is the per-machine framework agent who is responsible for containers, monitoring their resource usage (cpu, memory, disk, network) and reporting the same to the ResourceManager/Scheduler. The per-application ApplicationMaster is, in effect, a framework specific library and is tasked with negotiating resources from the ResourceManager and working with the NodeManager(s) to execute and monitor the tasks.
  • Previous
  • You're on page 1
  • Next