172 Integrations with Hadoop

View a list of Hadoop integrations and software that integrates with Hadoop below. Compare the best Hadoop integrations as well as features, ratings, user reviews, and pricing of software that integrates with Hadoop. Here are the current Hadoop integrations in 2024:

  • 1
    Nightfall

    Nightfall

    Discover, classify, and protect your sensitive data. Nightfall™ uses machine learning to identify business-critical data, like customer PII, across your SaaS, APIs, and data infrastructure, so you can manage & protect it. Integrate in minutes with cloud services via APIs to monitor data without agents. Machine learning classifies your sensitive data & PII with high accuracy, so nothing gets missed. Set up automated workflows for quarantines, deletions, alerts, and more, saving you time and keeping your business safe. Nightfall integrates directly with all your SaaS, APIs, and data infrastructure. Start building with Nightfall’s APIs for sensitive data classification & protection for free. Via REST API, programmatically get structured results from Nightfall’s deep learning-based detectors for things like credit card numbers, API keys, and more. Integrate with just a few lines of code. Seamlessly add data classification to your applications & workflows using Nightfall's REST API.
  • 2
    AutoSys Workload Automation
    Organizations need to effectively manage large volumes of complex, business-critical workloads across multiple applications and platforms. In such complex environments, there are a number of business challenges you have to address. Availability of critical business services. A single workload failure can have a significant impact on an organization’s capability to deliver services. Respond to real-time business events. Today’s on-demand business world requires real-time automation to efficiently respond to business events. Improve IT efficiency. Reducing IT costs continues to be a key requirement for organizations; at the same time, IT is expected to improve service delivery. AutoSys Workload Automation enhances visibility and control of complex workloads across platforms, ERP systems, and the cloud. It helps to reduce the cost and complexity of managing mission-critical business processes, ensuring consistent and reliable service delivery.
  • 3
    Proficio

    Proficio

    Proficio’s Managed Detection and Response (MDR) solution surpasses the capabilities of traditional Managed Security Services Providers (MSSPs). Our MDR service is powered by next-generation cybersecurity technology, and our security experts partner with you to become an extension of your team, continuously monitoring and investigating threats from our global network of security operations centers. Proficio’s advanced approach to threat detection leverages an extensive library of security use cases, the MITRE ATT&CK® framework, AI-based threat hunting models, business context modeling, and a threat intelligence platform. Through our global network of Security Operations Centers (SOCs), Proficio experts monitor, investigate and triage suspicious events. We significantly reduce the number of false positives and provide actionable alerts with remediation recommendations. Proficio is a leader in Security Orchestration Automation and Response (SOAR).
  • 4
    Kylo

    Teradata

    Kylo is an open source, enterprise-ready data lake management software platform for self-service data ingest and data preparation with integrated metadata management, governance, security and best practices inspired by Think Big's 150+ big data implementation projects. Self-service data ingest with data cleansing, validation, and automatic profiling. Wrangle data with visual SQL and an interactive transform through a simple user interface. Search and explore data and metadata, view lineage, and profile statistics. Monitor health of feeds and services in the data lake. Track SLAs and troubleshoot performance. Design batch or streaming pipeline templates in Apache NiFi and register with Kylo to enable user self-service. Organizations can expend significant engineering effort moving data into Hadoop yet struggle to maintain governance and data quality. Kylo dramatically simplifies data ingest by shifting ingest to data owners through a simple guided UI.
  • 5
    Apache Atlas

    Apache Software Foundation

    Atlas is a scalable and extensible set of core foundational governance services that enables enterprises to effectively and efficiently meet their compliance requirements within Hadoop and allows integration with the whole enterprise data ecosystem. Apache Atlas provides open metadata management and governance capabilities for organizations to build a catalog of their data assets, classify and govern these assets and provide collaboration capabilities around these data assets for data scientists, analysts and the data governance team. Pre-defined types for various Hadoop and non-Hadoop metadata. Ability to define new types for the metadata to be managed. Types can have primitive attributes, complex attributes, and object references, and can inherit from other types. Instances of types, called entities, capture metadata object details and their relationships. REST APIs to work with types and instances allow easier integration.
  • 6
    Microsoft Power Query
    Power Query is the easiest way to connect, extract, transform and load data from a wide range of sources. Power Query is a data transformation and data preparation engine. Power Query comes with a graphical interface for getting data from sources and a Power Query Editor for applying transformations. Because the engine is available in many products and services, the destination where the data will be stored depends on where Power Query was used. Using Power Query, you can perform the extract, transform, and load (ETL) processing of data. It is Microsoft’s data connectivity and data preparation technology that lets you seamlessly access data stored in hundreds of sources and reshape it to fit your needs, all with an easy-to-use, engaging, no-code experience. Power Query supports hundreds of data sources with built-in connectors, generic interfaces (such as REST APIs, ODBC, OLE DB and OData) and the Power Query SDK to build your own connectors.
  • 7
    SAS Data Loader for Hadoop
    Load your data into or out of Hadoop and data lakes. Prep it so it's ready for reports, visualizations or advanced analytics – all inside the data lakes. And do it all yourself, quickly and easily. Makes it easy to access, transform and manage data stored in Hadoop or data lakes with a web-based interface that reduces training requirements. Built from the ground up to manage big data on Hadoop or in data lakes; not repurposed from existing IT-focused tools. Lets you group multiple directives to run simultaneously or one after the other. Schedule and automate directives using the exposed Public API. Enables you to share and secure directives. Call them from SAS Data Integration Studio, uniting technical and nontechnical user activities. Includes built-in directives – casing, gender and pattern analysis, field extraction, match-merge and cluster-survive. Profiling runs in-parallel on the Hadoop cluster for better performance.
  • 8
    SAS MDM
    Integrate master data management technologies with those in SAS 9.4. SAS MDM is a web-based application that is accessed through the SAS Data Management Console. It provides a single, accurate and unified view of corporate data, integrating information from various data sources into one master record. SAS® Data Remediation and SAS® Task Manager work together with SAS MDM as well as with other software offerings, such as SAS® Data Management and SAS® Data Quality. SAS Data Remediation enables users to manage and correct issues triggered by business rules in SAS MDM batch jobs and real-time processes. SAS Task Manager is a complementary application to others that integrate with SAS Workflow technologies, giving users direct access to a workflow that might have been initiated from another SAS application. Users can start, stop, and transition workflows that have been uploaded to the SAS Workflow server environment.
  • 9
    Apache Knox

    Apache Software Foundation

    The Knox API Gateway is designed as a reverse proxy with consideration for pluggability in the areas of policy enforcement, through providers and the backend services for which it proxies requests. Policy enforcement ranges from authentication/federation, authorization, audit, dispatch, hostmapping and content rewrite rules. Policy is enforced through a chain of providers that are defined within the topology deployment descriptor for each Apache Hadoop cluster gated by Knox. The cluster definition is also defined within the topology deployment descriptor and provides the Knox Gateway with the layout of the cluster for purposes of routing and translation between user facing URLs and cluster internals. Each Apache Hadoop cluster that is protected by Knox has its set of REST APIs represented by a single cluster specific application context path. This allows the Knox Gateway to both protect multiple clusters and present the REST API consumer with a single endpoint.
  • 10
    The Respond Analyst
    Accelerate investigations and improve analyst productivity with an XDR Cybersecurity Solution. The Respond Analyst™, an XDR Engine, automates the discovery of security incidents by turning resource-intensive monitoring and initial analysis into thorough and consistent investigations. Unlike other XDR solutions, the Respond Analyst connects disparate evidence using probabilistic mathematics and integrated reasoning to determine the likelihood that events are malicious and actionable. The Respond Analyst augments security operations teams by significantly reducing the need to chase false positives, resulting in more time for threat hunting. The Respond Analyst allows you to choose best-of-breed controls to modernize your sensor grid. The Respond Analyst integrates with the leading security vendor offerings across important categories such as EDR, IPS, Web Filtering, EPP, Vulnerability Scanning, Authentication, and more.
  • 11
    Gurucul

    Gurucul

    Data science driven security controls to automate advanced threat detection, remediation and response. Gurucul’s Unified Security and Risk Analytics platform answers the question: Is anomalous behavior risky? This is our competitive advantage and why we’re different than everyone else in this space. We don’t waste your time with alerts on anomalous activity that isn’t risky. We use context to determine whether behavior is risky. Context is critical. Telling you what’s happening is not helpful. Telling you when something bad is happening is the Gurucul difference. That’s information you can act on. We put your data to work. We are the only security analytics company that can consume all your data out-of-the-box. We can ingest data from any source – SIEMs, CRMs, electronic medical records, identity and access management systems, end points – you name it, we ingest it into our enterprise risk engine.
  • 12
    Securonix Security Operations and Analytics
    The Securonix Security Operations and Analytics Platform combines log management; user and entity behavior analytics (UEBA); next-generation security information and event management (SIEM); network detection and response (NDR); and security orchestration, automation and response (SOAR) into a complete, end-to-end security operations platform. The Securonix platform delivers unlimited scale, powered by advanced analytics, behavior detection, threat modeling, and machine learning. It increases your security through improved visibility, actionability, and security posture, while reducing management and analyst burden. With native support for thousands of third-party vendors and technology solutions, the Securonix platform simplifies security operations, events, escalations, and remediations. It easily scales from startups to global enterprises while providing the same fast security ROI and ongoing transparent and predictable cost.
  • 13
    Voltage SecureData
    Secure sensitive data wherever it flows: on premises, in the cloud, and in big data analytic platforms. Voltage encryption delivers data privacy protection, neutralizes data breaches, and drives business value through secure data use. Data protection builds customer trust and enables compliance with global regulations, including GDPR, CCPA, and HIPAA. Privacy regulations recommend encryption, pseudonymization, and anonymization to protect personal data. Voltage SecureData enables enterprises to de-identify sensitive structured data and support the use of data in its protected state to safely drive business value. Ensure that applications operate on secure data flowing through the enterprise with no gaps, no decryption, and no performance overhead. SecureData supports the broadest range of platforms and encrypts data in any language. Structured Data Manager integrates SecureData so that businesses can easily and continuously protect data throughout the lifecycle, from discovery to encryption.
  • 14
    Enterprise Recon

    Ground Labs

    With Enterprise Recon by Ground Labs, organizations can find and remediate sensitive information across the broadest range of structured and unstructured data, whether it’s stored on your servers, on your employees’ devices, or in the cloud. Enterprise Recon enables organizations worldwide to seamlessly discover all of their data and comply with GDPR, PCI DSS, CCPA, HIPAA, Australian Privacy and other data security standards that require the ability to locate and secure PII data as well as information on gender, ethnicity and health… or even non-PII financial data. Enterprise Recon is powered by GLASS™, Ground Labs' proprietary technology that enables the quickest and most accurate data discovery across the broadest set of platforms available. Enterprise Recon natively supports sensitive data discovery on Windows, macOS, Linux, FreeBSD, Solaris, HP-UX and IBM AIX using agent and agentless options. Additional remote options also enable almost any network data stored.
  • 15
    iScramble

    Mage Data

    Mage™ Static Data Masking (SDM) and Test data Management (TDM) capabilities fully integrate with Imperva’s Data Security Fabric (DSF) delivering complete protection for all sensitive or regulated data while simultaneously integrating seamlessly with an organization’s existing IT framework and existing application development, testing and data flows without the requirement for any additional architectural changes.
  • 16
    iMask

    Mage Data

    The Mage™ Dynamic Data Masking module of the Mage data security platform has been designed with end-customer needs taken into consideration. Mage™ Dynamic Data Masking has been developed working alongside our customers to address the specific needs and requirements they have. As a result, this product has evolved to meet all the use cases that an enterprise could possibly have. Most other solutions in the market are either part of an acquisition or are developed to meet only a specific use case. Mage™ Dynamic Data Masking has been designed to deliver adequate protection to sensitive data in production for application and database users while simultaneously integrating seamlessly with an organization's existing IT framework without the requirement of any additional architectural changes.
  • 17
    Acxiom Real Identity
    Real Identity™ delivers sub second decisions to power relevant messages in real time. Real Identity enables the world’s biggest brands to accurately identify and ethically connect with people anytime, anywhere to create relevant experiences. Engage people with reach, scale and precision across every interaction. Manage and maintain identity across your enterprise by leveraging 50 years of data and identity expertise combined with the latest artificial intelligence and machine learning techniques. The adtech environment requires speed and access to identity and data to enable personalization and decisioning use cases. In a cookieless world, first-party data signals will drive these functions while the conversation continues to be between people, the brands, and the publishers. By delivering experiences that matter, across all channels, you can wow your customer and prospects while staying ahead of regulations and ahead of your competition.
  • 18
    ContextIQ
    Online consumers prefer recommendations that are relevant to their needs or interests. You can now give them an enhanced experience with behavior profiling and contextual targeting. Use our recommendation engine to offer more focused personalization. Keep your visitors hooked with personalized content. The more time a user spends on a site, the higher the chances of a conversion. Help shoppers find items buried deep within your eCommerce store. Increase sales through timely and intelligent product recommendations. Showcase products or content that interest the user. Only relevant suggestions capture user attention and lead to fruitful interactions. ContextIQ is an easy-to-deploy personalization solution that uses collaborative filtering algorithms to produce recommendations. It is capable of suggesting content to users through behavioral targeting.
  • 19
    Apache Sentry

    Apache Software Foundation

    Apache Sentry™ is a system for enforcing fine-grained, role-based authorization to data and metadata stored on a Hadoop cluster. Apache Sentry graduated from the Incubator in March of 2016 and is now a Top-Level Apache project. Apache Sentry is a granular, role-based authorization module for Hadoop. Sentry provides the ability to control and enforce precise levels of privileges on data for authenticated users and applications on a Hadoop cluster. Sentry currently works out of the box with Apache Hive, Hive Metastore/HCatalog, Apache Solr, Impala and HDFS (limited to Hive table data). Sentry is designed to be a pluggable authorization engine for Hadoop components. It allows you to define authorization rules to validate a user or application’s access requests for Hadoop resources. Sentry is highly modular and can support authorization for a wide variety of data models in Hadoop.
  • 20
    CloudSwyft

    CloudSwyft

    CloudSwyft has built one of the fastest growing end-to-end cloud-based technology learning platforms globally, focused on supporting the innovative delivery of modern 21st century technology skills training and credentialing to meet the demands of rapid digital transformation. We provide cloud-based learning platforms, customized hands-on labs, digital credentialing and an innovative blended learning experience product. We provide this technology to a broad range of higher learning institutions, governments and corporates across our home markets of Asia Pacific and the Middle East and to the world’s largest MOOC providers. With our technology content partners, Microsoft and UiPath, we have used this same technology to deliver premium online technology skills training to these same customers and direct to individual learners in partnership with a broad range of leading B2C platforms.
  • 21
    Apache Bigtop

    Apache Software Foundation

    Bigtop is an Apache Foundation project for Infrastructure Engineers and Data Scientists looking for comprehensive packaging, testing, and configuration of the leading open source big data components. Bigtop supports a wide range of components/projects, including, but not limited to, Hadoop, HBase and Spark. Bigtop packages Hadoop RPMs and DEBs, so that you can manage and maintain your Hadoop cluster. Bigtop provides an integrated smoke testing framework, alongside a suite of over 50 test files. Bigtop provides Vagrant recipes, raw images, and (work-in-progress) Docker recipes for deploying Hadoop from zero. Bigtop supports many operating systems, including Debian, Ubuntu, CentOS, Fedora, openSUSE and many others. Bigtop includes tools and a framework for testing at various levels (packaging, platform, runtime, etc.) for both initial deployments and upgrade scenarios for the entire data platform, not just the individual components.
  • 22
    iFinder

    IntraFind Software

    IntraFind's Enterprise Search solution iFinder is a central search platform for all of your company's data. iFinder can be connected to all of the data sources within your company. Are your data pools constantly growing? With iFinder you are well equipped for the future: our product is based on Elasticsearch technology and can therefore adapt to any volume of data with ease. It also improves search results by deploying artificial intelligence to deliver smart enterprise search functionality. iFinder helps you find important data and documents, whether they are located on a company drive, on the intranet, in wikis, or in e-mail systems. Take the next step in your company’s digital transformation by centralizing access to all company data with our enterprise search application.
  • 23
    NVMesh

    Excelero

    Excelero delivers low-latency distributed block storage for web-scale applications. NVMesh enables shared NVMe across any network and supports any local or distributed file system. The solution features an intelligent management layer that abstracts underlying hardware with CPU offload, creates logical volumes with redundancy, and provides centralized, intelligent management and monitoring. Applications can enjoy the latency, throughput and IOPs of a local NVMe device with the convenience of centralized storage while avoiding proprietary hardware lock-in and reducing the overall storage TCO. NVMesh features a distributed block layer that allows unmodified applications to utilize pooled NVMe storage devices across a network at local speeds and latencies. Distributed NVMe storage resources are pooled with the ability to create arbitrary, dynamic block volumes that can be utilized by any host running the NVMesh block client.
  • 24
    lakeFS

    Treeverse

    lakeFS enables you to manage your data lake the way you manage your code. Run parallel pipelines for experimentation and CI/CD for your data. Simplifying the lives of engineers, data scientists and analysts who are transforming the world with data. lakeFS is an open source platform that delivers resilience and manageability to object-storage based data lakes. With lakeFS you can build repeatable, atomic and versioned data lake operations, from complex ETL jobs to data science and analytics. lakeFS supports AWS S3, Azure Blob Storage and Google Cloud Storage (GCS) as its underlying storage service. It is API compatible with S3 and works seamlessly with all modern data frameworks such as Spark, Hive, AWS Athena, Presto, etc. lakeFS provides a Git-like branching and committing model that scales to exabytes of data by utilizing S3, GCS, or Azure Blob for storage.
  • 25
    Prodea

    Prodea

    Launch secure, scalable and globally compliant connected products with services within six months. Prodea provides the only IoT platform-as-a-service (PaaS) that was specifically designed for manufacturers of mass-market consumer home products. It is comprised of three main services. IoT Service X-Change Platform, for quickly launching connected products with services across global markets requiring minimal development. Insight™ Data Services, to gain key insights from user and product usage data. And EcoAdaptor™ Service, to enhance product value through cloud-to-cloud integration and interoperability with other products and services. Prodea has helped its global brand customers launch 100+ connected products, in less than six months on average, across six continents. This was made possible by using the Prodea X5 Program which was designed to work with our three main cloud services to help brands evolve their systems.
  • 26
    GO+

    GO+

    GO+ offers development tools for service-provider companies. The platform lets service providers develop additional services for their business customers. Its algorithms are designed to support the load from a huge number of devices at the same time. The platform spares the service provider the problem of creating new services for its customers. The core of the platform is an analytical decision-making engine. The Granular Computing based analytical engine provides data processing and analysis with complex event processing. We use cloud-based technologies designed to carry business logic from real devices directly to the cloud. Scalability allows us to provide solutions at lower cost. The platform's scripting engine provides developers with a full stack of dev tools to develop highly customized IoT services regardless of industry application. The cloud-based IoT platform GO+ is built using the most advanced cloud computing technology.
  • 27
    Foghub

    Foghub

    Simplified IT/OT Integration, Data Engineering & Real-Time Edge Intelligence. Easy to use, cross-platform, open architecture, edge computing for industrial time-series data. Foghub offers the Critical-Path to IT/OT convergence, connecting Operations (Sensors, Devices, and Systems) with Business (People, Processes, and Applications), enabling automated data acquisition, data engineering, transformations, advanced analytics and ML. Handle large variety, volume, and velocity of industrial data with out-of-the-box support for all data types, most popular industrial network protocols, OT/lab systems, and databases. Easily automate the collection of data about your production runs, batches, parts, cycle-times, process parameters, asset condition, performance, health, utilities, consumables as well as operators and their performance. Designed for scale, Foghub offers a comprehensive set of capabilities to handle large volumes and velocity of data.
  • 28
    Brainwave GRC

    Radiant Logic

    Brainwave is reinventing the way you analyze your user accesses! You will now be able to thoroughly analyze access risk thanks to a new user interface, predictive controls and risk-scoring functionality. With Autonomous Identity, you can engage your teams and improve their efficiency with a market-approved, ergonomic tool that accelerates your identity governance and administration (IGA) program. Enable the business to review and make decisions about access to shared files and folders. Inventory, classify, review access and demonstrate compliance regardless of the location: file servers, NAS, SharePoint, Office 365 and others. Our core product, Brainwave Identity GRC, provides a wealth of analytical capabilities to leverage the inventory of all access. Obtain full visibility at all times, on all resources. Brainwave’s inventory constitutes an entitlement catalog across infrastructure, business applications and data access.
  • 29
    Apache Kylin

    Apache Software Foundation

    Apache Kylin™ is an open source, distributed Analytical Data Warehouse for Big Data; it was designed to provide OLAP (Online Analytical Processing) capability in the big data era. By renovating the multi-dimensional cube and precalculation technology on Hadoop and Spark, Kylin is able to achieve near-constant query speed regardless of the ever-growing data volume. Reducing query latency from minutes to sub-second, Kylin brings online analytics back to big data. Kylin can analyze 10+ billion rows in less than a second. No more waiting on reports for critical decisions. Kylin connects data on Hadoop to BI tools like Tableau, PowerBI/Excel, MSTR, QlikSense, Hue and SuperSet, making BI on Hadoop faster than ever. As an Analytical Data Warehouse, Kylin offers ANSI SQL on Hadoop/Spark and supports most ANSI SQL query functions. Kylin can support thousands of interactive queries at the same time, thanks to the low resource consumption of each query.
  • 30
    Apache Zeppelin
    Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more. The IPython interpreter provides a user experience comparable to Jupyter Notebook. This release includes note-level dynamic forms, a note revision comparator and the ability to run paragraphs sequentially, instead of the simultaneous paragraph execution of previous releases. The interpreter lifecycle manager automatically terminates the interpreter process on idle timeout, so resources are released when they're not in use.
  • 31
    SOLIXCloud CDP

    Solix Technologies

    SOLIXCloud CDP delivers cloud data management as-a-service for modern data-driven enterprises. Built on open source, cloud-native technologies, SOLIXCloud CDP helps companies manage and process all of their structured, semi-structured and unstructured data for advanced analytics, compliance, infrastructure optimization and data security. With features such as Solix Connect for data ingestion, Solix Data Governance, Solix Metadata Management and Solix Search, SOLIXCloud CDP offers a comprehensive cloud data management application framework to build and run data-driven applications such as SQL data warehouse, machine learning and artificial intelligence while fulfilling the ever-growing data management requirements of complex data regulations, data retention and consumer data privacy.
  • 32
    SOLIXCloud

    Solix Technologies

    Data volume keeps growing, but not all data has equal value. Cloud data management enables forward thinking companies to reduce the cost of managing enterprise data and still provide security, compliance, performance and easy access. As content ages, it loses value, but organizations can still monetize their less current data through modern SaaS-based solutions. SOLIXCloud delivers all of the capabilities required to strike the perfect balance between historical and current data management. With a complete suite of compliance features for structured, unstructured, and semi-structured data, SOLIXCloud offers a fully managed service for all enterprise data. Solix metadata management is an end-to-end framework to explore all enterprise metadata and lineage from a centralized repository and business glossary.
  • 33
    Quantexa

    Quantexa

    Uncover hidden risk and reveal new, unexpected opportunities with graph analytics across the customer lifecycle. Standard MDM solutions are not built for high volumes of distributed, disparate data that is generated by various applications and external sources. Traditional MDM probabilistic matching doesn’t work well with siloed data sources. It misses connections, loses context, leads to inaccurate decision-making, and leaves business value on the table. An ineffective MDM solution affects everything from customer experience to operational performance. Without on-demand visibility of holistic payment patterns, trends and risk, your team can’t make the right decisions quickly, compliance costs escalate, and you can’t increase coverage fast enough. Your data isn’t connected, so customers suffer fragmented experiences across channels, business lines and geographies. Attempts at personalized engagement fall short as these are based on partial, often outdated data.
  • 34
    witboost

    Agile Lab

    witboost is a modular, scalable, fast, efficient data management system that helps your company become truly data driven and reduce time-to-market, IT expenditures and overheads. witboost comprises a series of modules. These are building blocks that can work as standalone solutions to address and solve a single need or problem, or they can be combined to create the perfect data management ecosystem for your company. Each module improves a specific data engineering function, and they can be combined to create the perfect solution to answer your specific needs, guaranteeing a blazingly fast and smooth implementation, thus dramatically reducing time-to-market, time-to-value and consequently the TCO of your data engineering infrastructure. Smart Cities need digital twins to predict needs and avoid unforeseen problems, gathering data from thousands of sources and managing ever more complex telematics.
  • 35
    ScriptString

    ScriptString

    ScriptString

    Optimize your document knowledge and make critical decisions with confidence. Tired of manual processing, time constraints, budget pressures, and shifting compliance requirements? Hassle-free collection and integration of your cloud spend data in half the time at half the cost. Recommended cost savings and guidance to save more than 50% of total spend. Gain 360° visibility of your entire cloud spend with KPI tracking, real-time insights, and recommendations. Built-in peace of mind with security and compliance protection to meet any standards. Gather data via portal, email, API, repository, table, data lake, or 3rd-party data source. Automated, AI-powered intelligent document processing eliminates manual effort. Intelligent review of document knowledge identifies anomalies, duplicates, and errors. Find the needle in the haystack with ScriptString's Knowledge Relationship Indexing.
  • 36
    Occubee

    Occubee

    3SOFT

    The Occubee platform automatically converts large amounts of receipt data, information on thousands of products, and dozens of retail-specific factors into valuable sales and demand forecasts. In stores, Occubee forecasts sales individually for each product and generates replenishment commands. In warehouses, Occubee optimizes the availability of goods and allocated capital, and generates orders for suppliers. In the head office, Occubee provides real-time monitoring of sales processes and generates anomaly alerts and reports. Modern technologies for data collection and processing ensure automation of key business processes in the retail industry. Occubee fully responds to the needs of modern retail and fits in with the global megatrends related to the use of data in business.
  • 37
    Acxiom InfoBase
    Acxiom enables you to leverage comprehensive data for premium audiences and insights across the globe. Better understand, identify, and target ideal audiences by engaging and personalizing experiences across digital and offline channels. With marketing technology, identity resolution and digital connectivity converging in a “borderless digital world,” brands can now quickly locate data attributes, service availability and the digital footprint across the globe to fuel informed decisions. Acxiom is the global data leader with thousands of data attributes in more than 60 countries helping brands improve millions of customer experiences every day through meaningful data-driven insights, all while protecting consumer privacy. Understand, reach and engage audiences everywhere, maximize your media investments and power more personalized experiences. Reach audiences around the globe and deliver experiences that matter with Acxiom data.
  • 38
    Deeplearning4j

    Deeplearning4j

    Deeplearning4j

    DL4J takes advantage of the latest distributed computing frameworks including Apache Spark and Hadoop to accelerate training. On multi-GPUs, it is equal to Caffe in performance. The libraries are completely open-source, Apache 2.0, and maintained by the developer community and Konduit team. Deeplearning4j is written in Java and is compatible with any JVM language, such as Scala, Clojure, or Kotlin. The underlying computations are written in C, C++, and CUDA. Keras will serve as the Python API. Eclipse Deeplearning4j is the first commercial-grade, open-source, distributed deep-learning library written for Java and Scala. Integrated with Hadoop and Apache Spark, DL4J brings AI to business environments for use on distributed GPUs and CPUs. There are a lot of parameters to adjust when you're training a deep-learning network. We've done our best to explain them, so that Deeplearning4j can serve as a DIY tool for Java, Scala, Clojure, and Kotlin programmers.
  • 39
    Span Global Services

    Span Global Services

    Span Global Services

    Span Global Services is the powerhouse for digital and data-driven marketing services. We put targeted insight into every campaign, fueling your B2B sales and marketing results with data and insights across a plethora of industries: technology, healthcare, manufacturing, retail, telecommunication, and more. With over 90 million multi-verified contacts, business firmographics, business entity relationships, business intelligence, and active social profile details, our customized databases can fulfill the data requirements of large enterprises and SMEs alike. We acquire and validate data through technology, public records, and the human element: people contacting people. Our sales and marketing clients enjoy higher MQL and conversions, data quality guarantees, custom appending and profiling services, marketing automation, and the industry’s best subject matter expertise.
  • 40
    Apache Kudu

    Apache Kudu

    The Apache Software Foundation

    A Kudu cluster stores tables that look just like tables you're used to from relational (SQL) databases. A table can be as simple as a binary key and value, or as complex as a few hundred different strongly-typed attributes. Just like SQL, every table has a primary key made up of one or more columns. This might be a single column like a unique user identifier, or a compound key such as a (host, metric, timestamp) tuple for a machine time-series database. Rows can be efficiently read, updated, or deleted by their primary key. Kudu's simple data model makes it a breeze to port legacy applications or build new ones, with no need to worry about how to encode your data into binary blobs or make sense of a huge database full of hard-to-interpret JSON. Tables are self-describing, so you can use standard tools like SQL engines or Spark to analyze your data. Kudu's APIs are designed to be easy to use.
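    The compound-key idea above can be illustrated with a minimal in-memory sketch. This is not the Kudu client API; `MetricKey` and `MetricTable` are hypothetical names chosen for illustration, and the sketch only shows how a (host, metric, timestamp) tuple can act as the single handle for reading, updating, and deleting a row.

```python
from typing import Dict, NamedTuple, Optional


class MetricKey(NamedTuple):
    """Compound primary key: (host, metric, timestamp). Hypothetical name."""
    host: str
    metric: str
    timestamp: int


class MetricTable:
    """Toy in-memory table keyed by a compound primary key (not the Kudu API)."""

    def __init__(self) -> None:
        self._rows: Dict[MetricKey, float] = {}

    def insert(self, key: MetricKey, value: float) -> None:
        # A primary key identifies at most one row, so duplicates are rejected.
        if key in self._rows:
            raise KeyError(f"duplicate primary key: {key}")
        self._rows[key] = value

    def read(self, key: MetricKey) -> Optional[float]:
        # Point lookup by primary key; returns None when the row is absent.
        return self._rows.get(key)

    def update(self, key: MetricKey, value: float) -> None:
        # Updates address an existing row by its primary key.
        if key not in self._rows:
            raise KeyError(f"no row with primary key: {key}")
        self._rows[key] = value

    def delete(self, key: MetricKey) -> None:
        # Deletes the row addressed by the primary key.
        del self._rows[key]


table = MetricTable()
key = MetricKey("web01", "cpu_load", 1700000000)
table.insert(key, 0.42)
table.update(key, 0.57)
```

    A dictionary stands in for the storage engine here; a real cluster distributes and persists rows, which this sketch does not attempt to model.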
  • 41
    Apache Parquet

    Apache Parquet

    The Apache Software Foundation

    We created Parquet to make the advantages of compressed, efficient columnar data representation available to any project in the Hadoop ecosystem. Parquet is built from the ground up with complex nested data structures in mind, and uses the record shredding and assembly algorithm described in the Dremel paper. We believe this approach is superior to simple flattening of nested namespaces. Parquet is built to support very efficient compression and encoding schemes. Multiple projects have demonstrated the performance impact of applying the right compression and encoding scheme to the data. Parquet allows compression schemes to be specified on a per-column level, and is future-proofed to allow adding more encodings as they are invented and implemented. Parquet is built to be used by anyone. The Hadoop ecosystem is rich with data processing frameworks, and we are not interested in playing favorites.
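    As a rough illustration of per-column compression, the sketch below stores each column separately and lets the caller pick a codec per column. It does not use or reproduce the Parquet file format; `CODECS`, `write_columns`, and `read_column` are hypothetical helpers built only on the Python standard library, and JSON stands in for a real column encoding.

```python
import bz2
import json
import zlib
from typing import Any, Callable, Dict, List, Tuple

# Each codec is a (compress, decompress) pair; the names are assumptions.
Codec = Tuple[Callable[[bytes], bytes], Callable[[bytes], bytes]]

CODECS: Dict[str, Codec] = {
    "zlib": (zlib.compress, zlib.decompress),
    "bz2": (bz2.compress, bz2.decompress),
    "none": (lambda b: b, lambda b: b),
}


def write_columns(
    rows: List[Dict[str, Any]], codec_per_column: Dict[str, str]
) -> Dict[str, Tuple[str, bytes]]:
    """Pivot rows into columns and compress each column with its own codec."""
    columns: Dict[str, Tuple[str, bytes]] = {}
    for name, codec_name in codec_per_column.items():
        values = [row[name] for row in rows]
        compress, _ = CODECS[codec_name]
        payload = compress(json.dumps(values).encode("utf-8"))
        # The codec name is stored next to the data so readers can decode it.
        columns[name] = (codec_name, payload)
    return columns


def read_column(columns: Dict[str, Tuple[str, bytes]], name: str) -> List[Any]:
    """Decompress and decode a single column without touching the others."""
    codec_name, payload = columns[name]
    _, decompress = CODECS[codec_name]
    return json.loads(decompress(payload).decode("utf-8"))


rows = [{"user_id": i, "country": "DE" if i % 2 else "FR"} for i in range(1000)]
stored = write_columns(rows, {"user_id": "zlib", "country": "bz2"})
```

    Because the codec is recorded per column, a reader can decode `country` without knowing how `user_id` was compressed, and a new codec can be registered in `CODECS` without changing the read or write functions.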
  • 42
    Hypertable

    Hypertable

    Hypertable

    Hypertable delivers scalable database capacity at maximum performance to speed up your big data application and reduce your hardware footprint. Hypertable delivers maximum efficiency and superior performance over the competition which translates into major cost savings. A proven scalable design that powers hundreds of Google services. All the benefits of open source with a strong and thriving community. C++ implementation for optimum performance. 24/7/365 support for your business-critical big data application. Unparalleled access to Hypertable brain power by the employer of all core Hypertable developers. Hypertable was designed for the express purpose of solving the scalability problem, a problem that is not handled well by a traditional RDBMS. Hypertable is based on a design developed by Google to meet their scalability requirements and solves the scale problem better than any of the other NoSQL solutions out there.
  • 43
    Apache Pinot

    Apache Pinot

    The Apache Software Foundation

    Pinot is designed to answer OLAP queries with low latency on immutable data. Pluggable indexing technologies: Sorted Index, Bitmap Index, Inverted Index. Joins are currently not supported, but this problem can be overcome by using Trino or PrestoDB for querying. SQL-like language that supports selection, aggregation, filtering, group by, order by, and distinct queries on data. Consists of both offline and real-time tables. Use the real-time table only to cover segments for which offline data may not be available yet. Detect the right anomalies by customizing the anomaly detection flow and notification flow.
  • 44
    Apache Hudi

    Apache Hudi

    The Apache Software Foundation

    Hudi is a rich platform to build streaming data lakes with incremental data pipelines on a self-managing database layer, while being optimized for lake engines and regular batch processing. Hudi maintains a timeline of all actions performed on the table at different instants of time that helps provide instantaneous views of the table, while also efficiently supporting retrieval of data in the order of arrival. A Hudi instant consists of the following components. Hudi provides efficient upserts by mapping a given hoodie key consistently to a file id via an indexing mechanism. This mapping between record key and file group/file id never changes once the first version of a record has been written to a file. In short, the mapped file group contains all versions of a group of records.
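    The key-to-file mapping described above can be sketched in a few lines. This is a toy model, not Hudi's implementation or API: `ToyUpsertIndex`, the round-robin placement of new keys, and the integer commit counter are all assumptions made for illustration. The only property it demonstrates is that a record key is assigned a file id on first write and every later version lands in the same file group.

```python
from collections import defaultdict
from typing import Any, Dict, List, Tuple


class ToyUpsertIndex:
    """Toy record-key -> file-id index (hypothetical, not Hudi's API)."""

    def __init__(self, num_file_groups: int = 4) -> None:
        self.num_file_groups = num_file_groups
        self.key_to_file: Dict[str, str] = {}
        # file id -> list of (record key, commit number, value) versions
        self.file_groups: Dict[str, List[Tuple[str, int, Any]]] = defaultdict(list)
        self.next_commit = 0  # assumed monotonically increasing commit counter

    def upsert(self, record_key: str, value: Any) -> str:
        file_id = self.key_to_file.get(record_key)
        if file_id is None:
            # First write of this key: choose a file group (round-robin is an
            # assumption) and record the mapping. It is never changed afterwards.
            file_id = f"fg-{len(self.key_to_file) % self.num_file_groups}"
            self.key_to_file[record_key] = file_id
        self.next_commit += 1
        # Inserts and updates both append a new version to the mapped group.
        self.file_groups[file_id].append((record_key, self.next_commit, value))
        return file_id

    def versions(self, record_key: str) -> List[Any]:
        """All versions of a record, in write order, from its file group."""
        file_id = self.key_to_file[record_key]
        return [v for k, _, v in self.file_groups[file_id] if k == record_key]


index = ToyUpsertIndex()
first = index.upsert("user-1", {"plan": "free"})
index.upsert("user-2", {"plan": "free"})
second = index.upsert("user-1", {"plan": "pro"})
```

    Because the update to `user-1` is routed through the index, it goes to the file group chosen on the first write, so only that group needs to be rewritten or merged.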
  • 45
    Azure HDInsight
    Run popular open-source frameworks, including Apache Hadoop, Spark, Hive, Kafka, and more, using Azure HDInsight, a customizable, enterprise-grade service for open-source analytics. Effortlessly process massive amounts of data and get all the benefits of the broad open-source project ecosystem with the global scale of Azure. Easily migrate your big data workloads and processing to the cloud. Open-source projects and clusters are easy to spin up quickly without the need to install hardware or manage infrastructure. Big data clusters reduce costs through autoscaling and pricing tiers that allow you to pay for only what you use. Enterprise-grade security and industry-leading compliance with more than 30 certifications help protect your data. Optimized components for open-source technologies such as Hadoop and Spark keep you up to date.
  • 46
    CYRES

    CYRES

    CYRES

    The best solution to guarantee a high level of security on all your equipment & data. Choose Exchange, the most complete and secure business messaging solution on the market. By relying on Cloudera, centralize, process, and analyze your data within flexible Cloud platforms, in an industrialized and secure manner. Launch micro-services architectures with the Docker containerization platform and automate deployment to production with GitLab. Take advantage of our managed services to integrate the AWS or Azure cloud. Deploy your applications in the most efficient environments on the market. Use Veeam Cloud Connect to deploy your disaster recovery and business continuity plans (PRA/PCA) or outsource your virtual machine backups. Your private cloud to respond with agility to the rapid evolution of your business. The cloud benchmark on which millions of companies are already relying to gain agility. A wide range of cloud solutions to create VMs in seconds.
  • 47
    IBM Intelligent Operations Center for Emergency Mgmt
    An incident and emergency management solution for daily operations, emergency and crisis situations. This command, control and communication (C3) solution uses data analytic technologies coupled with social and mobile technology to streamline and integrate preparation, response, recovery and mitigation of daily incidents, emergencies and disasters. IBM works with governments and public safety organizations worldwide to implement public safety technology solutions. Proven preparation techniques use the same technology to manage day-to-day community incidents when responding to crisis situations. This familiarity helps ensure first responders and C3 staff can engage immediately and naturally in response, recovery and mitigation without needing access to special documentation and systems. This incident and emergency management solution integrates and correlates information sources to create a dynamic, near real-time geospatial framework for a common operating picture.
  • 48
    Red Hat JBoss Data Virtualization
    Red Hat JBoss Data Virtualization is a lean, virtual data integration solution that unlocks trapped data and delivers it as easily consumable, unified, and actionable information. Red Hat JBoss Data Virtualization makes data spread across physically diverse systems, such as multiple databases, XML files, and Hadoop systems, appear as a set of tables in a local database. Provides standards-based read/write access to heterogeneous data stores in real time. Speeds application development and integration by simplifying access to distributed data. Integrate and transform data semantics based on data consumer requirements. Provides centralized access control and auditing through a robust security infrastructure. Turn fragmented data into actionable information at the speed your business needs. Red Hat offers support and maintenance over stated time periods for the major versions of JBoss products.
  • 49
    Value Innovation Labs Marketing Automation Platform
    Track your user behavior with powerful analytics. Segment users based on their behavior. Create engagement strategies with powerful AI. OS- and device-level restrictions imposed by certain handset makers limit push notification delivery. With our product, you can bypass those restrictions to reach and engage an additional 20% of users. We ensure higher inbox reach with email consultants and industry experts to help you with the best practices. Avoid sending blast messages that end up in spam, or taint your domain and brand reputation. Localize the communication based on language, seamlessly. Our platform supports multilingual architecture and you can reach out to your customers in the local language for a local touch. Target users with acquisition source, uninstall data and more. Segment users just the way you want. Initiate conversation, reduce churn and do much more with powerful insights.
  • 50
    Value Innovation Labs Enterprise HRMS
    Assign, track, and execute tasks, and track productivity with powerful insight. Automate 100+ related tasks and amplify human interactions with bots, group chat and more. Actionable insights that help Line Managers, HR Professionals & CXOs achieve more. Define organizational structure, assign roles & permissions, grant access rights. Manage your employee life cycle from onboarding to exit, publish letters. Run error-free payroll, manage loans & reimbursements, meet statutory norms. Real-time attendance for managing attendance, holiday calendar, shifts and integration. Meet organizational goals & improve performance with 360-degree feedback. Boost employee morale & improve employee engagement using engagement tools.