Business Software for Azure Data Lake

Top Software that integrates with Azure Data Lake as of August 2025

Compare business software, products, and services to find the best solution for your business or organization. Use the filters on the left to drill down by category, pricing, features, organization size, organization type, region, user reviews, integrations, and more. View and sort the products and solutions that match your needs in the results below.

  • 1
    Microsoft Azure
    Microsoft's Azure is a cloud computing platform that allows for rapid and secure application development, testing and management. Azure. Invent with purpose. Turn ideas into solutions with more than 100 services to build, deploy, and manage applications—in the cloud, on-premises, and at the edge—using the tools and frameworks of your choice. Continuous innovation from Microsoft supports your development today, and your product visions for tomorrow. With a commitment to open source, and support for all languages and frameworks, build how you want, and deploy where you want to. On-premises, in the cloud, and at the edge—we’ll meet you where you are. Integrate and manage your environments with services designed for hybrid cloud. Get security from the ground up, backed by a team of experts, and proactive compliance trusted by enterprises, governments, and startups. The cloud you can trust, with the numbers to prove it.
  • 2
    Microsoft Power BI
    Power BI is a business intelligence platform that enables users to analyze data using AI-driven tools and intuitive report creation. It consolidates data from various sources into OneLake, creating a centralized data source. This platform aids in embedding actionable insights into applications like Microsoft 365, aiding decision-making. Power BI integrates with Microsoft Fabric, enhancing data management. It offers scalability to handle large data volumes and integrates seamlessly with Microsoft services. Its AI capabilities efficiently identify patterns and generate insights. Power BI ensures data security and compliance. Its Copilot feature allows rapid report generation. Additionally, Power BI Pro offers self-service analytics, and its free version includes data modeling and visualization tools. It's known for unified data management, empowering users with accessibility and training resources. Power BI has demonstrated a significant ROI and economic benefit, as evidenced in a Forres
    Leader badge
    Starting Price: $10 per user per month
  • 3
    Visual Studio

    Visual Studio

    Microsoft

    Visual Studio is Microsoft’s comprehensive integrated development environment (IDE) for .NET and C++ developers on Windows. It offers a rich set of tools including code editing, debugging, compiling, and graphical designers to enhance every stage of software development. With powerful AI-powered features like GitHub Copilot, it assists developers in generating, refactoring, and debugging code faster and more accurately. Visual Studio supports building applications for any platform or device, streamlining collaboration with real-time teamwork tools. It integrates advanced debugging tools such as memory analysis and root cause insights to catch issues early. Trusted by millions, Visual Studio accelerates productivity by combining developer control with intelligent assistance.
    Starting Price: $45/user/month
  • 4
    Azure Synapse Analytics
    Azure Synapse is Azure SQL Data Warehouse evolved. Azure Synapse is a limitless analytics service that brings together enterprise data warehousing and Big Data analytics. It gives you the freedom to query data on your terms, using either serverless or provisioned resources—at scale. Azure Synapse brings these two worlds together with a unified experience to ingest, prepare, manage, and serve data for immediate BI and machine learning needs.
  • 5
    Activeeon ProActive
    The solution provided by Activeeon is suited to fit modern challenges such as the growth of data, new infrastructures, cloud strategy evolving, new application architecture, etc. It provides orchestration and scheduling to automate and build a solid base for future growth. ProActive Workflows & Scheduling is a java-based cross-platform workflow scheduler and resource manager that is able to run workflow tasks in multiple languages and multiple environments (Windows, Linux, Mac, Unix, etc). ProActive Resource Manager makes compute resources available for task execution. It handles on-premises and cloud compute resources in an elastic, on-demand and distributed fashion. ProActive AI Orchestration from Activeeon empowers data engineers and data scientists with a simple, portable and scalable solution for machine learning pipelines. It provides pre-built and customizable tasks that enable automation within the machine learning lifecycle, which helps data scientists and IT Operations work.
    Starting Price: $10,000
  • 6
    StarfishETL

    StarfishETL

    StarfishETL

    StarfishETL is an Integration Platform as a Service (iPaaS), and although “integration” is in the name, it’s capable of much more. An iPaaS lives in the cloud and can integrate different systems by using their APIs. This makes it adaptable beyond integration for migration, data governance, and data cleansing. Unlike traditional integration apps, StarfishETL provides low-code mapping and powerful scripting tools to manage, personalize, and manipulate data at scale. Features: - Drag and drop mapping - AI-powered connections - Purpose built integrations - Extensibility through scripting - Secure on-premises connections - Scalable data capacity
    Starting Price: 400/month
  • 7
    Dasera

    Dasera

    Dasera

    Dasera is a Data Security Posture Management (DSPM) platform providing automated security and governance controls for structured and unstructured data across cloud and on-prem environments. Uniquely, Dasera monitors data in use while offering continuous visibility and automated remediation, preventing data breaches across the entire data lifecycle. Dasera provides continuous visibility, risk detection, and mitigation to align with business goals while ensuring seamless integration, unmatched security, and regulatory compliance. Through its deep understanding of the four data variables - data infrastructure, data attributes, data users, and data usage - Dasera promotes a secure data-driven growth strategy that minimizes risk and maximizes value, giving businesses a competitive edge in today's rapidly evolving digital landscape.
    Starting Price: 3 data stores at $20,000
  • 8
    Microsoft Fabric
    Reshape how everyone accesses, manages, and acts on data and insights by connecting every data source and analytics service together—on a single, AI-powered platform. All your data. All your teams. All in one place. Establish an open and lake-centric hub that helps data engineers connect and curate data from different sources—eliminating sprawl and creating custom views for everyone. Accelerate analysis by developing AI models on a single foundation without data movement—reducing the time data scientists need to deliver value. Innovate faster by helping every person in your organization act on insights from within Microsoft 365 apps, such as Microsoft Excel and Microsoft Teams. Responsibly connect people and data using an open and scalable solution that gives data stewards additional control with built-in security, governance, and compliance.
    Starting Price: $156.334/month/2CU
  • 9
    Immuta

    Immuta

    Immuta

    Immuta is the market leader in secure Data Access, providing data teams one universal platform to control access to analytical data sets in the cloud. Only Immuta can automate access to data by discovering, securing, and monitoring data. Data-driven organizations around the world trust Immuta to speed time to data, safely share more data with more users, and mitigate the risk of data leaks and breaches. Founded in 2015, Immuta is headquartered in Boston, MA. Immuta is the fastest way for algorithm-driven enterprises to accelerate the development and control of machine learning and advanced analytics. The company's hyperscale data management platform provides data scientists with rapid, personalized data access to dramatically improve the creation, deployment and auditability of machine learning and AI.
  • 10
    PubSub+ Platform
    Solace PubSub+ Platform helps enterprises design, deploy and manage event-driven systems across hybrid and multi-cloud and IoT environments so they can be more event-driven and operate in real-time. The PubSub+ Platform includes the powerful PubSub+ Event Brokers, event management capabilities with PubSub+ Event Portal, as well as monitoring and integration capabilities all available via a single cloud console. PubSub+ allows easy creation of an event mesh, an interconnected network of event brokers, allowing for seamless and dynamic data movement across highly distributed network environments. PubSub+ Event Brokers can be deployed as fully managed cloud services, self-managed software in private cloud or on-premises environments, or as turnkey hardware appliances for unparalleled performance and low TCO. PubSub+ Event Portal is a complimentary toolset for design and governance of event-driven systems including both Solace and Kafka-based event broker environments.
  • 11
    Electrik.Ai

    Electrik.Ai

    Electrik.Ai

    Automatically ingest marketing data into any data warehouse or cloud file storage of your choice such as BigQuery, Snowflake, Redshift, Azure SQL, AWS S3, Azure Data Lake, Google Cloud Storage with our fully managed ETL pipelines in the cloud. Our hosted marketing data warehouse integrates all your marketing data and provides ad insights, cross-channel attribution, content insights, competitor Insights, and more. Our customer data platform performs identity resolution in real-time across data sources thus enabling a unified view of the customer and their journey. Electrik.AI is a cloud-based marketing analytics software and full-service platform. Electrik.AI’s Google Analytics Hit Data Extractor enriches and extracts the un-sampled hit level data sent to Google Analytics from the website or application and periodically ships it to your desired destination database/data warehouse or file/data lake.
    Starting Price: $49 per month
  • 12
    Vertex AI Notebooks
    Vertex AI Notebooks is a fully managed, scalable solution from Google Cloud that accelerates machine learning (ML) development. It provides a seamless, interactive environment for data scientists and developers to explore data, prototype models, and collaborate in real-time. With integration into Google Cloud’s vast data and ML tools, Vertex AI Notebooks supports rapid prototyping, automated workflows, and deployment, making it easier to scale ML operations. The platform’s support for both Colab Enterprise and Vertex AI Workbench ensures a flexible and secure environment for diverse enterprise needs.
    Starting Price: $10 per GB
  • 13
    Openbridge

    Openbridge

    Openbridge

    Uncover insights to supercharge sales growth using code-free, fully-automated data pipelines to data lakes or cloud warehouses. A flexible, standards-based platform to unify sales and marketing data for automating insights and smarter growth. Say goodbye to messy, expensive manual data downloads. Always know what you’ll pay and only pay for what you use. Fuel your tools with quick access to analytics-ready data. As certified developers, we only work with secure, official APIs. Get started quickly with data pipelines from popular sources. Pre-built, pre-transformed, and ready-to-go data pipelines. Unlock data from Amazon Vendor Central, Amazon Seller Central, Instagram Stories, Facebook, Amazon Advertising, Google Ads, and many others. Code-free data ingestion and transformation processes allow teams to realize value from their data quickly and cost-effectively. Data is always securely stored directly in a trusted, customer-owned data destination like Databricks, Amazon Redshift, etc.
    Starting Price: $149 per month
  • 14
    BigLake

    BigLake

    Google

    BigLake is a storage engine that unifies data warehouses and lakes by enabling BigQuery and open-source frameworks like Spark to access data with fine-grained access control. BigLake provides accelerated query performance across multi-cloud storage and open formats such as Apache Iceberg. Store a single copy of data with uniform features across data warehouses & lakes. Fine-grained access control and multi-cloud governance over distributed data. Seamless integration with open-source analytics tools and open data formats. Unlock analytics on distributed data regardless of where and how it’s stored, while choosing the best analytics tools, open source or cloud-native over a single copy of data. Fine-grained access control across open source engines like Apache Spark, Presto, and Trino, and open formats such as Parquet. Performant queries over data lakes powered by BigQuery. Integrates with Dataplex to provide management at scale, including logical data organization.
    Starting Price: $5 per TB
  • 15
    Streamkap

    Streamkap

    Streamkap

    Streamkap is a streaming data platform that makes streaming as easy as batch. Stream data from database (change data capturee) or event sources to your favorite database, data warehouse or data lake. Streamkap can be deployed as a SaaS or in a bring your own cloud (BYOC) deployment.
    Starting Price: $600 per month
  • 16
    Azure Data Lake Analytics
    Easily develop and run massively parallel data transformation and processing programs in U-SQL, R, Python, and .NET over petabytes of data. With no infrastructure to manage, you can process data on demand, scale instantly, and only pay per job. Process big data jobs in seconds with Azure Data Lake Analytics. There is no infrastructure to worry about because there are no servers, virtual machines, or clusters to wait for, manage, or tune. Instantly scale the processing power, measured in Azure Data Lake Analytics Units (AU), from one to thousands for each job. You only pay for the processing that you use per job. Act on all of your data with optimized data virtualization of your relational sources such as Azure SQL Database and Azure Synapse Analytics. Your queries are automatically optimized by moving processing close to the source data without data movement, which maximizes performance and minimizes latency.
    Starting Price: $2 per hour
  • 17
    Dimodelo

    Dimodelo

    Dimodelo

    Stay focused on delivering valuable and impressive reporting, analytics and insights, instead of being stuck in data warehouse code. Don’t let your data warehouse become a jumble of 100’s of hard-to-maintain pipelines, notebooks, stored procedures, tables. and views etc. Dimodelo DW Studio dramatically reduces the effort required to design, build, deploy and run a data warehouse. Design, generate and deploy a data warehouse targeting Azure Synapse Analytics. Generating a best practice architecture utilizing Azure Data Lake, Polybase and Azure Synapse Analytics, Dimodelo Data Warehouse Studio delivers a high-performance, modern data warehouse in the cloud. Utilizing parallel bulk loads and in-memory tables, Dimodelo Data Warehouse Studio generates a best practice architecture that delivers a high-performance, modern data warehouse in the cloud.
    Starting Price: $899 per month
  • 18
    Hyper Historian
    ICONICS’ Hyper Historian™ is an advanced 64-bit high-speed, reliable, and robust historian. Designed for the most mission-critical applications, Hyper Historian's advanced high compression algorithm delivers unparalleled performance with very efficient use of resources. Hyper Historian integrates with our ISA-95-compliant asset database and the latest big data technologies, including Azure SQL, Microsoft Data Lakes, Kafka, and Hadoop. This makes Hyper Historian the most efficient and secure real-time plant historian for any Microsoft operating system. Hyper Historian includes a module for automatic or manual insertion of data, empowering users to import historical or log data from databases, other historians, or intermittently connected field devices and equipment. This also provides for greatly increased reliability in capturing all data, even when network disruptions occur. Leverage rapid collection for enterprise-wide storage.
  • 19
    Azure Virtual Machines
    Migrate your business- and mission-critical workloads to Azure infrastructure and improve operational efficiency. Run SQL Server, SAP, Oracle® software and high-performance computing applications on Azure Virtual Machines. Choose your favorite Linux distribution or Windows Server. Deploy virtual machines featuring up to 416 vCPUs and 12 TB of memory. Get up to 3.7 million local storage IOPS per VM. Take advantage of up to 30 Gbps Ethernet and cloud’s first deployment of 200 Gbps InfiniBand. Select the underlying processors – AMD, Ampere (Arm-based), or Intel - that best meet your requirements. Encrypt sensitive data, protect VMs from malicious threats, secure network traffic, and meet regulatory and compliance requirements. Use Virtual Machine Scale Sets to build scalable applications. Reduce your cloud spend with Azure Spot Virtual Machines and reserved instances. Build your private cloud with Azure Dedicated Host. Run mission-critical applications in Azure to increase resiliency.
  • 20
    Azure Data Factory
    Integrate data silos with Azure Data Factory, a service built for all data integration needs and skill levels. Easily construct ETL and ELT processes code-free within the intuitive visual environment, or write your own code. Visually integrate data sources using more than 90+ natively built and maintenance-free connectors at no added cost. Focus on your data—the serverless integration service does the rest. Data Factory provides a data integration and transformation layer that works across your digital transformation initiatives. Data Factory can help independent software vendors (ISVs) enrich their SaaS apps with integrated hybrid data as to deliver data-driven user experiences. Pre-built connectors and integration at scale enable you to focus on your users while Data Factory takes care of the rest.
  • 21
    Apache Atlas

    Apache Atlas

    Apache Software Foundation

    Atlas is a scalable and extensible set of core foundational governance services – enabling enterprises to effectively and efficiently meet their compliance requirements within Hadoop and allows integration with the whole enterprise data ecosystem. Apache Atlas provides open metadata management and governance capabilities for organizations to build a catalog of their data assets, classify and govern these assets and provide collaboration capabilities around these data assets for data scientists, analysts and the data governance team. Pre-defined types for various Hadoop and non-Hadoop metadata. Ability to define new types for the metadata to be managed. Types can have primitive attributes, complex attributes, object references; can inherit from other types. Instances of types, called entities, capture metadata object details and their relationships. REST APIs to work with types and instances allow easier integration.
  • 22
    Privacera

    Privacera

    Privacera

    At the intersection of data governance, privacy, and security, Privacera’s unified data access governance platform maximizes the value of data by providing secure data access control and governance across hybrid- and multi-cloud environments. The hybrid platform centralizes access and natively enforces policies across multiple cloud services—AWS, Azure, Google Cloud, Databricks, Snowflake, Starburst and more—to democratize trusted data enterprise-wide without compromising compliance with regulations such as GDPR, CCPA, LGPD, or HIPAA. Trusted by Fortune 500 customers across finance, insurance, retail, healthcare, media, public and the federal sector, Privacera is the industry’s leading data access governance platform that delivers unmatched scalability, elasticity, and performance. Headquartered in Fremont, California, Privacera was founded in 2016 to manage cloud data privacy and security by the creators of Apache Ranger™ and Apache Atlas™.
  • 23
    Microsoft Power Query
    Power Query is the easiest way to connect, extract, transform and load data from a wide range of sources. Power Query is a data transformation and data preparation engine. Power Query comes with a graphical interface for getting data from sources and a Power Query Editor for applying transformations. Because the engine is available in many products and services, the destination where the data will be stored depends on where Power Query was used. Using Power Query, you can perform the extract, transform, and load (ETL) processing of data. Microsoft’s Data Connectivity and Data Preparation technology that lets you seamlessly access data stored in hundreds of sources and reshape it to fit your needs—all with an easy to use, engaging, no-code experience. Power Query supports hundreds of data sources with built-in connectors, generic interfaces (such as REST APIs, ODBC, OLE, DB and OData) and the Power Query SDK to build your own connectors.
  • 24
    Okera

    Okera

    Okera

    Okera, the Universal Data Authorization company, helps modern, data-driven enterprises accelerate innovation, minimize data security risks, and demonstrate regulatory compliance. The Okera Dynamic Access Platform automatically enforces universal fine-grained access control policies. This allows employees, customers, and partners to use data responsibly, while protecting them from inappropriately accessing data that is confidential, personally identifiable, or regulated. Okera’s robust audit capabilities and data usage intelligence deliver the real-time and historical information that data security, compliance, and data delivery teams need to respond quickly to incidents, optimize processes, and analyze the performance of enterprise data initiatives. Okera began development in 2016 and now dynamically authorizes access to hundreds of petabytes of sensitive data for the world’s most demanding F100 companies and regulatory agencies. The company is headquartered in San Francisco.
  • 25
    Apache Pinot

    Apache Pinot

    Apache Corporation

    Pinot is designed to answer OLAP queries with low latency on immutable data. Pluggable indexing technologies - Sorted Index, Bitmap Index, Inverted Index. Joins are currently not supported, but this problem can be overcome by using Trino or PrestoDB for querying. SQL like language that supports selection, aggregation, filtering, group by, order by, distinct queries on data. Consist of of both offline and real-time table. Use real-time table only to cover segments for which offline data may not be available yet. Detect the right anomalies by customizing anomaly detect flow and notification flow.
  • 26
    Apache Hudi

    Apache Hudi

    Apache Corporation

    Hudi is a rich platform to build streaming data lakes with incremental data pipelines on a self-managing database layer, while being optimized for lake engines and regular batch processing. Hudi maintains a timeline of all actions performed on the table at different instants of time that helps provide instantaneous views of the table, while also efficiently supporting retrieval of data in the order of arrival. A Hudi instant consists of the following components. Hudi provides efficient upserts, by mapping a given hoodie key consistently to a file id, via an indexing mechanism. This mapping between record key and file group/file id, never changes once the first version of a record has been written to a file. In short, the mapped file group contains all versions of a group of records.
  • 27
    Azure HDInsight
    Run popular open-source frameworks—including Apache Hadoop, Spark, Hive, Kafka, and more—using Azure HDInsight, a customizable, enterprise-grade service for open-source analytics. Effortlessly process massive amounts of data and get all the benefits of the broad open-source project ecosystem with the global scale of Azure. Easily migrate your big data workloads and processing to the cloud. Open-source projects and clusters are easy to spin up quickly without the need to install hardware or manage infrastructure. Big data clusters reduce costs through autoscaling and pricing tiers that allow you to pay for only what you use. Enterprise-grade security and industry-leading compliance with more than 30 certifications helps protect your data. Optimized components for open-source technologies such as Hadoop and Spark keep you up to date.
  • 28
    Azure Data Lake Storage
    Eliminate data silos with a single storage platform. Optimize costs with tiered storage and policy management. Authenticate data using Azure Active Directory (Azure AD) and role-based access control (RBAC). And help protect data with security features like encryption at rest and advanced threat protection. Highly secure with flexible mechanisms for protection across data access, encryption, and network-level control. Single storage platform for ingestion, processing, and visualization that supports the most common analytics frameworks. Cost optimization via independent scaling of storage and compute, lifecycle policy management, and object-level tiering. Meet any capacity requirements and manage data with ease, with the Azure global infrastructure. Run large-scale analytics queries at consistently high performance.
  • 29
    Azure Databricks
    Unlock insights from all your data and build artificial intelligence (AI) solutions with Azure Databricks, set up your Apache Spark™ environment in minutes, autoscale, and collaborate on shared projects in an interactive workspace. Azure Databricks supports Python, Scala, R, Java, and SQL, as well as data science frameworks and libraries including TensorFlow, PyTorch, and scikit-learn. Azure Databricks provides the latest versions of Apache Spark and allows you to seamlessly integrate with open source libraries. Spin up clusters and build quickly in a fully managed Apache Spark environment with the global scale and availability of Azure. Clusters are set up, configured, and fine-tuned to ensure reliability and performance without the need for monitoring. Take advantage of autoscaling and auto-termination to improve total cost of ownership (TCO).
  • 30
    Evvox

    Evvox

    Evvox

    Evvox is an end-to-end platform enabled for the insurance industry. Our North Star is to help you identify new channels of growth and increase your business. We make this possible through a combination of advanced analytics, AI/ML technology, intelligent routing, customer segmentation, customer lifetime value, cross-sell/up-sell, and much more all ready for your organization out of the box. Our data platforms’ biggest advantage is the ease with which new data can be not only ingested but integrated into the existing models and showcased in a manner relevant to you. We walk the walk. Our company has 25 years of ingestion experience. We’ve optimized and focused on it to speed up new data times 5x-10x. We bring the data models and the external data. You bring what you can and together we build the foundation for insights. Insights from your data are siphoned directly to your sales team. Equip your team to deliver the most value to your customers and sign new business.
  • Previous
  • You're on page 1
  • 2
  • Next