41 Integrations with Secoda

View a list of Secoda integrations and software that integrates with Secoda below. Compare the best Secoda integrations as well as features, ratings, user reviews, and pricing of software that integrates with Secoda. Here are the current Secoda integrations in 2024:

  • 1
    Google Cloud BigQuery
    BigQuery is a serverless, multicloud data warehouse that simplifies the process of working with all types of data so you can focus on getting valuable business insights quickly. At the core of Google’s data cloud, BigQuery allows you to simplify data integration, cost effectively and securely scale analytics, share rich data experiences with built-in business intelligence, and train and deploy ML models with a simple SQL interface, helping to make your organization’s operations more data-driven.
    Starting Price: $0.04 per slot hour
    View Software
    Visit Website
  • 2
    Looker

    Looker

    Google

    Looker, Google Cloud’s business intelligence platform, enables you to chat with your data. Organizations turn to Looker for self-service and governed BI, to build custom applications with trusted metrics, or to bring Looker modeling to their existing environment. The result is improved data engineering efficiency and true business transformation. Looker is reinventing business intelligence for the modern company. Looker works the way the web does: browser-based, its unique modeling language lets any employee leverage the work of your best data analysts. Operating 100% in-database, Looker capitalizes on the newest, fastest analytic databases—to get real results, in real time.
    Leader badge
    View Software
    Visit Website
  • 3
    Salesforce

    Salesforce

    Salesforce

    Salesforce helps sales teams accelerate their performance and achieve their goals. Salesforce is the world's most-used CRM, and so much more. With Salesforce, teams get access to tools that allow them to grow their accounts, find new customers, and close deals—faster and from anywhere. It offers a wealth of features that include contact management, opportunity management, lead management, email integration, reports and dashboards, sales forecasting, files sync and share, and so much more. Spend less time digging around in spreadsheets and more time running your business. No hardware, no software, no hassle. Our simple setup assistant will have you streamlining sales and answering customer questions in just minutes. Give buyers seamless, personalized experiences by connecting data across sales, service, and marketing. See a complete view of the customer — their account, activity history, and connections. Pull in social data for a deeper view of your customers.
    Leader badge
    Starting Price: $25.00/month/user
  • 4
    Slack

    Slack

    Slack

    Slack is a cloud-based project collaboration and team interaction software solution specially designed to seamlessly facilitate communication across organizations. Featuring powerful tools and services integrated into a single platform, Slack provides private channels to promote interaction within smaller teams, direct channels to help send messages directly to colleagues, and public channels that enables members across organizations to start conversations. Available on Mac, Windows, Android, and iOS apps, Slack offers a plethora of features that include chat, file sharing, collaborative workspace, real-time notifications, two-way audio and video, screen sharing, document imaging, activity tracking and logging, and more.
    Leader badge
    Starting Price: $6.67 per user per month
  • 5
    Jira

    Jira

    Atlassian

    Jira is the only project management tool you need to plan and track work across every team. Jira by Atlassian is the #1 software development tool for teams planning and building great products. Trusted by thousands of teams, Jira offers access to a wide range of tools for planning, tracking, and releasing world-class software, capturing and organizing issues, assigning work, and following team activity. It also integrates with leading developer tools for end-to-end traceability. From short projects, to large cross-functional programs, Jira helps break big ideas down into achievable steps. Organize work, create milestones, map dependencies and more. Link work to goals so everyone can see how their work contributes to company objectives and stay aligned to what’s important. Your next move, suggested by AI. Atlassian Intelligence takes your big ideas and automatically suggests the tasks to help get it done.
    Leader badge
    Starting Price: Free
  • 6
    MongoDB

    MongoDB

    MongoDB

    MongoDB is a general purpose, document-based, distributed database built for modern application developers and for the cloud era. No database is more productive to use. Ship and iterate 3–5x faster with our flexible document data model and a unified query interface for any use case. Whether it’s your first customer or 20 million users around the world, meet your performance SLAs in any environment. Easily ensure high availability, protect data integrity, and meet the security and compliance standards for your mission-critical workloads. An integrated suite of cloud database services that allow you to address a wide variety of use cases, from transactional to analytical, from search to data visualizations. Launch secure mobile apps with native, edge-to-cloud sync and automatic conflict resolution. Run MongoDB anywhere, from your laptop to your data center.
    Leader badge
    Starting Price: Free
  • 7
    Confluence

    Confluence

    Atlassian

    Confluence by Atlassian is a leading content collaboration software for modern teams. It empowers teams to create, share, and collaborate on projects in one place, easily publish and access company information, capture, store and grow team's knowledge, and so much more. Confluence is also available on mobile, enabling teams to track team activity, give feedback, and stay in sync regardless of device.
    Leader badge
    Starting Price: $10.00/month
  • 8
    Mixpanel

    Mixpanel

    Mixpanel

    At Mixpanel, our mission is to increase the rate of innovation. Not only as a company, but for the businesses we serve. Through our analytics and engagement product, companies can analyze how and why their users engage, convert, and retain in real-time across web, mobile, and smart devices. Then they can use that data to improve their business and products. Mixpanel serves over 26,000 companies from different industries around the world, including Samsung, Twitter, and BMW. Headquartered in San Francisco, Mixpanel has offices in New York, Seattle, Austin, London, Barcelona, Paris, and Singapore. Great products are built by teams who know their users. Go beneath the surface to learn which features are popular, who your power users are, and the behaviors tied to long-term retention. See which features are popular and how many power users you have.
    Leader badge
    Starting Price: $89 per month
  • 9
    Microsoft Power BI
    Power BI is a business intelligence platform that enables users to analyze data using AI-driven tools and intuitive report creation. It consolidates data from various sources into OneLake, creating a centralized data source. This platform aids in embedding actionable insights into applications like Microsoft 365, aiding decision-making. Power BI integrates with Microsoft Fabric, enhancing data management. It offers scalability to handle large data volumes and integrates seamlessly with Microsoft services. Its AI capabilities efficiently identify patterns and generate insights. Power BI ensures data security and compliance. Its Copilot feature allows rapid report generation. Additionally, Power BI Pro offers self-service analytics, and its free version includes data modeling and visualization tools. It's known for unified data management, empowering users with accessibility and training resources. Power BI has demonstrated a significant ROI and economic benefit, as evidenced in a Forres
    Leader badge
    Starting Price: $10 per user per month
  • 10
    Tableau

    Tableau

    Tableau

    Gain, generate, and analyze business data and meaningful insights with Tableau, an integrated business intelligence (BI) and analytics solution. With Tableau, users are able to collect data from different sources such as spreadsheets, SQL databases, Salesforce, and cloud apps. Tableau provides users with real-time visual analytics and interactive dashboard that enables them to slice and dice datasets for making relevant insights and look for new opportunities. Tableau also allows users to customize the platform to serve different kinds of industry verticals like banking, communication, and more.
  • 11
    MySQL

    MySQL

    Oracle

    MySQL is the world's most popular open source database. With its proven performance, reliability, and ease-of-use, MySQL has become the leading database choice for web-based applications, used by high profile web properties including Facebook, Twitter, YouTube, and all five of the top five websites*. Additionally, it is an extremely popular choice as embedded database, distributed by thousands of ISVs and OEMs.
  • 12
    Amazon S3
    Amazon Simple Storage Service (Amazon S3) is an object storage service that offers industry-leading scalability, data availability, security, and performance. This means customers of all sizes and industries can use it to store and protect any amount of data for a range of use cases, such as data lakes, websites, mobile applications, backup and restore, archive, enterprise applications, IoT devices, and big data analytics. Amazon S3 provides easy-to-use management features so you can organize your data and configure finely-tuned access controls to meet your specific business, organizational, and compliance requirements. Amazon S3 is designed for 99.999999999% (11 9's) of durability, and stores data for millions of applications for companies all around the world. Scale your storage resources up and down to meet fluctuating demands, without upfront investments or resource procurement cycles. Amazon S3 is designed for 99.999999999% (11 9’s) of data durability.
  • 13
    Snowflake

    Snowflake

    Snowflake

    Your cloud data platform. Secure and easy access to any data with infinite scalability. Get all the insights from all your data by all your users, with the instant and near-infinite performance, concurrency and scale your organization requires. Seamlessly share and consume shared data to collaborate across your organization, and beyond, to solve your toughest business problems in real time. Boost the productivity of your data professionals and shorten your time to value in order to deliver modern and integrated data solutions swiftly from anywhere in your organization. Whether you’re moving data into Snowflake or extracting insight out of Snowflake, our technology partners and system integrators will help you deploy Snowflake for your success.
    Starting Price: $40.00 per month
  • 14
    SQL Server

    SQL Server

    Microsoft

    Intelligence and security are built into Microsoft SQL Server 2019. You get extras without extra cost, along with best-in-class performance and flexibility for your on-premises needs. Take advantage of the efficiency and agility of the cloud by easily migrating to the cloud without changing code. Unlock insights and make predictions faster with Azure. Develop using the technology of your choice, including open source, backed by Microsoft's innovations. Easily integrate data into your apps and use a rich set of cognitive services to build human-like intelligence across any scale of data. AI is native to the data platform—you can unlock insights faster from all your data, on-premises and in the cloud. Combine your unique enterprise data and the world's data to build an intelligence-driven organization. Work with a flexible data platform that gives you a consistent experience across platforms and gets your innovations to market faster—you can build your apps and then deploy anywhere.
    Starting Price: $1 one-time payment
  • 15
    Google Data Studio
    Unlock the power of your data with interactive dashboards and beautiful reports that inspire smarter business decisions. It's easy and free. Easily access a wide variety of data. Data Studio’s built-in and partner connectors makes it possible to connect to virtually any kind of data. Turn your data into compelling stories of data visualization art. Quickly build interactive reports and dashboards with Data Studio’s web based reporting tools. Share your reports and dashboards with individuals, teams, or the world. Collaborate in real time. Embed your report on any web page.
  • 16
    Retool

    Retool

    Retool

    Retool is an application development platform that enables developers to combine the benefits of traditional software development with a drag-and-drop UI editor and AI to build internal tools radically faster. Building in Retool fits with how you develop software today—deploy it anywhere, connect to any internal service, import your libraries, debug with your toolchain, and share it securely to users—ensuring good and well-governed software by default. Retool is used by industry leaders such as Amazon, American Express, DoorDash, OpenAI, and Mercedes Benz for mission critical custom software across operations, billing, and customer support.
    Starting Price: $10 per user per month
  • 17
    Azure Synapse Analytics
    Azure Synapse is Azure SQL Data Warehouse evolved. Azure Synapse is a limitless analytics service that brings together enterprise data warehousing and Big Data analytics. It gives you the freedom to query data on your terms, using either serverless or provisioned resources—at scale. Azure Synapse brings these two worlds together with a unified experience to ingest, prepare, manage, and serve data for immediate BI and machine learning needs.
  • 18
    Stitch

    Stitch

    Talend

    Stitch is a cloud-based platform for ETL – extract, transform, and load. More than a thousand companies use Stitch to move billions of records every day from SaaS applications and databases into data warehouses and data lakes.
  • 19
    Amplitude

    Amplitude

    Amplitude

    Build products that drive outcomes. Amplitude is the product intelligence platform that helps teams convert, engage, and retain customers. Teams building digital products use Amplitude to better understand user behavior, ship improved experiences, and retain more customers. Gain a deeper understanding of how customers experience your digital products. Empower teams to ship faster, measure impact, and visualize user journeys. Personalize product experiences to drive engagement, conversion, and loyalty. Product intelligence gives teams the data and insights they need to build great product experiences—and do it at scale. Use self-serve analytics to understand what happens, why, and how to improve your product. Align on decisions and integrate with your existing workflows and tech stack to make changes, fast.
  • 20
    Apache Hive

    Apache Hive

    Apache Software Foundation

    The Apache Hive data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage using SQL. Structure can be projected onto data already in storage. A command line tool and JDBC driver are provided to connect users to Hive. Apache Hive is an open source project run by volunteers at the Apache Software Foundation. Previously it was a subproject of Apache® Hadoop®, but has now graduated to become a top-level project of its own. We encourage you to learn about the project and contribute your expertise. Traditional SQL queries must be implemented in the MapReduce Java API to execute SQL applications and queries over distributed data. Hive provides the necessary SQL abstraction to integrate SQL-like queries (HiveQL) into the underlying Java without the need to implement queries in the low-level Java API.
  • 21
    Amazon Redshift
    More customers pick Amazon Redshift than any other cloud data warehouse. Redshift powers analytical workloads for Fortune 500 companies, startups, and everything in between. Companies like Lyft have grown with Redshift from startups to multi-billion dollar enterprises. No other data warehouse makes it as easy to gain new insights from all your data. With Redshift you can query petabytes of structured and semi-structured data across your data warehouse, operational database, and your data lake using standard SQL. Redshift lets you easily save the results of your queries back to your S3 data lake using open formats like Apache Parquet to further analyze from other analytics services like Amazon EMR, Amazon Athena, and Amazon SageMaker. Redshift is the world’s fastest cloud data warehouse and gets faster every year. For performance intensive workloads you can use the new RA3 instances to get up to 3x the performance of any cloud data warehouse.
    Starting Price: $0.25 per hour
  • 22
    Redash

    Redash

    Redash

    Connect and query your data sources, build dashboards to visualize data and share them with your company. Enjoy the power and comfort of a SQL client with the collaborative advantages of a cloud based service. Easily visualize your results in various formats: chart, cohort, pivot table, boxplot, map, counter, sankey, sunburst and word cloud. Share your data-story with colleagues, other teams or external partners. Access Redash via API and extend its functionality as you like. SSO, access control and many other great features for enterprise-friendly workflow.
    Starting Price: $29 per month
  • 23
    Hightouch

    Hightouch

    Hightouch

    Your data warehouse is your source of truth for customer data. Hightouch syncs this data to the tools that your business teams rely on. Serve your sales, marketing, and customer success teams a 360˚ view of the customer in the tools they rely on. Eliminate tedious data requests. Hightouch operationalizes your data warehouse to turn analytics into action. Drive growth with better data. Run personalized campaigns across all email, push, and ads. Iterate without asking for engineering favors. Increase revenue with better data. Target leads using custom PQL and MQL models. Sync a single view of the customer to your CRM. Prevent churn with better data. Build a 360° customer view in your CS CRMs. Use customer data proactively identify “at risk” customers. Your data warehouse has all your data. Why stop at analytics? Hightouch operationalizes your data warehouse by empowering you to sync data to any SaaS tool with just SQL.
    Starting Price: $350 per month
  • 24
    dbt

    dbt

    dbt Labs

    Version control, quality assurance, documentation and modularity allow data teams to collaborate like software engineering teams. Analytics errors should be treated with the same level of urgency as bugs in a production product. Much of an analytic workflow is manual. We believe workflows should be built to execute with a single command. Data teams use dbt to codify business logic and make it accessible to the entire organization—for use in reporting, ML modeling, and operational workflows. Built-in CI/CD ensures that changes to data models move appropriately through development, staging, and production environments. dbt Cloud also provides guaranteed uptime and custom SLAs.
    Starting Price: $50 per user per month
  • 25
    Airbyte

    Airbyte

    Airbyte

    Get all your ELT data pipelines running in minutes, even your custom ones. Let your team focus on insights and innovation. Unify your data integration pipelines in one open-source ELT platform. Airbyte addresses all your data team's connector needs, however custom they are and whatever your scale. The data integration platform that can scale with your custom or high-volume needs. From high-volume databases to the long tail of API sources. Leverage Airbyte’s long tail of high-quality connectors that adapt to schema and API changes. Extensible to unify all native & custom ELT. Edit pre-built open-source connectors, or build new ones with our connector development kit in a few hours. Transparent and scalable pricing. Finally, a transparent and predictable cost-based pricing that scales with your data needs. You don’t need to worry about volume anymore. No more need for custom systems for your in-house scripts or database replication.
    Starting Price: $2.50 per credit
  • 26
    Metaplane

    Metaplane

    Metaplane

    Monitor your entire warehouse in 30 minutes. Identify downstream impact with automated warehouse-to-BI lineage. Trust takes seconds to lose and months to regain. Gain peace of mind with observability built for the modern data era. Code-based tests take hours to write and maintain, so it's hard to achieve the coverage you need. In Metaplane, you can add hundreds of tests within minutes. We support foundational tests (e.g. row counts, freshness, and schema drift), more complex tests (distribution drift, nullness shifts, enum changes), custom SQL, and everything in between. Manual thresholds take a long time to set and quickly go stale as your data changes. Our anomaly detection models learn from historical metadata to automatically detect outliers. Monitor what matters, all while accounting for seasonality, trends, and feedback from your team to minimize alert fatigue. Of course, you can override with manual thresholds, too.
    Starting Price: $825 per month
  • 27
    Mode

    Mode

    Mode Analytics

    Understand how users are interacting with your product and identify opportunity areas to inform your product decisions. Mode empowers one Stitch analyst to do the work of a full data team through speed, flexibility, and collaboration. Build dashboards for annual revenue, then use chart visualizations to identify anomalies quickly. Create polished, investor-ready reports or share analysis with teams for collaboration. Connect your entire tech stack to Mode and identify upstream issues to improve performance. Speed up workflows across teams with APIs and webhooks. Understand how users are interacting with your product and identify opportunity areas to inform your product decisions. Leverage marketing and product data to fix weak spots in your funnel, improve landing-page performance, and understand churn before it happens.
  • 28
    Databricks Data Intelligence Platform
    The Databricks Data Intelligence Platform allows your entire organization to use data and AI. It’s built on a lakehouse to provide an open, unified foundation for all data and governance, and is powered by a Data Intelligence Engine that understands the uniqueness of your data. The winners in every industry will be data and AI companies. From ETL to data warehousing to generative AI, Databricks helps you simplify and accelerate your data and AI goals. Databricks combines generative AI with the unification benefits of a lakehouse to power a Data Intelligence Engine that understands the unique semantics of your data. This allows the Databricks Platform to automatically optimize performance and manage infrastructure in ways unique to your business. The Data Intelligence Engine understands your organization’s language, so search and discovery of new data is as easy as asking a question like you would to a coworker.
  • 29
    Fivetran

    Fivetran

    Fivetran

    Fivetran is the smartest way to replicate data into your warehouse. We've built the only zero-maintenance pipeline, turning months of on-going development into a 5-minute setup. Our connectors bring data from applications and databases into one central location so that analysts can unlock profound insights about their business. Schema designs and ERDs make synced data immediately usable. Transform data into analytics-ready tables as soon as it’s loaded into your warehouse. Spend less time writing transformation code with our out-of-the-box data modeling. Connect to any git repository and manage dbt models directly from Fivetran. Develop and deliver your product with the utmost confidence in ours. Uptime and data delivery guarantees ensure your customers’ data never goes stale. Troubleshoot fast with a global team of Support Specialists.
  • 30
    Quicksight

    Quicksight

    Amazon Web Services

    Amazon QuickSight allows everyone in your organization to understand your data by asking questions in natural language, exploring through interactive dashboards, or automatically looking for patterns and outliers powered by machine learning. QuickSight powers millions of dashboard views weekly for customers such as the NFL, Expedia, Volvo, Thomson Reuters, Best Western and Comcast, allowing their end-users to make better data-driven decisions. Ask conversational questions of your data and use Q’s ML-powered engine to receive relevant visualizations without the time-consuming data preparation from authors and admins. Discover hidden insights from your data, perform accurate forecasting and what-if analysis, or add easy-to-understand natural language narratives to dashboards by leveraging AWS' expertise in machine learning. Easily embed interactive visualizations and dashboards, sophisticated dashboard authoring, or natural language query capabilities in your applications.
  • 31
    Sigma

    Sigma

    Sigma Computing

    Sigma is a modern business intelligence (BI) and analytics application built for the cloud. Trusted by data-first companies, Sigma provides live access to cloud data warehouses using an intuitive spreadsheet interface empowering business experts to ask more of their data without writing a single line of code. With the full power of SQL, the cloud, and a familiar interface, business users have the freedom to analyze data in real time without limits. Sigma is self-service analytics as it was meant to be.
  • 32
    AWS Glue

    AWS Glue

    Amazon

    AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development. AWS Glue provides all the capabilities needed for data integration so that you can start analyzing your data and putting it to use in minutes instead of months. Data integration is the process of preparing and combining data for analytics, machine learning, and application development. It involves multiple tasks, such as discovering and extracting data from various sources; enriching, cleaning, normalizing, and combining data; and loading and organizing data in databases, data warehouses, and data lakes. These tasks are often handled by different types of users that each use different products. AWS Glue runs in a serverless environment. There is no infrastructure to manage, and AWS Glue provisions, configures, and scales the resources required to run your data integration jobs.
  • 33
    Census

    Census

    Census

    Census is the operational analytics platform that syncs your data warehouse with all your favorite apps. Get your customer success, sales & marketing teams on the same page by keeping customer data in sync. No engineering favors required. Census automatically publishes SQL & dbt models from your warehouse without a line of code. Stop deciphering external APIs and maintaining custom scripts, just focus on the business outcomes. You don't want "yet another source of truth" when everything is already in your warehouse. Census works on top of your existing infrastructure. Just pick a destination app, map the data, and voila, you are done. Your data shouldn't be stuck in quarterly reports. Census empowers everyone in your business to take action. Live metrics in every app means better business operations, which leads to happier users & more revenue.
  • 34
    Dataforma

    Dataforma

    Dataforma

    Dataforma is the most inclusive system for service management professionals, including roofers, HVAC technicians, and other remote contractors. Run a completely paperless service department so you can focus on: winning more bids, increasing productivity, working remotely on-site, improving client satisfaction. Dataforma was created by a roofing contractor to provide a user-friendly, robust software to the roofing industry without the price tag of a custom-built program. With our Dataforma CRM, you can revolutionize your service department from start to finish, use a single paperless system from the initial call all the way through invoicing and never lose track of a call or customer’s history again. As a busy contractor, you should never have to spend time searching through stacks of unfiled papers. With Dataforma, improve the organization of your operations by storing documents in an online database that’s simple to use and accessible from any location.
  • 35
    Metabase

    Metabase

    Metabase

    Meet the easy, open source way for everyone in your company to ask questions and learn from data. Connect to your data and get it in front of your team. Dashboards (like this one) are easy to build, share, and explore. Anyone on your team can get answers to questions about your data with just a few clicks, whether it's the CEO or Customer Support. When the questions get more complicated, SQL and our notebook editor are there for the data savvy. Visual joins, multiple aggregations and filtering steps give you the tools to dig deeper into your data. Add variables to your queries to create interactive visualizations that users can tweak and explore. Set up alerts and scheduled reports to get the right data in front of the right people at the right time. Start in a couple clicks with the hosted version, or use Docker to get up and running on your own for free. Connect to your existing data, invite your team, and you have a BI solution that would usually take a sales call.
  • 36
    Oracle Cloud Infrastructure
    Oracle Cloud Infrastructure supports traditional workloads and delivers modern cloud development tools. It is architected to detect and defend against modern threats, so you can innovate more. Combine low cost with high performance to lower your TCO. Oracle Cloud is a Generation 2 enterprise cloud that delivers powerful compute and networking performance and includes a comprehensive portfolio of infrastructure and platform cloud services. Built from the ground up to meet the needs of mission-critical applications, Oracle Cloud supports all legacy workloads while delivering modern cloud development tools, enabling enterprises to bring their past forward as they build their future. Our Generation 2 Cloud is the only one built to run Oracle Autonomous Database, the industry's first and only self-driving database. Oracle Cloud offers a comprehensive cloud computing portfolio, from application development and business analytics to data management, integration, security, AI & blockchain.
  • 37
    PostgreSQL

    PostgreSQL

    PostgreSQL Global Development Group

    PostgreSQL is a powerful, open-source object-relational database system with over 30 years of active development that has earned it a strong reputation for reliability, feature robustness, and performance. There is a wealth of information to be found describing how to install and use PostgreSQL through the official documentation. The open-source community provides many helpful places to become familiar with PostgreSQL, discover how it works, and find career opportunities. Learm more on how to engage with the community. The PostgreSQL Global Development Group has released an update to all supported versions of PostgreSQL, including 15.1, 14.6, 13.9, 12.13, 11.18, and 10.23. This release fixes 25 bugs reported over the last several months. This is the final release of PostgreSQL 10. PostgreSQL 10 will no longer receive security and bug fixes. If you are running PostgreSQL 10 in a production environment, we suggest that you make plans to upgrade.
  • 38
    Delta Lake

    Delta Lake

    Delta Lake

    Delta Lake is an open-source storage layer that brings ACID transactions to Apache Spark™ and big data workloads. Data lakes typically have multiple data pipelines reading and writing data concurrently, and data engineers have to go through a tedious process to ensure data integrity, due to the lack of transactions. Delta Lake brings ACID transactions to your data lakes. It provides serializability, the strongest level of isolation level. Learn more at Diving into Delta Lake: Unpacking the Transaction Log. In big data, even the metadata itself can be "big data". Delta Lake treats metadata just like data, leveraging Spark's distributed processing power to handle all its metadata. As a result, Delta Lake can handle petabyte-scale tables with billions of partitions and files at ease. Delta Lake provides snapshots of data enabling developers to access and revert to earlier versions of data for audits, rollbacks or to reproduce experiments.
  • 39
    Datafold

    Datafold

    Datafold

    Prevent data outages by identifying and fixing data quality issues before they get into production. Go from 0 to 100% test coverage of your data pipelines in a day. Know the impact of each code change with automatic regression testing across billions of rows. Automate change management, improve data literacy, achieve compliance, and reduce incident response time. Don’t let data incidents take you by surprise. Be the first one to know with automated anomaly detection. Datafold’s easily adjustable ML model adapts to seasonality and trend patterns in your data to construct dynamic thresholds. Save hours spent on trying to understand data. Use the Data Catalog to find relevant datasets, fields, and explore distributions easily with an intuitive UI. Get interactive full-text search, data profiling, and consolidation of metadata in one place.
  • 40
    Great Expectations

    Great Expectations

    Great Expectations

    Great Expectations is a shared, open standard for data quality. It helps data teams eliminate pipeline debt, through data testing, documentation, and profiling. We recommend deploying within a virtual environment. If you’re not familiar with pip, virtual environments, notebooks, or git, you may want to check out the Supporting. There are many amazing companies using great expectations these days. Check out some of our case studies with companies that we've worked closely with to understand how they are using great expectations in their data stack. Great expectations cloud is a fully managed SaaS offering. We're taking on new private alpha members for great expectations cloud, a fully managed SaaS offering. Alpha members get first access to new features and input to the roadmap.
  • 41
    Apache Airflow

    Apache Airflow

    The Apache Software Foundation

    Airflow is a platform created by the community to programmatically author, schedule and monitor workflows. Airflow has a modular architecture and uses a message queue to orchestrate an arbitrary number of workers. Airflow is ready to scale to infinity. Airflow pipelines are defined in Python, allowing for dynamic pipeline generation. This allows for writing code that instantiates pipelines dynamically. Easily define your own operators and extend libraries to fit the level of abstraction that suits your environment. Airflow pipelines are lean and explicit. Parametrization is built into its core using the powerful Jinja templating engine. No more command-line or XML black-magic! Use standard Python features to create your workflows, including date time formats for scheduling and loops to dynamically generate tasks. This allows you to maintain full flexibility when building your workflows.
  • Previous
  • You're on page 1
  • Next