Best Data Management Software for GitHub - Page 4

Compare the Top Data Management Software that integrates with GitHub as of October 2025 - Page 4

This is a list of Data Management software that integrates with GitHub. View the products that work with GitHub in the table below.

  • 1
    PouchDB

    PouchDB

    PouchDB is an open source JavaScript database inspired by Apache CouchDB, designed to run efficiently within the browser. It enables applications to store data locally while offline and synchronize it with CouchDB and compatible servers when back online, ensuring user data remains in sync across sessions. PouchDB supports cross-browser functionality, is lightweight, requires just a script tag and 46KB (gzipped) in the browser, and can be installed via npm. It is easy to learn, requiring some programming knowledge, and is fully open source, with development conducted openly on GitHub. PouchDB allows developers to build applications that function seamlessly offline and online, providing a consistent user experience regardless of network connectivity. It offers a simple API for creating, reading, updating, and deleting documents.
    Starting Price: Free
  • 2
    Dataform

    Google

    Dataform enables data analysts and data engineers to develop and operationalize scalable data transformation pipelines in BigQuery using only SQL from a single, unified environment. Its open source core language lets teams define table schemas, configure dependencies, add column descriptions, and set up data quality assertions within a shared code repository while applying software development best practices, version control, environments, testing, and documentation. A fully managed, serverless orchestration layer automatically handles workflow dependencies, tracks lineage, and executes SQL pipelines on demand or via schedules in Cloud Composer, Workflows, BigQuery Studio, or third-party services. In the browser-based development interface, users get real-time error feedback, visualize dependency graphs, connect to GitHub or GitLab for commits and code reviews, and launch production-grade pipelines in minutes without leaving BigQuery Studio.
    Starting Price: Free
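    To make the idea of configured dependencies concrete, the Python sketch below orders a handful of tables so that each one runs only after the tables it depends on. It is not Dataform's core language or API: the table names and the DEPENDENCIES mapping are hypothetical, and the standard-library graphlib module merely stands in for whatever the managed orchestration layer actually does.

```python
from graphlib import TopologicalSorter

# Hypothetical tables, each mapped to the set of tables it depends on.
DEPENDENCIES = {
    "raw_orders": set(),
    "raw_customers": set(),
    "clean_orders": {"raw_orders"},
    "orders_by_customer": {"clean_orders", "raw_customers"},
}


def execution_order(dependencies):
    """Return table names ordered so that every dependency comes first."""
    return list(TopologicalSorter(dependencies).static_order())


if __name__ == "__main__":
    for table in execution_order(DEPENDENCIES):
        print(table)
```

    The exact order among independent tables is not guaranteed; only the dependency constraints are.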
  • 3
    Adverity

    Adverity GmbH

    Adverity is the fully-integrated data platform for automating the connectivity, transformation, governance and utilization of data at scale. The platform enables businesses to blend disparate datasets such as sales, finance, marketing, and advertising, to create a single source of truth over business performance. Through automated connectivity to hundreds of data sources and destinations, unrivaled data transformation options, and powerful data governance features, Adverity is the easiest way to get your data how you want it, where you want it, and when you need it. Adverity was founded in 2015 and is headquartered in Vienna with offices in London and New York, and currently works with leading brands and agencies including Unilever, Bosch, IKEA, Forbes, GroupM, Publicis, and Dentsu.
  • 4
    Xplenty

    Xplenty Data Integration

    Xplenty, a scalable data integration and delivery software, allows SMBs and large enterprises to prepare and transfer data for analytics to the cloud. Xplenty features include data transformations, drag-and-drop interface, and integration with over 100 data stores and SaaS applications. Xplenty can be added by developers to their data solution stack with ease. Xplenty also allows users to schedule jobs and monitor job progress and status.
  • 5
    DBxtra

    Advisionario, S.A. de C.V.

    DBxtra is a cloud-based business intelligence, reporting and dashboard solution that lets businesses create and distribute interactive web-reports and dashboards. With DBxtra, businesses need not have any knowledge of SQL, programming or web technologies to design, deploy, and manage ad-hoc reports. The platform has all your reporting needs covered. DBxtra can be taken on a test drive for 30 days.
  • 6
    Alooma

    Google

    Alooma enables data teams to have visibility and control. It brings data from your various data silos together into BigQuery, all in real time. Set up and flow data in minutes or customize, enrich, and transform data on the stream before it even hits the data warehouse. Never lose an event. Alooma's built-in safety nets ensure easy error handling without pausing your pipeline. For any number of data sources, from low to high volume, Alooma’s infrastructure scales to your needs.
  • 7
    Enov8

    Enov8

    End-to-end “Business Intelligence” for your IT organization. Promoting transparency, control, and productivity across environments, releases, and data. Promote scaled agility across your IT fabric. A complete environment and release picture supporting collaboration across teams and providing the insight that organizations require today to drive competitive innovation. Improve visibility of your complex IT fabric, allowing better collaboration and decision making. Manage complex computer systems & the end-to-end IT fabric through a centralized portal. Measure test environment usage to reduce IT spend and increase project productivity. Eliminate chaotic and non-repeatable operations by establishing control via centralized runbooks and automating regular & time-consuming tasks. Manage change and contention effectively whilst providing real-time health status and powerful analytics to determine business impact.
    Starting Price: $8 per month
  • 8
    Fivetran

    Fivetran

    Fivetran is a leading data integration platform that centralizes an organization’s data from various sources to enable modern data infrastructure and drive innovation. It offers over 700 fully managed connectors to move data automatically, reliably, and securely from SaaS applications, databases, ERPs, and files to data warehouses and lakes. The platform supports real-time data syncs and scalable pipelines that fit evolving business needs. Trusted by global enterprises like Dropbox, JetBlue, and Pfizer, Fivetran helps accelerate analytics, AI workflows, and cloud migrations. It features robust security certifications including SOC 1 & 2, GDPR, HIPAA, and ISO 27001. Fivetran provides an easy-to-use, customizable platform that reduces engineering time and enables faster insights.
  • 9
    Coralogix

    Coralogix

    Coralogix is the leading stateful streaming platform providing modern engineering teams with real-time insights and long-term trend analysis with no reliance on storage or indexing. Ingest data from any source for a centralized platform to manage, monitor, and alert on your applications. As data is ingested, Coralogix instantly narrows millions of events down to common patterns for deeper insights and faster troubleshooting. Machine learning algorithms continuously observe data patterns and flows between system components and trigger dynamic alerts so you know when a pattern deviates from the norm without static thresholds or the need for pre-configurations. Connect any data, in any format, and view your insights anywhere including our purpose-built UI, Kibana, Grafana, SQL clients, Tableau, or using our CLI and full API support. Coralogix has successfully completed relevant security and privacy compliances by BDO including GDPR, SOC 2, PCI, HIPAA, and ISO 27001/27701.
  • 10
    Zyte

    Zyte

    Hi, we’re Zyte (formerly Scrapinghub)! We are the leader in web data extraction technology and services. We’re obsessed with data. And what it can do for businesses. We help thousands of companies and millions of developers to get their hands on clean, accurate data. Quickly, reliably and at scale. Every day, for more than a decade. From price intelligence, news and media, job listings and entertainment trends, brand monitoring, and more, our customers rely on us to obtain dependable data from over 13 billion web pages each month. We led the way with open source projects like Scrapy, products like our Smart Proxy Manager (formerly Crawlera), and our end-to-end data extraction services. Our fully remote team of nearly two hundred developers and extraction experts set out to remove the barriers to data and change the game.
  • 11
    Qualibrate

    Qualibrate

    Qualibrate is the cloud solution for test automation of SAP and web apps such as Salesforce. It combines simplicity, customization, and integration with most CI/CD tools. Test cases are highly reusable and easily maintainable. Undertaking a software transformation journey is high risk. We offer a simple yet powerful solution to minimize the risk and reduce implementation resources by up to 80%. All you need to do is record a Business Process: user actions, test data, and technical information will be captured. The recording will be your single source of truth for running Automated tests and Manual tests, and also for Learning. Check out the website to see how Qualibrate is reinventing test automation for SAP and web apps.
  • 12
    Stackby

    Stackby

    No more messy spreadsheets when you have easy databases. With the simplicity of spreadsheets, the functionality of databases, and integrations with the best business APIs, Stackby gives you the flexibility to build your own work tools your way. No coding or technical skills are required. It's as easy as a spreadsheet, but you can mold it into a powerful relational database that your teams will love to use. Import Excel or Google Sheets files, or create your own template, in a few minutes. No coding or training is needed. With 25+ unique column data types, 6 data views, powerups, an apps marketplace & ever-increasing capabilities, you can build custom no-code solutions for your teams to move work forward faster. Discover useful powerups, build custom reporting dashboards, and extend your workflows with the powerful building blocks your team needs to reach new milestones.
    Starting Price: $5 per user per month
  • 13
    Backtrace

    Backtrace

    Don’t let app, device, or game crashes get in the way of a great experience. Backtrace takes all the manual labor out of cross-platform crash and exception management so you can focus on shipping. Cross-platform callstack and event aggregation and monitoring. Process errors from panics, core dumps, minidumps, and during runtime across your stack with a single system. Backtrace generates structured, searchable error reports from your data. Automated analysis cuts down on time to resolution by surfacing important signals that lead engineers to crash root cause. Never worry about missing a clue with rich integrations into dashboards, notification, and workflow systems. Answer the questions that matter to you with Backtrace’s rich query engine. View a high-level overview of error frequency, prioritization, and trends across all your projects. Search through key data points and your own custom data across all your errors.
  • 14
    Tengu

    Tengu

    TENGU is a DataOps Orchestration Platform that works as a central workspace for data profiles of all levels. It provides data integration, extraction, transformation, and loading, all within its graph view UI, in which you can intuitively monitor your data environment. By using the platform, business, analytics & data teams need fewer meetings and service tickets to collect data, and can start right away with the data relevant to furthering the company. The platform offers a unique graph view in which every element is automatically generated with all available info based on metadata, while allowing you to perform all necessary actions from the same workspace. Enhance collaboration and efficiency with the ability to quickly add and share comments, documentation, tags, and groups. The platform enables anyone to get straight to the data with self-service, thanks to its many automations, low- to no-code functionalities, and built-in assistant.
  • 15
    Logstash

    Elasticsearch

    Centralize, transform & stash your data. Logstash is a free and open server-side data processing pipeline that ingests data from a multitude of sources, transforms it, and then sends it to your favorite "stash." Logstash dynamically ingests, transforms, and ships your data regardless of format or complexity. Derive structure from unstructured data with grok, decipher geo coordinates from IP addresses, anonymize or exclude sensitive fields, and ease overall processing. Data is often scattered or siloed across many systems in many formats. Logstash supports a variety of inputs that pull in events from a multitude of common sources, all at the same time. Easily ingest from your logs, metrics, web applications, data stores, and various AWS services, all in continuous, streaming fashion. Download: https://sourceforge.net/projects/logstash.mirror/
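    As a rough illustration of what deriving structure from unstructured data means, the Python sketch below parses a log line with a named-group regular expression, which is the general idea behind grok-style patterns. It is not Logstash configuration and does not use Logstash: the log layout, the LOG_PATTERN expression, and the parse_line helper are all assumptions made for this example.

```python
import re
from typing import Optional

# Hypothetical access-log layout, chosen only for illustration:
#   <client ip> <HTTP method> <path> <status code> <response bytes>
LOG_PATTERN = re.compile(
    r"(?P<client_ip>\d{1,3}(?:\.\d{1,3}){3}) "
    r"(?P<method>[A-Z]+) "
    r"(?P<path>\S+) "
    r"(?P<status>\d{3}) "
    r"(?P<bytes>\d+)"
)


def parse_line(line: str) -> Optional[dict]:
    """Return the named fields of a matching line, or None if it does not match."""
    match = LOG_PATTERN.fullmatch(line.strip())
    if match is None:
        return None
    event = match.groupdict()
    # Convert numeric fields so later stages can filter or aggregate on them.
    event["status"] = int(event["status"])
    event["bytes"] = int(event["bytes"])
    return event


if __name__ == "__main__":
    print(parse_line("203.0.113.7 GET /index.html 200 5120"))
```

    Lines that do not fit the assumed layout return None rather than raising, so a caller can route them elsewhere for inspection.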
  • 16
    Lenses

    Lenses.io

    Enable everyone to discover and observe streaming data. Sharing, documenting and cataloging your data can increase productivity by up to 95%. Then from data, build apps for production use cases. Apply a data-centric security model to cover all the gaps of open source technology, and address data privacy. Provide secure and low-code data pipeline capabilities. Eliminate all darkness and offer unparalleled observability in data and apps. Unify your data mesh and data technologies and be confident with open source in production. Lenses is the highest rated product for real-time stream analytics according to independent third party reviews. With feedback from our community and thousands of engineering hours invested, we've built features that ensure you can focus on what drives value from your real time data. Deploy and run SQL-based real time applications over any Kafka Connect or Kubernetes infrastructure including AWS EKS.
    Starting Price: $49 per month
  • 17
    Y42

    Datos-Intelligence GmbH

    Y42 is the first fully managed Modern DataOps Cloud. It is purpose-built to help companies easily design production-ready data pipelines on top of their Google BigQuery or Snowflake cloud data warehouse. Y42 provides native integration of best-of-breed open-source data tools, comprehensive data governance, and better collaboration for data teams. With Y42, organizations enjoy increased accessibility to data and can make data-driven decisions quickly and efficiently.
  • 18
    Iterative

    Iterative

    AI teams face challenges that require new technologies. We build these technologies. Existing data warehouses and data lakes do not fit unstructured datasets like text, images, and videos. AI hand in hand with software development. Built with data scientists, ML engineers, and data engineers in mind. Don’t reinvent the wheel! Fast and cost‑efficient path to production. Your data is always stored by you. Your models are trained on your machines. Studio is an extension of GitHub, GitLab or BitBucket. Sign up for the online SaaS version or contact us to get an on-premise installation.
  • 19
    Mozart Data

    Mozart Data

    Mozart Data is the all-in-one modern data platform that makes it easy to consolidate, organize, and analyze data. Start making data-driven decisions by setting up a modern data stack in an hour - no engineering required.
  • 20
    IRI Voracity

    IRI, The CoSort Company

    Voracity is the only high-performance, all-in-one data management platform accelerating AND consolidating the key activities of data discovery, integration, migration, governance, and analytics. Voracity helps you control your data in every stage of the lifecycle, and extract maximum value from it. Only in Voracity can you: 1) CLASSIFY, profile and diagram enterprise data sources; 2) Speed or LEAVE legacy sort and ETL tools; 3) MIGRATE data to modernize and WRANGLE data to analyze; 4) FIND PII everywhere and consistently MASK it for referential integrity; 5) Score re-ID risk and ANONYMIZE quasi-identifiers; 6) Create and manage DB subsets or intelligently synthesize TEST data; 7) Package, protect and provision BIG data; 8) Validate, scrub, enrich and unify data to improve its QUALITY; 9) Manage metadata and MASTER data. Use Voracity to comply with data privacy laws, de-muck and govern the data lake, improve the reliability of your analytics, and create safe, smart test data.
  • 21
    Supabase

    Supabase

    Create a backend in less than 2 minutes. Start your project with a Postgres database, authentication, instant APIs, real-time subscriptions and storage. Build faster and focus on your products. Every project is a full Postgres database, the world's most trusted relational database. Add user sign-ups and logins, securing your data with Row Level Security. Store, organize and serve large files. Any media, including videos and images. Write custom code and cron jobs without deploying or scaling servers. There are many example apps and starter projects to get going. We introspect your database to provide APIs instantly. Stop building repetitive CRUD endpoints and focus on your product. Type definitions built directly from your database schema. Use Supabase in the browser without a build process. Develop locally and push to production when you're ready. Manage Supabase projects from your local machine.
    Starting Price: $25 per month
  • 22
    SAS Viya
    SAS® Viya® data science offerings provide a comprehensive, scalable analytics environment that's quick and easy to deploy, enabling you to meet diverse business needs. Automatically generated insights enable you to identify the most common variables across all models, the most important variables selected across models and assessment results for all models. Natural language generation capabilities are used to create project summaries written in plain language, enabling you to easily interpret reports. Analytics team members can add project notes to the insights report to facilitate communication and collaboration among team members. SAS lets you embed open source code within an analysis and call open source algorithms seamlessly within its environment. This facilitates collaboration across your organization because users can program in their language of choice. You can also take advantage of SAS Deep Learning with Python (DLPy), our open-source package on GitHub.
  • 23
    Hex

    Hex

    Hex brings together the best of notebooks, BI, and docs into a seamless, collaborative UI. Hex is a modern Data Workspace. It makes it easy to connect to data, analyze it in collaborative SQL and Python-powered notebooks, and share work as interactive data apps and stories. Your default landing page in Hex is the Projects page. You can quickly find projects you created, as well as those shared with you and your workspace. The outline provides an easy-to-browse overview of all the cells in a project's Logic View. Every cell in the outline lists the variables it defines, and cells that return a displayed output (chart cells, Input Parameters, markdown cells, etc.) display a preview of that output. You can click any cell in the outline to automatically jump to that position in the logic.
    Starting Price: $24 per user per month
  • 24
    Gravity Data
    Gravity's mission is to make streaming data easy from over 100 sources while only paying for what you use. Gravity removes the reliance on engineering teams to deliver streaming pipelines with a simple interface to get streaming up and running in minutes from databases, event data and APIs. Everyone in the data team can now build with simple point and click so that you can focus on building apps, services and customer experiences. Full execution traces and detailed error messaging support quick diagnosis and resolution. We have implemented new, feature-rich ways for you to quickly get started, from bulk set-up, default schemas and data selection to different job modes and statuses. Spend less time wrangling with infrastructure and more time analysing data while allowing our intelligent engine to keep your pipelines running. Gravity integrates with your systems for notifications and orchestration.
  • 25
    Progress DataDirect

    Progress Software

    Empowering applications with enterprise data is our passion here at Progress DataDirect. We offer cloud and on-premises data connectivity solutions across relational, NoSQL, Big Data, and SaaS data sources. Performance, reliability, and security are at the heart of everything we design for thousands of enterprises and the leading vendors in analytics, BI, and data management. Minimize your development costs with our portfolio of high-value connectors for a variety of data sources. Enjoy 24/7 world-class support and security for greater peace of mind. Connect with affordable, easy-to-use, and time-saving drivers for faster SQL access to your data. As a leader in data connectivity, keeping up with the evolving trends in the space is our mission. But if we haven’t built the connector you need yet, reach out and we’ll help you develop the right solution. Embed connectivity in an application or service.
  • 26
    Nextflow

    Seqera Labs

    Data-driven computational pipelines. Nextflow enables scalable and reproducible scientific workflows using software containers. It allows the adaptation of pipelines written in the most common scripting languages. Its fluent DSL simplifies the implementation and deployment of complex parallel and reactive workflows on clouds and clusters. Nextflow is built around the idea that Linux is the lingua franca of data science. Nextflow allows you to write a computational pipeline by making it simpler to put together many different tasks. You may reuse your existing scripts and tools, and you don't need to learn a new language or API to start using it. Nextflow supports Docker and Singularity container technologies. This, along with integration with the GitHub code-sharing platform, allows you to write self-contained pipelines, manage versions, and rapidly reproduce any former configuration. Nextflow provides an abstraction layer between your pipeline's logic and the execution layer.
    Starting Price: Free
  • 27
    CorralData

    CorralData

    CorralData is a full-service, AI-powered platform that radically simplifies unifying and analyzing data across multiple sources. With CorralData, you can access human data experts and industry-leading AI tools to easily and cost-effectively drive remarkable, data-fueled results. No coding, SQL, or engineering resources required. Here’s how CorralData works. First, unify all of your data sources using our 450+ pre-built data connectors. Our U.S.-based support team will guide you along the way. Then, begin instantly and securely unlocking powerful data insights using natural language AI questions. Or work directly with a human data expert to surface the insight you need. The CorralData platform includes everything you need to start using data to drive results in one simple, monthly subscription—stunning data visualization tools, built-in chat, custom reporting, and access to human experts. Schedule a free demo today and see how simple and easy AI-powered data can be.
    Starting Price: $200/month
  • 28
    Bytebase

    Bytebase

    Bytebase offers a web-based collaboration workspace to help DBAs and developers manage the database development lifecycle. Standardize database changes with peer review, lint rules, and GitOps integration. Run ad-hoc SQL queries with built-in data privacy protection via web-based IDE. Eliminate administrative complexity and ensure compliance with holistic policy control. Bytebase streamlines database deployment from non-prod to prod with change review workflow and optional GitOps integration. 100+ lint rules to detect SQL anti-patterns and enforce consistent SQL style in the organization. Batch change to multiple databases across different environments and tenants in a single ticket. Reuse developer workflow and manage change scripts in VCS. Lockless schema change to reduce database downtime from hours to seconds. Bytebase employs a security-first approach to protect your data with access control, data anonymization, watermarking, audit log, and more.
    Starting Price: $100 per month
  • 29
    Chalk

    Chalk

    Powerful data engineering workflows, without the infrastructure headaches. Complex streaming, scheduling, and data backfill pipelines are all defined in simple, composable Python. Make ETL a thing of the past: fetch all of your data in real time, no matter how complex. Incorporate deep learning and LLMs into decisions alongside structured business data. Make better predictions with fresher data, don’t pay vendors to pre-fetch data you don’t use, and query data just in time for online predictions. Experiment in Jupyter, then deploy to production. Prevent train-serve skew and create new data workflows in milliseconds. Instantly monitor all of your data workflows in real time; track usage and data quality effortlessly. Know everything you computed and replay any data. Integrate with the tools you already use and deploy to your own infrastructure. Decide and enforce withdrawal limits with custom hold times.
    Starting Price: Free
  • 30
    Zerve AI

    Zerve AI

    Merging the best of a notebook and an IDE into one integrated coding environment, Zerve lets experts explore their data and write stable code at the same time with fully automated cloud infrastructure. Zerve’s data science development environment gives data science and ML teams a unified space to explore, collaborate, build, and deploy data science & AI projects. Zerve offers true language interoperability: users can write Python, R, SQL, or Markdown in the same canvas and connect these code blocks to each other. No more long-running code blocks or containers: Zerve offers unlimited parallelization at any stage of the development journey. Analysis artifacts are automatically serialized, versioned, stored, and preserved for later use, so you can change a step in the data flow without rerunning any preceding steps. Fine-grained selection of compute resources and extra memory is available for complex data transformation.