Best Data Management Software for GitHub - Page 5

Compare the Top Data Management Software that integrates with GitHub as of October 2025 - Page 5

This is a list of Data Management software that integrates with GitHub. Use the filters on the left to narrow the list to products that have integrations with GitHub. View the products that work with GitHub in the table below.

  • 1
    Breadcrumb.ai

    Breadcrumb.ai

    Get facts in real-time with AI generative dashboards. Offload the work of combining multiple data sources, modeling, and calculating to Breadcrumb’s 100% accurate AI. Master the narrative with AI-assisted data presentation. Transform insights into slides and reports that you can use on a Zoom call, customized to your brand and audience. Traditional BI dashboards are a thing of the past. Go from servicing one client with an interactive dashboard to serving all of your clients in the same amount of time. Breadcrumb empowers your audience to go from big picture to detail with the click of a button. Breadcrumb is a web application accessible in any browser. Upload spreadsheets and connect applications that have your data, and our AI will analyze and recommend insights. You can then explore more insights by speaking to your data in plain language or generating shareable reports and dashboards.
    Starting Price: $8 per month
  • 2
    Foundational

    Foundational

    Identify code and optimization issues in real-time, prevent data incidents pre-deploy, and govern data-impacting code changes end to end—from the operational database to the user-facing dashboard. Automated, column-level data lineage, from the operational database all the way to the reporting layer, ensures every dependency is analyzed. Foundational automates data contract enforcement by analyzing every repository from upstream to downstream, directly from source code. Use Foundational to proactively identify code and data issues, find and prevent issues, and create controls and guardrails. Foundational can be set up in minutes with no code changes required.
  • 3
    Abstract Security

    Abstract Security

    Put your team’s focus back on catching attackers and let Abstract handle the heavy lifting of security data management. Our real-time streaming approach gives them the breathing room to prioritize their security effectiveness instead. No Noise: Remove unnecessary noise from your data in flight before routing it to your destination. No Lock-in: With our real-time normalization of data to OCSF format, route to any destination without worrying. No Hassle: No need to learn complex query languages with our easy-to-use 'no-code-required' model for policy creation. Additionally, let our AI SME help build your policies via natural language requests. No Alert Fatigue: Our AI SME can help summarize insights and prioritize alerts based on the MITRE ATT&CK Framework.
  • 4
    Citus

    Citus Data

    Citus gives you the Postgres you love, plus the superpower of distributed tables. 100% open source. Now with schema-based and row-based sharding, plus Postgres 16 support. Scale Postgres by distributing data & queries. You can start with a single Citus node, then add nodes & rebalance shards when you need to grow. Speed up queries by 20x to 300x (or more) through parallelism, keeping more data in memory, higher I/O bandwidth, and columnar compression. Citus is an extension (not a fork) to the latest Postgres versions, so you can use your familiar SQL toolset & leverage your Postgres expertise. Reduce your infrastructure headaches by using a single database for both your transactional and analytical workloads. Download and use Citus open source for free. You can manage Citus yourself, embrace open source, and help us improve Citus via GitHub. Focus on your application & forget about your database. Run your app on Citus in the cloud with Azure Cosmos DB for PostgreSQL.
    Starting Price: $0.27 per hour
  • 5
    TapData

    TapData

    CDC-based live data platform for heterogeneous database replication, real-time data integration, or building a real-time data warehouse. By using CDC to sync production line data stored in DB2 and Oracle to a modern database, TapData enabled AI-augmented real-time dispatch (RTD) software to optimize the semiconductor production line process. The real-time data made instant decision-making in the RTD software possible, leading to faster turnaround times and improved yield. As one of the largest telcos, the customer has many regional systems that cater to local customers. By syncing and aggregating data from various sources and locations into a centralized data store, the customer was able to build an order center where orders from many applications are aggregated. TapData seamlessly integrates inventory data from 500+ stores, providing real-time insights into stock levels and customer preferences and enhancing supply chain efficiency.
  • 6
    ProxySQL

    ProxySQL

    ProxySQL is built with an advanced multi-core architecture to support hundreds of thousands of concurrent connections, multiplexed to thousands of servers. ProxySQL supports sharding by user, schema, or table by means of the advanced query rule engine or through customized plugins. The development team no longer needs to rewrite queries generated by ORMs or packaged software; ProxySQL's query rewriting feature can modify SQL statements on the fly. Battle-tested doesn't even begin to cover it: ProxySQL is war-tested. Performance is the priority and the numbers prove it. ProxySQL is an open source, high-performance, high-availability, database protocol-aware proxy for MySQL and PostgreSQL. ProxySQL is a robust SQL proxy solution that acts as a pivotal bridge between database clients and servers, offering a plethora of features designed to streamline database operations. ProxySQL empowers organizations to harness the full potential of their database infrastructure.
  • 7
    Olive

    Olive

    Olive is an AI-powered platform that lets teams build full-stack internal tools and dashboards in minutes simply by describing what they need in natural language. It connects securely to your databases (PostgreSQL, MySQL, MongoDB, etc.) and third-party services (CRMs, analytics platforms, REST APIs), examines your schema, writes the necessary queries and application code, and deploys a polished, responsive web interface complete with data listing, filtering, editing and visualization components. Users can generate admin panels, CRM modules, support tools, inventory management systems, or any custom workflow without manual coding. Olive supports collaboration through organizational workspaces, role-based access controls, and single sign-on, while its progressive-web-app design delivers mobile-friendly experiences and offline access. An extensible API and prompt-engineering guidance allow advanced customization and integration into existing CI/CD pipelines.
  • 8
    Tray.ai

    Tray.ai

    Tray.ai is an API integration platform that allows users to innovate, integrate, and automate their organization with no developer resources needed. Tray.ai enables users to connect their entire cloud stack on their own. With Tray.ai, users can easily build and streamline processes with a specifically designed visual workflow editor. Tray.ai also empowers the users' workforce with automated processes. It provides the intelligence powering the first iPaaS that everyone can use to complete business processes using natural language instructions. Tray.ai is a low-code automation platform designed for both non-technical and technical users to create sophisticated workflow automations that facilitate efficient data movement and actions across multiple applications. Our low-code builder and new Merlin AI transform the automation process by bringing together the power of flexible, scalable automation; support for advanced business logic; and native generative AI capabilities that anyone can use.
  • 9
    Onna

    Reveal

    Connect and search across an ever-growing list of cloud platforms with Onna, a real-time search solution. Onna assists users in accessing eDiscovery and finding high-value items across legal departments. Onna provides users with reporting, document sharing, collaboration, compliance management, and more. Onna also integrates well with different data sources like Gmail, Dropbox, and Confluence.
  • 10
    Astro by Astronomer
    For data teams looking to increase the availability of trusted data, Astronomer provides Astro, a modern data orchestration platform, powered by Apache Airflow, that enables the entire data team to build, run, and observe data pipelines-as-code. Astronomer is the commercial developer of Airflow, the de facto standard for expressing data flows as code, used by hundreds of thousands of teams across the world.
  • 11
    Seerene

    Seerene

    Seerene’s Digital Engineering Platform is a software analytics and process mining technology that analyzes and visualizes the software development processes in your company. It reveals weaknesses and turns your organization into a well-oiled machine, delivering software efficiently, cost-effectively, quickly, and with the highest quality. Seerene provides decision-makers with the information needed to actively drive their organization towards 360° software excellence. Reveal code that frequently contains defects and kills developer productivity. Reveal lighthouse teams and transfer their best-practice processes across the entire workforce. Reveal defect risks in release candidates with a holistic X-ray of code, development hotspots, and tests. Reveal features with a mismatch between invested developer time and created user value. Reveal code that is never executed by end-users and produces unnecessary maintenance costs.
  • 12
    AutonomIQ

    AutonomIQ

    Our AI-driven, autonomous low-code automation platform is designed to help you achieve the highest quality outcome in the shortest amount of time possible. Generate automation scripts automatically in plain English with our Natural Language Processing (NLP) powered solution, and allow your coders to focus on innovation. Maintain quality throughout your application lifecycle with our autonomous discovery and up-to-date tracking of changes. Reduce risk in your dynamic development environment with our autonomous healing capability and deliver flawless updates by keeping automation current. Ensure compliance with all regulatory requirements and eliminate security risk using AI-generated synthetic data for all your automation needs. Run multiple tests in parallel, determine test frequency, keep pace with browser updates and executions across operating systems and platforms.
  • 13
    Nightfall

    Nightfall

    Discover, classify, and protect your sensitive data. Nightfall™ uses machine learning to identify business-critical data, like customer PII, across your SaaS, APIs, and data infrastructure, so you can manage & protect it. Integrate in minutes with cloud services via APIs to monitor data without agents. Machine learning classifies your sensitive data & PII with high accuracy, so nothing gets missed. Set up automated workflows for quarantines, deletions, alerts, and more, saving you time and keeping your business safe. Nightfall integrates directly with all your SaaS, APIs, and data infrastructure. Start building with Nightfall’s APIs for sensitive data classification & protection for free. Via REST API, programmatically get structured results from Nightfall’s deep learning-based detectors for things like credit card numbers, API keys, and more. Integrate with just a few lines of code. Seamlessly add data classification to your applications & workflows using Nightfall's REST API.
  • 14
    BMC Compuware File-AID
    Today’s Agile DevOps teams need the ability to go faster. BMC Compuware File-AID provides a cross-platform file and data management solution that enables developers and QA staff to quickly and conveniently access necessary data and files instead of hunting around for them. In turn, developers devote less time to data-related tasks and spend more time developing new functionality and managing production problems. Rightsizing your test data provides confidence to make code changes without unintended consequences. Access all standard file types regardless of record length or format for application integration. Compare data files or objects to simplify the test results validation process. Reformat files by easily modifying an existing file format instead of starting from scratch. Extract and load related subsets of data from multiple databases and files & more.
  • 15
    Elucidata Polly
    Harness the power of biomedical data with Polly. The Polly Platform helps to scale batch jobs, workflows, coding environments, and visualization applications. Polly allows resource pooling, provides optimal resource allocation based on your usage requirements, and makes use of spot instances whenever possible. All this leads to optimization, efficiency, faster response times, and lower resource costs. Get access to a dashboard to monitor resource usage and cost in real time and minimize the overhead of resource management for your IT team. Version control is integral to Polly’s infrastructure. Polly ensures version control for your workflows and analyses through a combination of Docker containers and interactive notebooks. We have built a mechanism that allows the data, code, and environment to co-exist. This, coupled with data storage on the cloud and the ability to share projects, ensures reproducibility of every analysis you perform.
  • 16
    Microsoft Power Query
    Power Query is the easiest way to connect, extract, transform, and load data from a wide range of sources. Power Query is a data transformation and data preparation engine. Power Query comes with a graphical interface for getting data from sources and a Power Query Editor for applying transformations. Because the engine is available in many products and services, the destination where the data will be stored depends on where Power Query was used. Using Power Query, you can perform the extract, transform, and load (ETL) processing of data. It is Microsoft’s data connectivity and data preparation technology that lets you seamlessly access data stored in hundreds of sources and reshape it to fit your needs, all with an easy-to-use, engaging, no-code experience. Power Query supports hundreds of data sources with built-in connectors, generic interfaces (such as REST APIs, ODBC, OLE DB, and OData), and the Power Query SDK to build your own connectors.
  • 17
    Metomic

    Metomic

    Reduce the risk of a data breach and automate necessary security practises, so you can spend time growing your business. Accurately identify sensitive data across all of your cloud apps and infrastructure, so you know precisely where it is, and who has access to it. Precisely control sensitive data across thousands of locations. Block data being uploaded to the wrong place, and automatically delete it when it's no longer needed. Put compliance on autopilot, with no added risk. Use Metomic's off-the-shelf data classifiers or create your own using our no-code data classifier builder. Create your own data-driven workflows from any app using our Webhooks or Query API. Metomic's secure architecture helps you eliminate your security risks, without adding new ones. Leverage Metomic's pre-built app integrations to gain visibility into data flows from day one. Explore your surface area of security risks and control what data is being processed where.
  • 18
    Gretel

    Gretel.ai

    Privacy engineering tools delivered to you as APIs. Synthesize and transform data in minutes. Build trust with your users and community. Gretel’s APIs grant immediate access to creating anonymized or synthetic datasets so you can work safely with data while preserving privacy. Keeping pace with development velocity requires faster access to data. Gretel is accelerating access to data with data privacy tools that bypass blockers and fuel Machine Learning and AI applications. Keep your data contained by running Gretel containers in your own environment, or scale out workloads to the cloud in seconds with Gretel Cloud runners. Using our cloud GPUs makes it far easier for developers to train and generate synthetic data. Scale workloads automatically with no infrastructure to set up and manage. Invite team members to collaborate on cloud projects and share data across teams.
  • 19
    Datafold

    Datafold

    Prevent data outages by identifying and fixing data quality issues before they get into production. Go from 0 to 100% test coverage of your data pipelines in a day. Know the impact of each code change with automatic regression testing across billions of rows. Automate change management, improve data literacy, achieve compliance, and reduce incident response time. Don’t let data incidents take you by surprise. Be the first one to know with automated anomaly detection. Datafold’s easily adjustable ML model adapts to seasonality and trend patterns in your data to construct dynamic thresholds. Save hours spent on trying to understand data. Use the Data Catalog to find relevant datasets, fields, and explore distributions easily with an intuitive UI. Get interactive full-text search, data profiling, and consolidation of metadata in one place.
  • 20
    Vectice

    Vectice

    Enabling every enterprise’s AI/ML initiatives to result in consistent and positive impact. Data scientists deserve a solution that makes all their experiments reproducible, every asset discoverable, and knowledge transfer simple. Managers deserve a dedicated data science solution to secure knowledge, automate reporting, and simplify reviews and processes. Vectice is on a mission to revolutionize the way data science teams work and collaborate. The goal is to ensure consistent and positive AI/ML impact for all organizations. Vectice is bringing the first automated knowledge solution that is data science aware, actionable, and compatible with the tools data scientists use. Vectice auto-captures all the assets that AI/ML teams create, such as datasets, code, notebooks, models, or runs. Then it auto-generates documentation from business requirements to production deployments.
  • 21
    OpsHub

    OpsHub

    OpsHub Integration Manager (OIM) can be configured to synchronize data between any of the 50+ tools in the ALM ecosystem. OIM provides an easy-to-use interface and intuitive user experience allowing users to easily configure the integration. The platform is built to be resilient and guarantees consistency of data in the systems that are being integrated. Businesses with heterogeneous IT landscapes need an agile integration that can put their entire value stream on a fast track and be a partner in their digital transformation. To remain competitive in the ever-evolving digital economy, it is now more crucial than ever to optimize processes and keep each step through the process connected. With OpsHub, get an enterprise-class integration solution that has been transforming clients’ value stream for over two decades.
  • 22
    Kovair QuickSync

    Kovair Software

    Kovair QuickSync is a one-stop, cost-effective, wide-ranging data migration solution for enterprises across industries. Kovair QuickSync is a Windows-based desktop solution that can be easily installed and used. Its minimal infrastructure requirements make it a very cost-effective and efficient solution. It helps migrate data not only from one source to one target but also from one source to multiple targets. Its intuitive UI makes it easy for users to adopt. It offers a built-in disaster recovery mechanism and re-migration capability to ensure 100% data migration with zero data loss. It supports template-based migration: once the configuration is done for one project, it can be reused for others. It provides on-screen monitoring of migration status, giving a real-time update on the health of the migration.
  • 23
    NVISIONx

    NVISIONx

    The NVISIONx data risk intelligence platform enables companies to gain control of their enterprise data to reduce data risks, compliance scopes, and storage costs. Data is growing out of control and getting worse every day. Business and security leaders are overwhelmed and can’t protect what they don’t know. More controls won’t fix the problem. The platform offers rich and unlimited analysis supporting over 150 business use cases, empowering data owners and cyber professionals to proactively manage their data from cradle to grave. First, categorize or group data that is redundant, outdated, or trivial (ROT) and see what data can be defensibly disposed of to reduce the classification scope (and storage costs). Then, contextually classify all remaining data using a number of easy-to-use data analytics techniques that enable data owners to be their own data analysts. Data identified as useless and unwanted can then go through legal and records retention reviews.
  • 24
    Integrate.io

    Integrate.io

    Unify Your Data Stack: Experience the first no-code data pipeline platform and power enlightened decision making. Integrate.io is the only complete set of data solutions & connectors for easy building and managing of clean, secure data pipelines. Increase your data team's output with all of the simple, powerful tools & connectors you’ll ever need in one no-code data integration platform. Empower any size team to consistently deliver projects on-time & under budget. We ensure your success by partnering with you to truly understand your needs & desired outcomes. Our only goal is to help you overachieve yours. Integrate.io's Platform includes: - No-Code ETL & Reverse ETL: Drag & drop no-code data pipelines with 220+ out-of-the-box data transformations - Easy ELT & CDC: The Fastest Data Replication On The Market - Automated API Generation: Build Automated, Secure APIs in Minutes - Data Warehouse Monitoring: Finally Understand Your Warehouse Spend - FREE Data Observability: Custom
  • 25
    Meltano

    Meltano

    Meltano provides the ultimate flexibility in deployment options. Own your data stack, end to end. An ever-growing library of 300+ connectors has been running in production for years. Run workflows in isolated environments, execute end-to-end tests, and version control everything. Open source gives you the power to build your ideal data stack. Define your entire project as code and collaborate confidently with your team. The Meltano CLI enables you to rapidly create your project, making it easy to start replicating data. Meltano is designed to be the best way to run dbt to manage your transformations. Your entire data stack is defined in your project, making it simple to deploy it to production. Validate your changes in development before moving to CI, and in staging before moving to production.
  • 26
    Zepl

    Zepl

    Sync, search, and manage all the work across your data science team. Zepl’s powerful search lets you discover and reuse models and code. Use Zepl’s enterprise collaboration platform to query data from Snowflake, Athena, or Redshift and build your models in Python. Use pivoting and dynamic forms for enhanced interactions with your data using heatmap, radar, and Sankey charts. Zepl creates a new container every time you run your notebook, providing you with the same image each time you run your models. Invite team members to join a shared space and work together in real time or simply leave their comments on a notebook. Use fine-grained access controls to share your work. Allow others to have read, edit, and run access, and enable collaboration and distribution. All notebooks are auto-saved and versioned. You can name, manage, and roll back all versions through an easy-to-use interface, and export seamlessly into GitHub.
  • 27
    LiveDocs

    LiveDocs

    Powerful, flexible, and simple, LiveDocs is the quickest way for anyone on your team to explore and share your data. Connect all your apps and centralize your data in one place for easy access. Discover trends, get notified of key events, and automate analysis for reporting. Build smart reports with data from multiple apps, including charts and metrics. Get a head start with templates. Pick a template that works or create one from scratch.
  • 28
    Waveline

    Waveline

    You get dozens of daily e-mails, but only some need your immediate attention, so an e-mail classifier helps you maintain an organized inbox. For customer complaints, we summarize the main issue and notify #customer-support on Slack. Delayed orders go into #customer-relation. After a customer call with your support agent, you want to stay informed on what happened. Instead of listening to the whole call, create a Waveline flow that summarizes the main points. Many people experience writer's block when writing text. Quickly build an internal tool with Waveline that automatically gathers information about the recipient from LinkedIn and a Google search to generate a highly personalized first draft. Parse unstructured data and repackage it into a structured format. Waveline uses LLMs to extract information from text, images, and more.
  • 29
    Polar Security

    Polar Security

    Automate data discovery, protection & governance in your cloud workload and SaaS applications. Automatically pinpoint all your exposed sensitive data in cloud workloads and SaaS applications, allowing you to shrink the data attack surface. Identify and classify sensitive data such as PII, PHI, PCI, and custom company IP to prevent sensitive data exposure. Get actionable insights on how to protect your cloud data and ensure compliance, in real-time. Enforce data access policies to achieve least privileged access, maintain a strong security posture, and remain resilient to cyber-threats.
  • 30
    Unstructured

    Unstructured

    80% of enterprise data exists in difficult-to-use formats like HTML, PDF, CSV, PNG, PPTX, and more. Unstructured effortlessly extracts and transforms complex data for use with every major vector database and LLM framework. Unstructured allows data scientists to pre-process data at scale so they spend less time collecting and cleaning, and more time modeling and analyzing. Our enterprise-grade connectors capture data wherever it lives, so we can transform it into AI-friendly JSON files for companies who are eager to fold AI into their business. You can count on Unstructured to deliver data that's curated, clean of artifacts, and most importantly, LLM-ready.