Best Data Management Software for GitHub - Page 6

Compare the Top Data Management Software that integrates with GitHub as of October 2025 - Page 6

This is a list of Data Management software that integrates with GitHub. The products that work with GitHub are listed below.

  • 1
    Tarsal

    Tarsal

    Tarsal's infinite scalability means that as your organization grows, Tarsal grows with you. Tarsal makes it easy for you to switch where you're sending data: today's SIEM data is tomorrow's data lake data, all with one click. Keep your SIEM and gradually migrate analytics over to a data lake. You don't have to rip anything out to use Tarsal. Some analytics just won't run on your SIEM. Use Tarsal to have query-ready data on a data lake. Your SIEM is one of the biggest line items in your budget. Use Tarsal to send some of that data to your data lake. Tarsal is the first highly scalable ETL data pipeline built for security teams. Easily exfiltrate terabytes of data in just a few clicks, with instant normalization, and route that data to your desired destination.
  • 2
    MINDely
    MIND is the first-ever data security platform that puts data loss prevention (DLP) and insider risk management (IRM) programs on autopilot, so you can automatically identify, detect, and prevent data leaks at machine speed. Continuously find your sensitive data in files spread across your IT environments, whether at rest, in motion, or in use. MIND continuously exposes blind spots of sensitive data by integrating with data sources across your IT workloads, including SaaS, AI apps, endpoints, on-premises file shares, and emails. MIND monitors and analyzes billions of data security events in real time, enriches each incident with context, and remediates autonomously. MIND automatically blocks sensitive data in real time from escaping your control, or collaborates with users to remediate risks and educate them on your policies.
  • 3
    ScrapeOps

    ScrapeOps

    Schedule your scraping jobs, monitor their performance, and scrape with proxies from the ScrapeOps dashboard. Use more than 20 proxy providers with our all-in-one proxy aggregator. We find the best proxy providers so you don't have to. Connect your server with ScrapeOps, deploy code from GitHub, and schedule your spiders from the ScrapeOps dashboard. Easily monitor your scrapers, log errors, configure health checks, and get alerts from the ScrapeOps dashboard. ScrapeOps is a comprehensive platform tailored for web scraping, offering tools for job scheduling, real-time monitoring, error tracking, and proxy management. The platform enables users to connect servers and GitHub repositories, facilitating the deployment, scheduling, and management of scraping jobs across multiple servers from a unified dashboard. The ScrapeOps SDK provides real-time and historical job statistics, allowing users to monitor job progress, compare current runs with previous ones, and identify trends.
  • 4
    SQLNotebook

    TimeStored

    SQL Notebooks allow developers to write Markdown combined with SQL to produce live HTML5 reports. They offer a lightning-fast, modern HTML5 interface where data sources are queried in real time. Users can create beautiful, live-updating SQL notebooks, easily source control the code, and take static snapshots to share with colleagues who don't have database access. SQL Notebooks are available in both QStudio Version 4, a desktop SQL client based on editing markdown files locally, and Pulse Version 3, which serves as a shared team server accessible via a web address. To help users get started, a showcase of example notebooks has been created in collaboration with leading community members; these examples are snapshotted versions with static data, and the source markdown and most of the data to recreate them are available on GitHub.
  • 5
    TROCCO

    primeNumber Inc

    TROCCO is a fully managed modern data platform that enables users to integrate, transform, orchestrate, and manage their data from a single interface. It supports a wide range of connectors, including advertising platforms like Google Ads and Facebook Ads, cloud services such as AWS Cost Explorer and Google Analytics 4, various databases like MySQL and PostgreSQL, and data warehouses including Amazon Redshift and Google BigQuery. The platform offers features like Managed ETL, which allows for bulk importing of data sources and centralized ETL configuration management, eliminating the need to manually create ETL configurations individually. Additionally, TROCCO provides a data catalog that automatically retrieves metadata from data analysis infrastructure, generating a comprehensive catalog to promote data utilization. Users can also define workflows to create a series of tasks, setting the order and combination to streamline data processing.
  • 6
    SchemaFlow

    SchemaFlow

    SchemaFlow is a powerful tool designed to enhance AI-powered development by providing real-time access to your PostgreSQL database schema through the Model Context Protocol (MCP). It allows developers to connect their databases, visualize schema structures with interactive diagrams, and export schemas in various formats such as JSON, Markdown, SQL, and Mermaid. With native MCP support via Server-Sent Events (SSE), SchemaFlow enables seamless integration with AI-Integrated Development Environments (AI-IDEs) like Cursor, Windsurf, and VS Code, ensuring that AI assistants have up-to-date schema information for accurate code generation. It offers secure token-based authentication for MCP connections, automatic schema synchronization to keep AI assistants informed of any changes, and a schema browser for easy navigation of tables and relationships.
  • 7
    TIBCO Streaming
    TIBCO Streaming is a real-time analytics platform designed to process and analyze high-velocity data streams, enabling organizations to make immediate, data-driven decisions. It offers a low-code development environment through StreamBase Studio, allowing users to build complex event processing applications with minimal coding. It supports over 150 connectors, including APIs, Apache Kafka, MQTT, RabbitMQ, and JDBC-accessible databases such as MySQL, facilitating seamless integration with various data sources. TIBCO Streaming incorporates dynamic learning operators, enabling adaptive machine learning models that provide contextual insights and automate decision-making processes. It also features real-time business intelligence capabilities, allowing users to visualize live data alongside historical information for comprehensive analysis. It is cloud-ready, supporting deployments on AWS, Azure, GCP, and on-premises environments.
  • 8
    Singer

    Singer

    Singer describes how data extraction scripts called “taps” and data loading scripts called “targets” should communicate, allowing them to be used in any combination to move data from any source to any destination. Send data between databases, web APIs, files, queues, and just about anything else you can think of. Singer taps and targets are simple applications composed with pipes—no daemons or complicated plugins needed. Singer applications communicate with JSON, making them easy to work with and implement in any programming language. Singer also supports JSON Schema to provide rich data types and rigid structure when needed. Singer makes it easy to maintain state between invocations to support incremental extraction.
  • 9
    GenRocket

    GenRocket

    Enterprise synthetic test data solutions. In order to generate test data that accurately reflects the structure of your application or database, it must be easy to model and maintain each test data project as changes to the data model occur throughout the lifecycle of the application. Maintain referential integrity of parent/child/sibling relationships across the data domains within an application database or across multiple databases used by multiple applications. Ensure the consistency and integrity of synthetic data attributes across applications, data sources, and targets. For example, a customer name must always match the same customer ID across multiple transactions simulated by real-time synthetic data generation. Customers want to quickly and accurately create their data model as a test data project. GenRocket offers 10 methods for data model setup: XTS, DDL, Scratchpad, Presets, XSD, CSV, YAML, JSON, Spark Schema, and Salesforce.
  • 10
    Code Ocean

    Code Ocean

    The Code Ocean Computational Workbench speeds usability, coding and data tool integration, and DevOps and lifecycle tasks by closing technology gaps with a highly intuitive, ready-to-use user experience. Ready-to-use RStudio, Jupyter, Shiny, Terminal, and Git. Choice of popular languages. Access to any size of data and storage type. Configure and generate Docker environments. One-click access to AWS compute resources. Using the Code Ocean Computational Workbench app panel, researchers share results by generating and publishing easy-to-use, point-and-click web analysis apps to teams of scientists without any IT support, coding, or use of the command line. Create and deploy interactive analysis. Used in standard web browsers. Easy to share and collaborate. Reusable, easy to manage. With an easy-to-use application and repository, researchers can quickly organize, publish, and secure project-based Compute Capsules, data assets, and research results.
  • 11
    SSIS Integration Toolkit
    Jump right to our product page to see our full range of data integration software, including solutions for SharePoint and Active Directory. With over 300 individual data integration tools for connectivity and productivity, our data integration solutions allow developers to take advantage of the flexibility and power of the SSIS ETL engine to integrate virtually any application or data source. You don't have to write a single line of code to make data integration happen so your development can be done in a matter of minutes. We make the most flexible integration solution on the market. Our software offers intuitive user interfaces that are flexible and easy to use. With a streamlined development experience and an extremely simple licensing model, our solution offers the best value for your investment. Our software offers many specifically designed features that help you achieve the best possible performance without having to hijack your budget.
  • 12
    Comake

    Comake

    Easily create cutting-edge AI applications and unlock growth with unified data. Comake revolutionizes your data management and software landscape by consolidating fragmented data and simplifying access across your organization. Discover the right information effortlessly, exactly when you need it. Say farewell to data chaos and overwhelming complexity in your software initiatives. Experience unprecedented efficiency and productivity with Comake's seamless data unification. The future thrives on specialized, AI-driven applications that intelligently handle specific tasks. Comake enables you to establish a central hub for accessing all your data, while seamlessly integrating with the modular capabilities you build up, including automation, AI-powered processes, agents, and manual workflows. Experience the power of connected innovation with Comake. Unlock the potential of a unified data foundation with Comake's user-friendly infrastructure and technologies.
  • 13
    Data Sentinel

    Data Sentinel

    As a business leader, you need to trust your data and be 100% certain that it’s well-governed, compliant, and accurate, including all data in all sources and all locations, without limitations. Understand your data assets. Audit for risk, compliance, and quality in support of your project. Catalog a complete data inventory across all sources and data types, creating a shared understanding of your data assets. Run a one-time, fast, affordable, and accurate audit of your data. PCI, PII, and PHI audits are fast, accurate, and complete, and are delivered as a service, with no software to purchase. Measure and audit data quality and data duplication across all of your enterprise data assets, cloud-native and on-premises. Comply with global data privacy regulations at scale. Discover, classify, track, trace, and audit privacy compliance. Monitor PII/PCI/PHI data propagation and automate DSAR compliance processes.
  • 14
    DataKitchen

    DataKitchen

    Reclaim control of your data pipelines and deliver value instantly, without errors. The DataKitchen™ DataOps platform automates and coordinates all the people, tools, and environments in your entire data analytics organization – everything from orchestration, testing, and monitoring to development and deployment. You’ve already got the tools you need. Our platform automatically orchestrates your end-to-end multi-tool, multi-environment pipelines – from data access to value delivery. Catch embarrassing and costly errors before they reach the end-user by adding any number of automated tests at every node in your development and production pipelines. Spin-up repeatable work environments in minutes to enable teams to make changes and experiment – without breaking production. Fearlessly deploy new features into production with the push of a button. Free your teams from tedious, manual work that impedes innovation.
  • 15
    SmartDraw

    SmartDraw

    SmartDraw is a data-driven diagramming and collaboration solution that can replace Lucidchart, Visio, or Miro at your enterprise. Get all the features you need at a more affordable price:
    - Sophisticated diagramming that lets your team make flowcharts, organizational charts, floor plans, CAD drawings, project charts, network diagrams, UML diagrams, AWS, Azure, and more
    - Whiteboarding and real-time collaboration
    - Powerful integrations that allow you to generate diagrams from data automatically
    - Migrate your existing Visio and Lucidchart files in bulk
    SmartDraw will save files directly to OneDrive, SharePoint, or Google Drive, giving you full control of your data. Minimize risk, simplify compliance, and increase data security. SmartDraw also works hand in glove with your existing IT infrastructure without disruption. You can provision users, save files, and set permissions entirely inside the Microsoft or Google enterprise stack.
    Starting Price: $10.95 per user per month
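
The Singer entry above describes taps that emit JSON over a pipe, use JSON Schema for structure, and keep state between runs for incremental extraction. The following is a minimal sketch of such a tap in plain Python. It assumes the SCHEMA, RECORD, and STATE message types from the Singer specification. The stream name, the sample rows, and the `last_id` bookmark are hypothetical and chosen only for illustration. A real tap would usually receive its saved state from a file rather than a function argument.

```python
import json
import sys

# Hypothetical stream name and JSON Schema, chosen only for illustration.
STREAM = "users"
SCHEMA = {
    "type": "object",
    "properties": {
        "id": {"type": "integer"},
        "name": {"type": "string"},
    },
}

# Hypothetical source rows; a real tap would read these from an API or database.
ROWS = [
    {"id": 1, "name": "Ada"},
    {"id": 2, "name": "Grace"},
    {"id": 3, "name": "Linus"},
]


def tap_messages(state=None):
    """Build the messages for one run, skipping rows at or below the saved id."""
    last_id = (state or {}).get("last_id", 0)
    messages = [
        {"type": "SCHEMA", "stream": STREAM, "schema": SCHEMA, "key_properties": ["id"]}
    ]
    for row in ROWS:
        if row["id"] > last_id:
            messages.append({"type": "RECORD", "stream": STREAM, "record": row})
            last_id = row["id"]
    # The STATE message carries the bookmark that the next run can resume from.
    messages.append({"type": "STATE", "value": {"last_id": last_id}})
    return messages


def write_messages(messages, out=sys.stdout):
    """Write one JSON document per line, the form a target reads from a pipe."""
    for message in messages:
        out.write(json.dumps(message) + "\n")


if __name__ == "__main__":
    write_messages(tap_messages())
```

Run as a script, this writes one JSON message per line to standard output, so it could be piped into any program that reads those lines. Calling `tap_messages({"last_id": 2})` emits only the row with id 3, which is the incremental behavior the entry refers to.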