Showing 254 open source projects for "data integration"

  • 1
    DocWire SDK

    Award-winning modern data processing SDK in C++20

    ...DocWire SDK aims to expand its capabilities, focusing on versatile data extraction, platform support, and seamless integration with various systems. DocWire SDK is dedicated to streamlining data processing, reducing development time and costs, and harnessing the potential of AI. Its advancements promise a superior experience compared to its predecessor, DocToText.
    Downloads: 0 This Week
  • 2

    GDS3D

    Interactive 3D Layout Viewer for GDSII

    GDS3D is a cross-platform, hardware-accelerated 3D viewer for chip layouts. Read standalone GDS files or use the Cadence plugin for easy integration with your Virtuoso environment. Developed by PhDs of the IC-Design Group, University of Twente, The Netherlands. The GDS3D project is also hosted on GitHub at https://github.com/icdut24/GDS3D, but it is currently not actively maintained by us. Forks can be found at https://github.com/trilomix/GDS3D and https://github.com/skuep/GDS3D
    Downloads: 13 This Week
  • 3
    Mara Pipelines

    A lightweight opinionated ETL framework, halfway between plain scripts

    This package contains a lightweight data transformation framework with a focus on transparency and complexity reduction. Data integration pipelines as code: pipelines, tasks and commands are created using declarative Python code. PostgreSQL as a data processing engine. Extensive web UI. The web browser as the main tool for inspecting, running and debugging pipelines. GNU make semantics.
    Downloads: 0 This Week
  • 4
    OSHMI - Open Substation HMI

    SCADA HMI for substations, IoT and automation applications

    Now with IEC61850 support! This project combines existing open source projects and tools to create a very capable, mobile and cloud-friendly HMI system that can rival proprietary software. This approach makes it possible to join forces of each project (Chromium, SVG/HTML5, PHP, Lua, SQLite, Inkscape, Lib61850, OpenDNP3, Nginx, Vega, PostgreSQL, Grafana,…) to achieve a great set of open, evergreen, modular and customizable tools for building great HMIs for automation projects. This is not...
    Downloads: 32 This Week
  • 5
    jluna

    Julia Wrapper for C++ with Focus on Safety, Elegance, and Ease of Use

    Julia is a beautiful language; it is well-designed and well-documented. Julia's C-API is also well-designed, less beautiful, and much less... documented. jluna aims to fully wrap the official Julia C-API, replacing it in projects with C++ as the host language, by making access to Julia's unique strengths through C++ safe, hassle-free, and just as beautiful. It allows for easy integration of Julia scripts and packages into projects with C++ as the host language, making language interaction...
    Downloads: 0 This Week
  • 6
    Mousetrap.jl

    Finally, a GUI Engine made for Julia

    Mousetrap is a GUI library designed for Julia. It fully wraps GTK4 (which is written in C), vastly simplifying its interface to improve ease of use without sacrificing flexibility. It aims to give developers of all skill levels the tools to start creating complex GUI applications with little time and effort while taking full advantage of Julia's idiosyncrasies.
    Downloads: 0 This Week
  • 7
    OctoBase

    OctoBase is the open-source database behind AFFiNE

    OctoBase is a local-first, peer-to-peer database designed to power collaborative apps with real-time syncing and offline-first capabilities. Developed by the team behind AFFiNE, OctoBase ensures that data is owned and controlled by the user, using CRDTs (Conflict-free Replicated Data Types) to resolve changes across peers without needing a central server. It supports encrypted, schema-flexible data storage and enables seamless syncing across devices while maintaining privacy. OctoBase is...
    Downloads: 0 This Week
  • 8
    PANDORA

    Revolutionizing Biomedical Research with Advanced Machine Learning

    PANDORA is a machine learning (ML) tool that can be used to integrate various data types, including clinical, transcriptome and microbiome data, and to find connections in large datasets. PANDORA can be easily installed using Docker; a pre-built version of the software can be pulled from Docker Hub. To run a test instance of PANDORA, users first need to prepare their local environment by downloading, installing, and configuring Docker. genular is a community behind SIMON an...
    Downloads: 3 This Week
  • 9
    NBi

    NBi is a testing framework (add-on to NUnit)

    NBi is a testing framework (add-on to NUnit) for Business Intelligence. It supports most relational databases (SQL Server, MySQL, PostgreSQL, ...) and OLAP platforms (Analysis Services, Mondrian, ...) as well as ETL and reporting components (Microsoft technologies). The main goal of this framework is to let users create tests with a declarative approach based on an XML syntax. With NBi, you don't need to develop C# code to specify your tests! Nor do you need Visual...
    Downloads: 0 This Week
  • 10
    DataVoyager.jl

    Julia wrapper for the Voyager data exploration tool

    This package provides Julia integration for the Voyager data exploration tool. DataVoyager.jl can be used for data exploration. It can help you visualize and understand any data that is in a tabular format.
    Downloads: 0 This Week
  • 11
    scArches

    Reference mapping for single-cell genomics

    Single-cell architecture surgery (scArches) is a package for reference-based analysis of single-cell data. scArches allows your single-cell query data to be analyzed by integrating it into a reference atlas. By mapping your data into an integrated reference, you can transfer cell-type annotations from reference to query, identify disease states by mapping to a healthy atlas, and pursue advanced applications such as imputing missing data modalities or spatial locations.
    Downloads: 0 This Week
  • 12
    Transducers.jl

    Efficient transducers for Julia

    Transducers are transformations of a "sequence" of inputs that can be composed very efficiently. The interface used by transducers naturally describes a wide range of processes that are expressible as a succession of steps. Furthermore, transducers can be defined without specifying the details of the input and output (collections, streams, channels, etc.) and therefore achieve full reusability. Transducers were introduced by Rich Hickey, the creator of the Clojure language. His Strange Loop...
    Downloads: 0 This Week
  • 13
    Swiple

    Swiple enables you to easily observe, understand, validate data

    Swiple is an automated data monitoring platform that helps analytics and data engineering teams seamlessly monitor the quality of their data. With automated data analysis and profiling, scheduling and alerting, teams can resolve data quality issues before they impact mission critical resources. Experience hassle-free integration with Swiple's zero-infrastructure and zero-code setup.
    Downloads: 0 This Week
  • 14
    Feathr

    A scalable, unified data and AI engineering platform for enterprise

    Feathr is a data and AI engineering platform that has been widely used in production at LinkedIn for many years and was open sourced in 2022. It is currently a project under the LF AI & Data Foundation. Define data and feature transformations based on raw data sources (batch and streaming) using Pythonic APIs. Register transformations by name and get transformed data (features) for various use cases including AI modeling, compliance, go-to-market and more. Share transformations and data (features)...
    Downloads: 0 This Week
  • 15
    BitSail

    BitSail is a distributed high-performance data integration engine

    BitSail is ByteDance's open source data integration engine, which is based on a distributed architecture and provides high performance. It supports data synchronization between multiple heterogeneous data sources, and provides global data integration solutions in batch, streaming, and incremental scenarios. At present, it serves almost all business lines in ByteDance, such as Douyin, Toutiao, etc., and synchronizes hundreds of trillions of data every day. ...
    Downloads: 0 This Week
  • 16
    SQLBucket

    Lightweight library to write, orchestrate and test your SQL ETL

    SQLBucket is a lightweight framework to help write, orchestrate and validate SQL data pipelines. It gives the possibility to set variables and introduces some control flow using the fantastic Jinja2 library. It also implements a very simplistic unit and integration test framework where you can validate the results of your ETL in the form of SQL checks. With SQLBucket, you can apply TDD principles when writing data pipelines.
    Downloads: 0 This Week
  • 17
    JuliaCall for Seamless Integration of R
    Package JuliaCall is an R interface to Julia, which is a high-level, high-performance dynamic programming language for numerical computing. Below is an image for Mandelbrot set. JuliaCall brings more than 100 times speedup of the calculation.
    Downloads: 0 This Week
  • 18

    incubator-seatunnel

    SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).

    Downloads: 0 This Week
  • 19
    Weave Scope

    Monitoring, visualization and management for Docker and Kubernetes

    Understand your application quickly by seeing it in a real-time interactive display. Pick open-source or cloud-hosted options. Weave Scope automatically detects processes, containers, hosts. No kernel modules, no agents, no special libraries, no coding. Seamless integration with Docker, Kubernetes, DCOS and AWS ECS. See your Docker hosts, containers and services in real-time. Easily identify and correct issues to ensure the stability and performance of your containerized applications. View...
    Downloads: 0 This Week
  • 20
    InterMine

    A powerful open source data warehouse system

    InterMine is an open-source data warehouse system tailored for the integration and analysis of complex biological data. It enables researchers to create databases from diverse data sources and provides sophisticated web query tools for data exploration.
    Downloads: 0 This Week
  • 21
    AugmentedGaussianProcesses.jl

    Gaussian Process package based on data augmentation, and sparsity

    AugmentedGaussianProcesses.jl is a Julia package in development for Data Augmented Sparse Gaussian Processes. It contains a collection of models for different gaussian and non-gaussian likelihoods, which are transformed via data augmentation into conditionally conjugate likelihood allowing for extremely fast inference via block coordinate updates. There are also more options to use more traditional variational inference via quadrature or Monte Carlo integration.
    Downloads: 0 This Week
  • 22
    Remotery

    Single C file, Realtime CPU/GPU Profiler with Remote Web Viewer

    Remotery is a real-time CPU/GPU profiler implemented as a single C file, providing developers with immediate insights into the performance of their applications. It features a remote web-based viewer that runs in browsers like Chrome, Firefox, and Safari, allowing for cross-platform performance analysis. Remotery supports profiling multiple threads and GPU contexts, offering a comprehensive view of an application's performance characteristics.
    Downloads: 0 This Week
  • 23
    Log4jScanner

    A log4j vulnerability filesystem scanner and Go package

    log4jscanner is a filesystem scanner and Go package that helps organizations quickly identify vulnerable Log4j components inside JARs and shaded dependencies. Instead of probing networks, it walks directories and archives, including nested JARs, to find version fingerprints and risky classes associated with the Log4Shell family of issues. The focus on static analysis makes it suitable for container images, build artifacts, and offline systems where active scanning isn’t feasible. Clear,...
    Downloads: 0 This Week
  • 24
    Starlight.jl

    A greedy game engine for greedy programmers

    Welcome to the documentation for Starlight.jl, a greedy application framework for greedy developers. Its primary use case is video games, but the power of Julia, SDL2, Vulkan, and the Bullet Physics SDK can be leveraged to make just about anything you want. With a focus on flexibility and code quality, Starlight aims to be such a framework. It includes a suite of components and integrations that make it particularly well-suited for video games, so it is not a stretch to call it a "game...
    Downloads: 0 This Week
  • 25
    Metl ETL Data Integration

    Simple message-based, web-based ETL integration

    Metl is a simple, web-based ETL tool that allows for data integrations including database, files, messaging, and web services. Supports RDBMS, SOAP, HTTP, FTP, SFTP, XML, FIXLEN, CSV, JSON, ZIP, and more. Metl implements scheduled integration tasks without the need for custom coding or heavy infrastructure. It can be deployed in the cloud or in an internal data center, and it was built to allow developers to extend it with custom components.
    Downloads: 0 This Week