Showing 133 open source projects for "data integration"

  • 1
    QSV

    Blazing-fast Data-Wrangling toolkit

    qsv is a fast, command-line CSV data toolkit written in Rust that extends the capabilities of xsv. It’s designed to make working with CSV files at scale easy and efficient, offering over 40 powerful subcommands for tasks like querying, sampling, splitting, deduplicating, and more. qsv is ideal for data engineers, analysts, and developers who need high-performance CSV manipulation on the command line.
    Downloads: 60 This Week
  • 2
    JSON for Modern C++

    JSON that's part of C++

    This is JSON for C++, a JSON library unlike any other that's packed with plenty of great features. While there may be dozens of JSON libraries out there, JSON for C++ stands out with a focus on three things: an intuitive syntax, trivial integration and serious testing. Using the operator magic of modern C++, this library makes JSON feel like a first-class data type. With trivial integration, the entire code is made up of a single header file json.hpp, with no dependencies and no complex build system required. It's been heavily unit-tested, covering 100% of the code, and follows the Core Infrastructure Initiative (CII) best practices to ensure the highest quality at all times. ...
    Downloads: 20 This Week
  • 3
    ScanAPI

    Automated Integration Testing and Live Documentation for your API

    Given an API specification written in YAML/JSON format, ScanAPI hits the specified endpoints, runs the test cases, and generates a detailed report of the execution, which can also be used as the API documentation itself. With almost no Python knowledge, the user can define the endpoints to hit and the expected behavior for each response, and receives a full real-time diagnostic report of the API.
    Downloads: 3 This Week
  • 4
    keycloak-config-cli

    Import YAML/JSON-formatted configuration files into Keycloak

    keycloak-config-cli is a Keycloak utility to ensure the desired configuration state for a realm based on a JSON/YAML file. The format of the JSON/YAML file is based on the export realm format. Store and handle the configuration files inside git just like normal code. A Keycloak restart isn't required to apply the configuration. The config files are based on the keycloak export files. You can use them to re-import your settings. But keep your files as small as possible. Remove all UUIDs and...
    Downloads: 130 This Week
  • 5
    lxml

    The lxml XML toolkit for Python

    A Python library for efficient XML and HTML processing, known for speed and compatibility. The lxml XML toolkit is a Pythonic binding for the C libraries libxml2 and libxslt. It is unique in that it combines the speed and XML feature completeness of these libraries with the simplicity of a native Python API, mostly compatible with, but superior to, the well-known ElementTree API. The latest release works with all CPython versions from 3.6 to 3.12. See the introduction for more information about the...
    Downloads: 20 This Week
  • 6
    PDFCraft

    PDFCraft is a free, privacy-focused PDF toolkit

    ...But beyond manual editing, it also offers a programmable layer so developers can write scripts to batch process documents, generate templated reports, or extract structured data from PDFs for integration in workflows. The design emphasizes quality and compatibility: output PDFs render accurately across readers, preserve metadata, and support interactive elements like hyperlinks and form fields.
    Downloads: 40 This Week
  • 7
    JSON Hero

    JSON Hero is an open-source, beautiful JSON explorer for the web

    JSON Hero is a beautiful and powerful JSON viewer designed for developers who work with large and complex JSON files. It runs as a web-based interface (and as a standalone app) that provides semantic, interactive rendering of JSON content, helping users understand the structure and meaning of data at a glance. JSON Hero automatically detects data types such as URLs, dates, colors, and base64 images, and presents them in meaningful ways. It’s designed for productivity and readability, with...
    Downloads: 7 This Week
  • 8
    py-pdf-parser

    A Python tool to help extract information from structured PDFs

    py-pdf-parser is a Python tool designed to help extract information from structured PDFs. It provides a simple interface to define parsing rules and extract data from PDF documents.
    Downloads: 1 This Week
  • 9
    League CSV

    CSV data manipulation made easy in PHP

    The PHP League CSV is a PHP library for reading, writing, and manipulating CSV files. It offers a straightforward API for handling common CSV operations, including parsing data, writing rows, and formatting output. The library is designed to handle large datasets efficiently, making it a reliable choice for data processing tasks in web applications.
    Downloads: 1 This Week
  • 10
    composer-normalize

    Provides a composer plugin for normalizing composer.json

    This package provides a composer plugin for normalizing composer.json.
    Downloads: 0 This Week
  • 11
    Nano PDF Editor

    Edit PDF files with Nano Banana

    Nano PDF Editor is a minimalist, portable PDF viewer and toolkit that focuses on simplicity, speed, and ease of integration for applications that need basic PDF rendering without heavy dependencies. It provides core functionality such as page navigation, zooming, text selection, and rendering directly to native graphics surfaces, making it suitable for lightweight PDF viewing scenarios on desktop or embedded platforms. Designed to be easily embedded into larger software projects, Nano-PDF...
    Downloads: 25 This Week
  • 12
    Apollo and GraphQL for Vue.js

    Apollo/GraphQL integration for VueJS

    This library integrates Apollo in your Vue components with declarative queries. Compatible with Vue 2.0+. Don't think about updating the UI or refetching the queries! It provides automatic updates. Use Apollo in a truly declarative way with the Apollo components. Run your queries on the server before rendering the page HTML. GraphQL is a specification that aims at easing the communication between frontends and backends. It mainly consists of a Schema Language for the server and a Query...
    Downloads: 0 This Week
  • 13
    flag-icon-css

    A collection of all country flags in SVG

    A collection of all country flags in SVG, plus the CSS for easier integration. You can either download the whole project as is or install it via Bower or NPM. Run the npm install to install the dependencies after cloning the project and you'll be able to watch for changes and live reload if served, to build *.less files, to serve it on localhost:8000, and to have only specific countries in the css file, for which you will need to remove the ones that you don't need from the...
    Downloads: 0 This Week
  • 14
    Unredact

    A simple tool for reading in poorly redacted documents

    Unredact is a specialized tool that attempts to reconstruct redacted or obscured text in images, PDFs, or screenshots using a combination of image processing and generative AI inference to suggest plausible completions of blurred, black-boxed, or jumbled content. Unlike traditional optical character recognition (OCR), which only reads visible text, Unredact focuses on inferring missing content where redaction has been applied by analyzing surrounding context, font characteristics, and...
    Downloads: 23 This Week
  • 15
    Memvid

    Video-based AI memory library. Store millions of text chunks in MP4

    Memvid encodes text chunks as QR codes within MP4 frames to build a portable “video memory” for AI systems. This innovative approach uses standard video containers and offers millisecond-level semantic search across large corpora with dramatically less storage than vector DBs. It's self-contained—no DB needed—and supports features like PDF indexing, chat integration, and cloud dashboards.
    Downloads: 5 This Week
  • 16
    Form Component

    Lets you easily create, process, and reuse HTML forms

    Symfony Form is a component that simplifies form creation and management in PHP applications. It provides a robust framework for building and handling forms, including form validation, data binding, and rendering. Symfony Form is highly flexible and integrates seamlessly with other Symfony components, making it a go-to choice for creating dynamic, data-driven forms in web applications.
    Downloads: 0 This Week
  • 17
    Rubick

    Electron-based open source toolbox with free integration of rich plug-ins

    An open-source toolbox based on Electron, with free integration of rich plug-ins, built to be the ultimate desktop efficiency tool. Rubick is one of the heroes of Dota, whose core skill is the ability to use other heroes' skills as plug-ins and move on once done. That closely matches the design concept of this tool, hence the name Rubick. Plug-in management is based on the npm package model, so installing plug-ins is as easy as installing npm packages. System plug-ins are supported: as long as rubick is running, plug-ins can be used...
    Downloads: 1 This Week
  • 18
    wikmd

    A file based wiki that uses markdown

    It’s a file-based wiki that aims for simplicity. Instead of storing the data in a database, I chose a file-based system. The advantage of this system is that every file is directly readable inside a terminal and similar tools. Also, when you have direct access to the system, you can export the files to anything you like.
    Downloads: 0 This Week
  • 19
    ZIO JSON

    Fast, secure JSON library with tight ZIO integration

    ZIO JSON is a fast and secure JSON library with tight ZIO integration. The goal of this project is to create the best all-round JSON library for Scala. Extreme performance is achieved by decoding JSON directly from the input source into business objects (docs/inspired by plokhotnyuk). Although not a requirement, the latest advances in Java Loom can be used to support arbitrarily large payloads with near-zero overhead. Best-in-class security is achieved with an aggressive early exit strategy...
    Downloads: 0 This Week
  • 20
    minted

    minted is a LaTeX package that provides syntax highlighting

    minted is a LaTeX package that enables advanced syntax highlighting of source code using the Pygments library. It supports customization via LaTeX and Python integration, allowing fine-grained control over code snippets in documents.
    Downloads: 0 This Week
  • 21
    EnvFile

    EnvFile 3.x is a plugin for JetBrains IDEs

    EnvFile is a plugin for JetBrains IDEs that allows you to set environment variables for your run configurations from one or multiple files. Not all run configurations available in IDEA-based IDEs are implemented similarly. Some of them differ significantly. In certain cases (so far, only Gradle has been confirmed) the implementation exposes interfaces to integrate the EnvFile UI but doesn't provide interfaces for it to actually do its work. Luckily, it was possible to make a few assumptions...
    Downloads: 4 This Week
  • 22
    Open Policy Agent

    An open source, general-purpose policy engine

    Policy-based control for cloud-native environments. Flexible, fine-grained control for administrators across the stack. Stop using a different policy language, policy model, and policy API for every product and service you use. Use OPA for a unified toolset and framework for policy across the cloud-native stack. Whether for one service or for all your services, use OPA to decouple policy from the service's code so you can release, analyze, and review policies (which security and compliance...
    Downloads: 4 This Week
  • 23
    DocArray

    The data structure for multimodal data

    DocArray is a library for nested, unstructured, multimodal data in transit, including text, image, audio, video, 3D mesh, etc. It allows deep-learning engineers to efficiently process, embed, search, recommend, store, and transfer multimodal data with a Pythonic API. Door to multimodal world: super-expressive data structure for representing complicated/mixed/nested text, image, video, audio, 3D mesh data. The foundation data structure of Jina, CLIP-as-service, DALL·E Flow, DiscoArt etc. Data...
    Downloads: 0 This Week
  • 24
    Airframe

    Essential Building Blocks for Scala

    Airframe is an essential building block for developing applications in Scala, including logging, object serialization using JSON or MessagePack, dependency injection, HTTP server/client with RPC support, functional testing with AirSpec, etc. Airframe RPC supports seamless integration of servers and clients using Scala as RPC interfaces. AirSpec is a simple unit testing framework for Scala and Scala.js. You can use public methods in your classes as test cases. There is no need to remember...
    Downloads: 0 This Week
  • 25
    HTMLProofer

    Test your rendered HTML files to make sure they're accurate.

    HTMLProofer is a set of tests to validate your HTML output. These tests check if your image references are legitimate, if they have alt tags, if your internal links are working, and so on. It's intended to be an all-in-one checker for your output. In scope for this project is any well-known and widely-used test for HTML document quality. A major use for this project is continuous integration -- so we must have reliable results. We usually favor correctness over performance. And, if...
    Downloads: 0 This Week