Search Results for "data integration" - Page 5

Showing 1658 open source projects for "data integration"

View related business solutions
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    AWS SDK for pandas

    AWS SDK for pandas

    Easy integration with Athena, Glue, Redshift, Timestream, Neptune

    aws-sdk-pandas (formerly AWS Data Wrangler) bridges pandas with the AWS analytics stack so DataFrames flow seamlessly to and from cloud services. With a few lines of code, you can read from and write to Amazon S3 in Parquet/CSV/JSON/ORC, register tables in the AWS Glue Data Catalog, and query with Amazon Athena directly into pandas. The library abstracts efficient patterns like partitioning, compression, and vectorized I/O so you get performant data lake operations without hand-rolling...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    LaReview

    LaReview

    The code review workbench

    ...Instead of overwhelming developers with raw diffs or automated comment spam, the tool analyzes code changes and generates an intent-driven review plan that groups changes into logical flows such as authentication, API behavior, or data handling, and prioritizes them based on risk. It operates as a desktop application with CLI integration, allowing users to launch reviews directly from their terminal while keeping all processing local to ensure security and prevent data leakage. The system presents reviews as hierarchical task trees, enabling developers to work through changes step by step, attach notes, and track progress across different review concerns.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    py-pdf-parser

    py-pdf-parser

    A Python tool to help extracting information from structured PDFs

    py-pdf-parser is a Python tool designed to help extract information from structured PDFs. It provides a simple interface to define parsing rules and extract data from PDF documents. ​
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    tidytext

    tidytext

    Text mining using tidy tools

    tidytext brings tidy data principles to text mining by converting text into a tidy data frame format. It provides tools for tokenization, sentiment analysis, n‑gram creation, and term‑document matrices, enabling interoperability with dplyr, ggplot2, and other tidyverse workflows.
    Downloads: 1 This Week
    Last Update:
    See Project
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 5
    NetBox

    NetBox

    The premiere source of truth powering network automation

    NetBox is the leading solution for modeling and documenting modern networks. By combining the traditional disciplines of IP address management (IPAM) and datacenter infrastructure management (DCIM) with powerful APIs and extensions, NetBox provides the ideal "source of truth" to power network automation. Available as open source software under the Apache 2.0 license, NetBox is employed by thousands of organizations around the world. Netbox is written in Python and uses the Django web...
    Downloads: 36 This Week
    Last Update:
    See Project
  • 6
    League CSV

    League CSV

    CSV data manipulation made easy in PHP

    The PHP League CSV is a PHP library for reading, writing, and manipulating CSV files. It offers a straightforward API for handling common CSV operations, including parsing data, writing rows, and formatting output. The library is designed to handle large datasets efficiently, making it a reliable choice for data processing tasks in web applications.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    MCP Atlassian

    MCP Atlassian

    MCP server that integrates Confluence and Jira

    The MCP Atlassian server integrates Atlassian products like Confluence and Jira with the Model Context Protocol. It supports both Cloud and Server/Data Center deployments, enabling AI models to interact with these platforms securely. ​
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Airtable MCP

    Airtable MCP

    Airtable integration for AI-powered applications

    Airtable MCP is an integration tool that enables AI-powered applications to access and manipulate Airtable databases directly from the IDE using Anthropic's Model Context Protocol (MCP). It allows querying, creating, updating, and deleting records using natural language, facilitating seamless data management. ​
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    VisPy

    VisPy

    Main repository for Vispy

    Vispy is an open-source, high-performance interactive visualization library in Python, designed for creating scientific visualizations and interactive plots. It leverages the power of modern Graphics Processing Units (GPUs) through OpenGL to render large datasets efficiently. Vispy supports a wide range of visualization types, including 2D plots, 3D visualizations, volume rendering, and more, making it suitable for scientific research, data analysis, and educational purposes.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 10
    SQLTools

    SQLTools

    Database management for VSCode

    VSCode-SQLTools is a Visual Studio Code extension that enhances database management and development. It provides a rich set of features for connecting to databases, executing queries, and managing data directly within the code editor.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 11
    InvertibleNetworks.jl

    InvertibleNetworks.jl

    A Julia framework for invertible neural networks

    Building blocks for invertible neural networks in the Julia programming language.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    cobalt

    cobalt

    Video and media downloader: Best way to save what you love

    ...The project is built with performance in mind, leveraging efficient backend processing to handle requests quickly and consistently. It also prioritizes user privacy by avoiding data collection and minimizing external dependencies. Cobalt is designed to be simple yet powerful, offering a streamlined interface while still supporting a wide range of media sources and formats. Its architecture allows for easy deployment and customization, making it suitable for both personal use and integration into larger systems.
    Downloads: 111 This Week
    Last Update:
    See Project
  • 13
    tracetest

    tracetest

    Build integration and end-to-end tests in minutes

    Tracetest is a trace-based testing tool for integration and end-to-end testing using OpenTelemetry traces. Verify end-to-end transactions and side effects across microservices & event-driven apps by using trace data as test specs. Cypress and Selenium are constrained by using the browser for testing. Tracetest bypasses this entirely by using your existing OpenTelemetry instrumentation and trace data to run tests and assertions against traces in every step of a request transaction.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Gladys Assistant

    Gladys Assistant

    A privacy-first, open-source home assistant

    Gladys Assistant is a privacy-first, open-source home assistant that integrates with various smart devices, allowing users to automate and control their home environment while ensuring data privacy.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    web-access

    web-access

    Skill for installing full networking capabilities for Claude Code

    web-access is a tool designed to give AI agents structured and controlled access to web content, enabling them to retrieve, navigate, and process information from online sources in real time. It abstracts common web interactions such as page loading, data extraction, and navigation into reusable functions that can be invoked by agents. The system emphasizes safety and control, likely including mechanisms to manage permissions, rate limits, and content filtering. This allows agents to operate within defined boundaries while still benefiting from dynamic, up-to-date information. The architecture supports integration with broader agent frameworks, making it a key component for building systems that require external knowledge. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Seurat

    Seurat

    R toolkit for single cell genomics

    Seurat is a comprehensive R toolkit for single-cell genomics analysis, introduced by the Satija Lab at NYGC. It supports quality control, normalization, clustering, integration of multimodal data (e.g., scRNA‑seq, spatial, CITE‑seq), and visualization. Seurat v5 introduces scalable workflows and spatial transcriptomics support, commonly used in academic and industry research for single-cell studies.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Stirling-PDF

    Stirling-PDF

    Web application that allows you to perform operations on PDF files

    Stirling PDF is a powerful, locally hosted web-based PDF manipulation tool offering a wide range of editing, conversion, and utility features. It allows users to merge, split, compress, convert, OCR, and perform other operations on PDF files directly from a browser without uploading data to third-party servers. The tool is privacy-conscious, self-hostable via Docker, and built with modularity in mind to allow future expansion and integration.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 18
    Spring AI

    Spring AI

    An Application Framework for AI Engineering

    ...It focuses on production readiness by offering features such as configuration management, observability, and integration with Spring Boot applications. Spring AI also includes support for retrieval-augmented generation, enabling applications to connect language models with structured and unstructured data sources. Its architecture encourages modular design, making it easier to extend or swap components without rewriting large parts of the system.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    Nano PDF Editor

    Nano PDF Editor

    Edit PDF files with Nano Banana

    Nano PDF Editor is a minimalist, portable PDF viewer and toolkit that focuses on simplicity, speed, and ease of integration for applications that need basic PDF rendering without heavy dependencies. It provides core functionality such as page navigation, zooming, text selection, and rendering directly to native graphics surfaces, making it suitable for lightweight PDF viewing scenarios on desktop or embedded platforms. Designed to be easily embedded into larger software projects, Nano-PDF...
    Downloads: 25 This Week
    Last Update:
    See Project
  • 20
    composer-normalize

    composer-normalize

    Provides a composer plugin for normalizing composer.json

    This package provides a composer plugin for normalizing composer.json.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Apache Bigtop

    Apache Bigtop

    Bigtop is an Apache Foundation project for Infrastructure Engineers

    ...It also includes a set of integration tests and smoke tests to ensure compatibility and stability between ecosystem components. Developers and operators can use Bigtop to assemble customized Hadoop distributions tailored to their infrastructure and workloads. Its focus on reproducibility and packaging reduces friction in deploying large-scale data processing systems and ensures that different components of the Hadoop ecosystem work well together.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Cloudberry

    Cloudberry

    One advanced and mature open-source MPP

    Apache Cloudberry is a distributed real-time analytics engine designed for querying massive social media datasets. It integrates with Apache AsterixDB and supports efficient ad-hoc queries and aggregations across large volumes of data. Cloudberry is especially useful for dashboards, trend analysis, and time-series social data exploration.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    AI-Trader

    AI-Trader

    100% Fully-Automated Agent-Native Trading

    AI-Trader is an open-source AI-powered quantitative trading framework designed to combine financial analysis, machine learning, and autonomous trading workflows into a unified research platform. The project integrates large language models, financial indicators, market analysis pipelines, and automated decision-making systems to support strategy generation and market prediction tasks. It is built to help researchers and developers experiment with AI-assisted trading strategies using...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 24
    Shlink

    Shlink

    The definitive self-hosted URL shortener

    ...It provides features for creating short, trackable URLs with detailed analytics on click statistics. Shlink is self-hosted, giving users control over their shortened links and data privacy. It supports custom domains, QR code generation, and integration with third-party services for advanced tracking and management.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    OpenClaw

    OpenClaw

    Your own personal AI assistant. Any OS. Any Platform.

    OpenClaw (formerly Clawdbot/Moltbot) is an open-source, self-hosted autonomous AI assistant designed to run on user-controlled hardware and bridge conversational natural language with real-world task execution, effectively acting as a proactive digital assistant rather than a reactive chatbot. It lets you send instructions through familiar messaging platforms like WhatsApp, Telegram, Discord, Slack, Signal, iMessage, and more, and then interprets those instructions to carry out actions such...
    Downloads: 213 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB