210 projects for "data integration" with 2 filters applied:

  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    Agentic Data Scientist

    Agentic Data Scientist

    An end-to-end Data Scientist

    Agentic Data Scientist is an experimental AI-driven research framework that orchestrates data science workflows through autonomous agents that can reason, plan, and execute complex analytics tasks. Unlike traditional scripted pipelines, this project lets AI agents break down high-level research goals into sub-tasks such as data acquisition, cleaning, modeling, evaluation, and reporting, with minimal human direction. Each agent is designed to independently call functions, interact with data...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Book6_First-Course-in-Data-Science

    Book6_First-Course-in-Data-Science

    From Addition, Subtraction, Multiplication, and Division to ML

    Book6_First-Course-in-Data-Science is an open-source educational project that serves as part of the “Iris Book” series focused on teaching data science and machine learning concepts through a combination of mathematics, programming, and visualization. The repository contains draft chapters, supporting Python code, and visual materials designed to guide readers from basic mathematical operations toward practical machine learning understanding. The goal of the project is to make complex topics...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    JimuReport

    JimuReport

    Open source drag-and-drop reporting and dashboard builder platform

    ...JimuReport supports traditional report generation, print templates, and modern dashboard visualizations for business intelligence scenarios. JimuReport also includes components for building interactive charts, data tables, and analytical displays that can be used in enterprise applications. It can connect to multiple data sources and retrieve data through SQL queries, APIs, or other structured formats. It can be embedded into Java applications using Spring Boot integration modules.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 4
    graphify

    graphify

    AI coding assistant skill (Claude Code, Codex, OpenCode, OpenClaw)

    graphify is a data visualization and transformation tool designed to convert structured or semi-structured data into graph-based representations, enabling better understanding of relationships and dependencies. It focuses on building visual models such as nodes and edges that represent entities and their connections, making complex datasets easier to interpret. The system likely supports dynamic updates, allowing graphs to evolve as data changes or new inputs are introduced. It is...
    Downloads: 11 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 5
    AI-Trader

    AI-Trader

    100% Fully-Automated Agent-Native Trading

    AI-Trader is an open-source AI-powered quantitative trading framework designed to combine financial analysis, machine learning, and autonomous trading workflows into a unified research platform. The project integrates large language models, financial indicators, market analysis pipelines, and automated decision-making systems to support strategy generation and market prediction tasks. It is built to help researchers and developers experiment with AI-assisted trading strategies using...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    AI-Crawler

    AI-Crawler

    Crawl a website starting from a URL, find relevant pages

    AI Crawler is an experimental AI-powered web crawling and data extraction tool that uses natural language prompts to guide the discovery and retrieval of relevant information across websites. Unlike traditional web scrapers that rely on static selectors and manual scripting, it uses AI to dynamically identify and prioritize pages based on user intent, making it more flexible and resilient to changes in website structure. Users can define their data requirements in plain English, and the...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    Grok CLI

    Grok CLI

    An open-source AI agent that brings the power of Grok

    Grok CLI is a command-line interface built around the Grok AI model that brings programmatic and conversational AI capabilities directly to developer terminals. It lets you run Grok queries from your shell, scripting environment, or automation workflows without switching to a browser, enabling utility in scripting, quick data exploration, code generation, and assistant-guided tasks directly where you write code. The CLI supports streaming responses, so outputs appear in real time as the Grok...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 8
    Casibase

    Casibase

    Open-source enterprise-level AI knowledge base and MCP

    ...It also supports integration with existing systems through database synchronization, allowing organizations to migrate data into the platform without major infrastructure changes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Humanizer Skill

    Humanizer Skill

    Claude Code skill that removes signs of AI-generated writing from text

    Humanizer Skill is a utility library focused on transforming technical or machine-oriented text into expressions that are more natural, readable, and “human-friendly.” It provides a suite of algorithms that convert timestamps, identifiers, file sizes, code tokens, and structured data into phrases that resemble typical human phrasing rather than compact machine output. For example, date and time values can be expressed as relative terms (“two hours ago”), and file sizes can be shown in...
    Downloads: 150 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 10
    Paperclip

    Paperclip

    Open-source orchestration for zero-human companies

    ...Instead of requiring separate APIs and authentication flows for each service, Paperclip provides unified search and retrieval capabilities that simplify integration into AI workflows.
    Downloads: 18 This Week
    Last Update:
    See Project
  • 11
    OpenPlanter

    OpenPlanter

    Language-model investigation agent with a terminal UI

    OpenPlanter is an open-source Python project focused on building an intelligent automated planting or gardening system powered by software control and data processing. The repository is designed to help developers and hobbyists create programmable plant management workflows that can monitor, schedule, and optimize growing conditions. It emphasizes automation and extensibility, allowing integration with sensors, environmental data, and control logic for smart cultivation setups. The system is structured to support experimentation and customization, making it suitable for both research and DIY agriculture projects. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    GeoAI

    GeoAI

    GeoAI: Artificial Intelligence for Geospatial Data

    GeoAI is a comprehensive open-source Python package designed to integrate artificial intelligence techniques with geospatial data analysis, enabling users to perform advanced geographic modeling and visualization tasks with ease. It provides a unified framework that combines machine learning libraries such as PyTorch and Transformers with geospatial tools, allowing users to process satellite imagery, aerial photos, and vector datasets in a streamlined workflow. The platform supports a wide...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 13
    ComfyUI-LTXVideo

    ComfyUI-LTXVideo

    LTX-Video Support for ComfyUI

    ComfyUI-LTXVideo is a bridge between ComfyUI’s node-based generative workflow environment and the LTX-Video multimedia processing framework, enabling creators to orchestrate complex video tasks within a visual graph paradigm. Instead of writing code to apply effects, transitions, edits, and data flows, users can assemble nodes that represent video inputs, transformations, and outputs, letting them prototype and automate video production pipelines visually. This integration empowers non-programmers and rapid-iteration teams to harness the performance of LTX-Video while maintaining the clarity and flexibility of a dataflow graph model. ...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 14
    C3

    C3

    The goal of CLAIMED is to enable low-code/no-code rapid prototyping

    C3 is an open-source framework designed to simplify the development and deployment of data science and machine learning workflows through reusable components and low-code development techniques. The framework focuses on enabling rapid prototyping while maintaining a path to production through automated CI/CD integration. CLAIMED provides a component-based architecture where data processing steps, models, and workflows can be packaged into reusable operators.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Superglue

    Superglue

    Builds integrations and tools from natural language

    Superglue is an AI-powered integration platform that enables developers to build production-grade tools and workflows using natural language, abstracting away the complexity of connecting APIs, databases, and external systems. It functions as a universal integration layer that allows users to define workflows in plain language, which are then translated into executable pipelines capable of interacting with multiple services.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Pixeltable

    Pixeltable

    Data Infrastructure providing an approach to multimodal AI workloads

    Pixeltable is an open-source Python data infrastructure framework designed to support the development of multimodal AI applications. The system provides a declarative interface for managing the entire lifecycle of AI data pipelines, including storage, transformation, indexing, retrieval, and orchestration of datasets. Unlike traditional architectures that require multiple tools such as databases, vector stores, and workflow orchestrators, Pixeltable unifies these functions within a...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    Agents Towards Production

    Agents Towards Production

    Code-first tutorials covering every layer of GenAI agents

    Agents Towards Production is an opinionated, code-first playbook for taking AI agents from prototype to production-ready systems. Instead of focusing only on toy examples, it dives into every layer of an agent stack: orchestration, memory, RAG, tool and API integration, security, observability, deployment, evaluation, and UI. The repository is built around runnable tutorials, each in its own folder, often sponsored by or built in collaboration with infrastructure providers like LangChain, Redis, Bright Data, Contextual AI, Tavily, Runpod, Portia, and others. These tutorials show how to implement things like secure tool calling with OAuth, dual-memory architectures, production RAG agents, multi-agent communication protocols, GPU deployment, containerization with Docker, FastAPI endpoints, and Streamlit chat UIs. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 18
    FIT Framework

    FIT Framework

    An enterprise-level AI development framework

    ...The system is built to be extensible, enabling integration with various machine learning libraries and tools, as well as customization for domain-specific tasks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Rill

    Rill

    Fast SQL-based BI tool for real-time dashboards and analytics

    Rill is an operational BI tool that turns raw datasets into fast, interactive dashboards using SQL and a code-first approach. It helps data teams move from data lake to insight quickly, without the complexity of traditional BI systems. With an embedded in-memory database powered by DuckDB or ClickHouse, queries run in milliseconds, enabling real-time exploration and analysis. Rill supports local and remote data sources such as CSV, Parquet, S3, and GCS, making it flexible across...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    nesa

    nesa

    Run AI models end-to-end encrypted

    nesa is an open-source initiative focused on building decentralized AI infrastructure that enables secure, verifiable, and privacy-preserving machine learning and inference across distributed environments. The project aims to address key challenges in modern AI systems, such as data privacy, trust, and centralization, by leveraging cryptographic techniques and decentralized architectures. NESA allows developers to run AI computations in a way that ensures data integrity and confidentiality, making it particularly relevant for applications involving sensitive or regulated data. It integrates mechanisms for verifiable computation, enabling users to confirm that AI outputs were generated correctly without exposing underlying data or models. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Apache Hamilton

    Apache Hamilton

    Helps data scientists define testable self-documenting dataflows

    Apache Hamilton is an open-source Python framework designed to simplify the creation and management of dataflows used in analytics, machine learning pipelines, and data engineering workflows. The framework enables developers to define data transformations as simple Python functions, where each function represents a node in a dataflow graph and its parameters define dependencies on other nodes. Hamilton automatically analyzes these functions and constructs a directed acyclic graph...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    SAM 3

    SAM 3

    Code for running inference and finetuning with SAM 3 model

    SAM 3 (Segment Anything Model 3) is a unified foundation model for promptable segmentation in both images and videos, capable of detecting, segmenting, and tracking objects. It accepts both text prompts (open-vocabulary concepts like “red car” or “goalkeeper in white”) and visual prompts (points, boxes, masks) and returns high-quality masks, boxes, and scores for the requested concepts. Compared with SAM 2, SAM 3 introduces the ability to exhaustively segment all instances of an...
    Downloads: 39 This Week
    Last Update:
    See Project
  • 23
    NVIDIA Earth2Studio

    NVIDIA Earth2Studio

    Open-source deep-learning framework

    NVIDIA Earth2Studio is an open-source Python package and framework designed to accelerate the development and deployment of AI-driven weather and climate science workflows. It provides a unified API that lets researchers, data scientists, and engineers build complex forecasting and analysis pipelines by combining modular prognostic and diagnostic AI models with a diverse range of real-world data sources such as global forecast systems, reanalysis datasets, and satellite feeds. The toolkit...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 24
    Wiseflow

    Wiseflow

    Enhance any agent's browser use skill

    Wiseflow is an open-source information extraction and knowledge discovery system designed to collect, filter, and organize valuable information from large volumes of online content. The platform continuously monitors specified sources such as websites, social platforms, and other digital channels to identify relevant data according to user-defined interests or topics. By combining web crawling, content parsing, and large language model analysis, the system extracts concise insights from raw...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    Bespoke Curator

    Bespoke Curator

    Synthetic data curation for post-training and data extraction

    Curator is an open-source Python library designed to build synthetic data pipelines for training and evaluating machine learning models, particularly large language models. The system helps developers generate, transform, and curate high-quality datasets by combining automated generation with structured validation and filtering. It supports workflows where models are used to produce synthetic examples that can later be refined into reliable training datasets for reasoning, question...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB