Showing 396 open source projects for "data integration"

View related business solutions
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 1
    Browserbase Skills

    Browserbase Skills

    Claude Agent SDK with a web browsing tool

    Browserbase Skills is a collection of reusable automation “skills” designed to enable AI agents to interact with web environments programmatically. It provides structured workflows that abstract browser actions such as navigation, form filling, and data extraction into composable building blocks. The system is intended to simplify the development of browser-based agents by offering prebuilt capabilities that can be orchestrated together. It integrates with headless browser infrastructure,...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    Accomplish

    Accomplish

    Accomplish is the open source Al coworker that lives on your desktop

    Accomplish is an open-source AI desktop agent that automates everyday computer tasks directly on a user’s machine. It can handle file management, document creation, and browser-based workflows through natural language instructions. The system runs locally, ensuring that user data remains private and under full control. It supports integration with multiple AI providers or local models, giving users flexibility in how intelligence is powered. Accomplish emphasizes autonomy, allowing it to execute multi-step tasks without constant supervision. Its design focuses on replacing repetitive manual workflows with intelligent automation. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    Monkey Code

    Monkey Code

    Enterprise-grade AI programming assistant designed for R&D collab

    Monkey Code is an enterprise-grade AI programming assistant designed to transform how development teams collaborate, build, and manage code across complex environments. It goes beyond traditional AI coding tools by combining intelligent code generation, conversational programming, and automated DevOps-style workflows into a unified platform that integrates directly with Git-based repositories. One of its defining characteristics is its support for private deployment and fully offline...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    Fli

    Fli

    Google Flights MCP and Python Library

    Fli is a powerful Python library and command-line tool that provides direct programmatic access to Google Flights data through reverse-engineered API interactions rather than traditional web scraping. This approach enables faster, more reliable, and more stable access to flight information, avoiding the fragility associated with HTML parsing and UI changes. The library supports a wide range of flight search capabilities, including filtering by airline, departure time, number of stops, cabin...
    Downloads: 1 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    AntV Infographic

    AntV Infographic

    Declarative engine for generating AI-powered infographic visuals

    AntV Infographic is a declarative infographic generation and rendering framework designed to transform structured data into visually rich infographic outputs. It provides a custom domain-specific language that allows developers and AI systems to describe infographic layouts in a concise and human-readable syntax. It focuses on simplifying data storytelling by enabling fast creation of professional-quality visuals without requiring complex design workflows.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    Index

    Index

    The SOTA Open-Source Browser Agent

    Index is an open-source browser automation agent designed to autonomously perform complex tasks across websites by transforming web interfaces into programmable APIs. The system enables developers to instruct an AI agent to interact with web pages using natural language rather than traditional automation scripts. Instead of writing detailed browser automation code, users can describe the desired task and allow the agent to interpret the page structure, interact with elements, and complete...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    pgai

    pgai

    A suite of tools to develop RAG, semantic search, and other AI apps

    pgai is a suite of PostgreSQL extensions developed by Timescale to empower developers in building AI applications directly within their databases. It integrates tools for vector storage, advanced indexing, and AI model interactions, facilitating the development of applications like semantic search and Retrieval-Augmented Generation (RAG) without leaving the SQL environment.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Sublayer

    Sublayer

    A model-agnostic Ruby Generative AI DSL and framework

    Sublayer is a platform that enables developers to build and deploy machine learning models with ease, focusing on simplifying the ML lifecycle from development to production.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    CML

    CML

    Continuous Machine Learning | CI/CD for ML

    Continuous Machine Learning (CML) is an open-source CLI tool for implementing continuous integration & delivery (CI/CD) with a focus on MLOps. Use it to automate development workflows, including machine provisioning, model training and evaluation, comparing ML experiments across project history, and monitoring changing datasets. CML can help train and evaluate models, and then generate a visual report with results and metrics, automatically on every pull request.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 10
    Robyn

    Robyn

    Experimental, AI/ML-powered and open sourced Marketing Mix Modeling

    Robyn is an open-source, AI/ML-powered Marketing Mix Modeling (MMM) toolkit developed by Meta Marketing Science under the “facebookexperimental” GitHub umbrella. Its goal is to democratize rigorous MMM: what traditionally required expert statisticians and expensive consulting becomes accessible to any company with data. Robyn takes in historical data (spends on different marketing channels, conversions, or revenue, and optional context or organic-media variables) and uses a combination of...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    MLOps Zoomcamp

    MLOps Zoomcamp

    Free MLOps course from DataTalks.Club

    MLOps Zoomcamp is an open-source educational repository that contains the materials for a free course focused on machine learning operations and production machine learning systems. The course is designed to teach data scientists and engineers how to move machine learning models from experimentation environments into scalable production services. The repository provides lessons, code examples, and assignments that cover the entire MLOps lifecycle, including model training, experiment...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    stt

    stt

    Voice Recognition to Text Tool

    stt is a standalone speech recognition tool that locally converts spoken content in audio or video files into textual formats without requiring internet access, giving users control over their data and reducing reliance on external APIs. It leverages open-source speech models such as Faster-Whisper to recognize and transcribe human speech into plain text, structured JSON objects, or subtitle files with time codes, making it suitable for both personal and professional transcription tasks. The...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    Vibe-Trading

    Vibe-Trading

    Vibe-Trading: Your Personal Trading Agent

    Vibe-Trading is an AI-powered multi-agent financial workspace that converts natural language inputs into executable trading strategies and market analysis. It allows users to describe investment ideas in plain language, which are then translated into code, backtested, and evaluated across global markets. The platform integrates multiple data sources, including equities, crypto, and derivatives, with automatic fallback mechanisms. It features a swarm-based architecture with prebuilt expert...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    SEO Machine

    SEO Machine

    A specialized Claude Code workspace for creating long-form

    SEO Machine is an AI-powered content production system built as a structured workspace for generating long-form, SEO-optimized blog content through automated workflows. It integrates research, writing, analysis, and optimization into a single pipeline, allowing users to produce high-quality articles tailored to search engine performance. The system uses specialized commands and agents to perform tasks such as keyword research, competitor analysis, content drafting, and optimization. It...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    pi-autoresearch

    pi-autoresearch

    Autonomous experiment loop extension for pi

    pi-autoresearch is an automation-oriented research assistant project that focuses on orchestrating iterative information gathering, analysis, and synthesis workflows with minimal human intervention. It is designed to simulate a continuous research loop where queries are generated, refined, and expanded based on previous outputs, enabling deeper exploration of complex topics. The system likely integrates with external data sources or APIs to retrieve information and process it into structured...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    ChatGPT Exporter

    ChatGPT Exporter

    Export and Share your ChatGPT conversation history

    ChatGPT Exporter is a browser-based userscript tool designed to export ChatGPT conversations into multiple structured and shareable formats, enabling users to preserve, analyze, and reuse AI-generated content outside the ChatGPT interface. It integrates directly into the ChatGPT web environment, typically via tools like Tampermonkey, and adds export functionality without requiring backend services or complex setup. The tool supports a wide range of output formats including plain text, HTML,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Pydantic Logfire

    Pydantic Logfire

    Python observability platform for tracing apps, metrics, and logs

    Pydantic Logfire is an observability platform designed to help developers monitor, analyze, and understand the behavior of their applications in real time. It is built by the team behind Pydantic and follows a philosophy of combining powerful capabilities with ease of use, making it accessible to entire engineering teams. Pydantic Logfire provides deep visibility into application performance by capturing traces, metrics, and logs through an OpenTelemetry-based architecture. It is...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    docext

    docext

    An on-premises, OCR-free unstructured data extraction

    docext is a document intelligence toolkit that uses vision-language models to extract structured information from documents such as PDFs, forms, and scanned images. The system is designed to operate entirely on-premises, allowing organizations to process sensitive documents without relying on external cloud services. Unlike traditional document processing pipelines that rely heavily on optical character recognition, docext leverages multimodal AI models capable of understanding both visual...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    skfolio

    skfolio

    Python library for portfolio optimization built on top of scikit-learn

    skfolio is a Python library designed for portfolio optimization and financial risk management that integrates closely with the scikit-learn ecosystem. The project provides a unified machine learning-style framework for building, validating, and comparing portfolio allocation strategies using financial data. By following the familiar scikit-learn API design, the library allows quantitative researchers and developers to apply techniques such as model selection, cross-validation, and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    MiroThinker

    MiroThinker

    MiroThinker is an open source deep research agent

    MiroThinker is an open-source deep research AI agent designed to perform complex reasoning, information gathering, and predictive analysis tasks. The system focuses on enabling long-horizon research workflows by allowing the agent to interact repeatedly with external tools, search systems, and data sources while refining its reasoning through iterative steps. Rather than simply generating responses from a single prompt, the agent performs structured multi-step reasoning processes that...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Nanocoder

    Nanocoder

    A beautiful local-first coding agent running in your terminal

    ...Its architecture emphasizes extensibility through custom commands and integration with Model Context Protocol servers that allow the AI agent to access additional tools.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    LangServe

    LangServe

    Helps developers deploy LangChain runnables and chains as a REST API

    LangServe is an open-source deployment framework designed to expose LangChain applications as production-ready REST APIs. The tool simplifies the process of turning language-model pipelines, chains, and agents into web services that can be accessed by external applications. Instead of manually writing API endpoints, developers can use LangServe to automatically generate a server that exposes LangChain workflows through HTTP interfaces. The framework is built on top of FastAPI and uses...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Engram

    Engram

    A New Axis of Sparsity for Large Language Models

    Engram is a high-performance embedding and similarity search library focused on making retrieval-augmented workflows efficient, scalable, and easy to adopt by developers building search, recommendation, or semantic matching systems. It provides utilities to generate embeddings from text or other structured data, index them using efficient approximate nearest neighbor algorithms, and perform real-time similarity queries even on large corpora. Engineered with speed and memory efficiency in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Elastiknn

    Elastiknn

    Elasticsearch plugin for nearest neighbor search

    Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity searches using exact and approximate algorithms. Methods like word2vec and convolutional neural nets can convert many data modalities (text, images, users, items, etc.) into numerical vectors, such that pairwise distance computations on the vectors correspond to semantic similarity of the original data. Elasticsearch is a ubiquitous search solution, but its support for vectors is limited. This plugin fills the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Label Sleuth

    Label Sleuth

    Open source no-code system for text annotation and building of text

    ...Label Sleuth has been designed with an extensible architecture allowing the easy integration of new components, such as additional model architectures or active learning techniques.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB