Search Results for "data integration" - Page 2

Showing 1658 open source projects for "data integration"

1

RStudio

RStudio is an integrated development environment (IDE) for R

RStudio is a powerful, full-featured integrated development environment (IDE) tailored primarily for the R programming language but increasingly supportive of other languages like Python and Julia. It brings together console, editor, plotting, workspace, history, and file-management panes into a unified interface, helping data scientists, statisticians, and analysts to work more productively. The IDE is cross-platform: there are desktop versions for Windows, macOS and Linux, as well as a server version for remote or multi-user deployment via a web browser. In addition to code editing and execution, RStudio offers extensive support for reproducible research via R Markdown, notebooks, and integration with version control systems like Git and SVN. ...

Downloads: 81 This Week

Last Update: 2026-05-18
2

Canal

MySQL binlog

Canal is an open-source project developed by Alibaba that simulates MySQL slave functionality to parse MySQL binlog files. It enables real-time data synchronization and change data capture (CDC) between MySQL and other systems such as Elasticsearch, Kafka, or HBase. Canal is widely used for data integration, replication, and monitoring across distributed systems, offering high performance and low-latency log parsing.

Downloads: 4 This Week

Last Update: 2025-07-18
3

reticulate

R Interface to Python

reticulate is an R package from Posit that creates seamless interoperability between R and Python. It lets you call Python modules, classes, and functions from within R, automatically translating between R and Python data structures. Useful for combining Python tooling with R projects, data analysis, and RMarkdown reports.

Downloads: 0 This Week

Last Update: 2026-04-07
4

CellTypist

A tool for semi-automatic cell type classification, harmonization

CellTypist is an automated tool for cell type classification, harmonization, and integration. Classification: transfers cell type labels from the reference to the query dataset. Harmonization: matches and harmonizes cell types defined by independent datasets. Integration: integrates cells and cell types with supervision from harmonization. CellTypist recapitulates cell type structure and biology of independent datasets. Regularised linear models with Stochastic Gradient Descent provide a fast and...

Downloads: 0 This Week

Last Update: 2025-06-25
5

SeedCrackerX

Minecraft mod designed to reverse-engineer

SeedCrackerX is a Minecraft mod designed to reverse-engineer and determine a world’s seed by analyzing in-game structures and environmental data. It operates by collecting information from structures such as shipwrecks, temples, and monuments, then using that data to progressively narrow down possible seeds until the correct one is identified. The mod automates much of this process, initiating cracking procedures once sufficient data has been gathered, often requiring only exploration of...

Downloads: 237 This Week

Last Update: 2026-03-20
6

SymbolicNumericIntegration.jl

SymbolicNumericIntegration.jl: Symbolic-Numerics for Solving Integrals

SymbolicNumericIntegration.jl is a hybrid symbolic/numerical integration package that works on Julia Symbolics expressions.

Downloads: 0 This Week

Last Update: 2026-01-13
7

Recap

Recap tracks and transforms schemas across your whole application

Recap is a schema language and multi-language toolkit to track and transform schemas across your whole application. Your data passes through web services, databases, message brokers, and object stores. Recap describes these schemas in a single language, regardless of which system your data passes through. Recap schemas can be defined in YAML, TOML, JSON, XML, or any other compatible language.

Downloads: 2 This Week

Last Update: 2025-12-30
8

Apache Hudi

Upserts, Deletes And Incremental Processing on Big Data

Apache Hudi (pronounced Hoodie) stands for Hadoop Upserts Deletes and Incrementals. Hudi manages the storage of large analytical datasets on DFS (Cloud stores, HDFS or any Hadoop FileSystem compatible storage). Apache Hudi is a transactional data lake platform that brings database and data warehouse capabilities to the data lake. Hudi reimagines slow old-school batch data processing with a powerful new incremental processing framework for low latency minute-level analytics. Hudi provides...

Downloads: 6 This Week

Last Update: 4 days ago
9

Apache InLong

Apache InLong - a one-stop integration framework for massive data

Apache InLong is a one-stop integration framework for massive data that provides automatic, secure and reliable data transmission capabilities. InLong supports batch and stream data processing at the same time, which makes it well suited to building data analysis, modeling and other real-time applications on streaming data. InLong (应龙) is a divine beast in Chinese mythology who guides rivers into the sea, and the name serves as a metaphor for how the InLong system reports data streams. ...

Downloads: 0 This Week

Last Update: 2025-11-13
10

Alova.js

Workflow-Streamlined next-generation request tools

Alova streamlines the API integration workflow. Quickly find APIs in the editor and get full type hints, even in JS projects, with API code generated automatically by Alova's extension. Handle requests in a variety of complex scenarios with one line of code. Alova automatically manages paging data and data preloading and reduces unnecessary data refreshes; the project claims this improves fluency by 300% and reduces coding difficulty by 50%.

Downloads: 0 This Week

Last Update: 2026-03-09
11

LiteParse

A fast, helpful, and open-source document parser

...LiteParse supports integration with multiple language models, allowing developers to choose the best balance between accuracy and efficiency. It also includes mechanisms for validation and error handling, ensuring that outputs conform to expected schemas and reducing the need for manual postprocessing. The library is particularly useful for tasks such as data extraction, document processing, and building pipelines that require structured outputs from natural language input.

Downloads: 6 This Week

Last Update: 1 day ago
12

DeerFlow

Deep Research framework, combining language models with tools

...It supports asynchronous task coordination, modular tool integration, and orchestrates the data flow between agents — making it suitable for large-scale or multi-stage research pipelines. Users can deploy it locally or on server infrastructure, integrate custom tools, and benefit from its flexible configuration.

Downloads: 62 This Week

Last Update: 2026-05-18
13

Dagster

An orchestration platform for the development, production

Dagster is an orchestration platform for the development, production, and observation of data assets. Dagster as a productivity platform: With Dagster, you can focus on running tasks, or you can identify the key assets you need to create using a declarative approach. Embrace CI/CD best practices from the get-go: build reusable components, spot data quality issues, and flag bugs early. Dagster as a robust orchestration engine: Put your pipelines into production with a robust...

Downloads: 9 This Week

Last Update: 5 days ago
14

DBeaver

Free universal database tool

DBeaver is a free, multi-platform database tool that supports any database having a JDBC driver. It is useful for developers, SQL programmers, database administrators and analysts. DBeaver comes with plenty of great features such as metadata and SQL editors, ERD, data export/import/migration and more. Plugins are available for certain databases, and there are also several database management utilities. DBeaver’s Enterprise Edition provides even more features and supports non-JDBC datasources.

1 Review

Downloads: 201 This Week

Last Update: 2026-05-18
15

ScanAPI

Automated Integration Testing and Live Documentation for your API

Given an API specification written in YAML/JSON format, ScanAPI hits the specified endpoints, runs the test cases, and generates a detailed report of the execution, which can also be used as the API documentation itself. With almost no Python knowledge, the user can define the endpoints to hit and the expected behavior for each response, and will receive a full real-time diagnostic report of the API.

Downloads: 3 This Week

Last Update: 2026-05-18
16

JSON for Modern C++

JSON that's part of C++

This is JSON for C++, a JSON library unlike any other that's packed with plenty of great features. While there may be dozens of JSON libraries out there, JSON for C++ stands out with a focus on three things: an intuitive syntax, trivial integration and serious testing. Using the operator magic of modern C++, this library makes JSON feel like a first class data type. With trivial integration, the entire code is made up of a single header file json.hpp, no dependencies, no complex build system required. It's been heavily unit-tested covering 100% of the code, and follows the Core Infrastructure Initiative (CII) best practices to ensure the highest quality at all times. ...

Downloads: 20 This Week

Last Update: 2025-04-11
17

Positron

Positron, a next-generation data science IDE

...The IDE supports notebook and script workflows, integration of data-app frameworks (such as Shiny, Streamlit, Dash), database and cloud connections, and built-in AI-assisted capabilities to help write code, explore data, and build models.

Downloads: 0 This Week

Last Update: 2026-05-13
18

Cassandra Spark Connector

Apache Spark to Apache Cassandra connector

The Apache Cassandra Spark Connector allows Spark jobs (RDDs or DataFrames/Datasets) to read from and write to Cassandra tables. Compatible with Apache Cassandra (v2.1+), Spark 1.0–3.5, and Scala 2.11–2.13, it supports mapping Cassandra rows to Scala case classes, saving results back to Cassandra, and executing arbitrary CQL within Spark applications.

Downloads: 0 This Week

Last Update: 2025-08-04
19

Jan.ai

Open source alternative to ChatGPT that runs 100% offline

Jan.ai is an open-source, privacy-focused AI assistant that serves as an alternative to ChatGPT, running completely locally on your device. It allows you to download and run LLMs (large language models) offline while also offering optional integration with cloud-based model providers, giving you full control over your data and AI interactions. Download and run LLMs (Llama, Gemma, Qwen, GPT-oss etc.) from HuggingFace. Connect to GPT models via OpenAI, Claude models via Anthropic, Mistral, Groq, and others. Create specialized AI assistants for your tasks. MCP integration for agentic capabilities.

Downloads: 42 This Week

Last Update: 6 days ago
20

Suna

Suna - Open Source Generalist AI Agent

...Designed to assist users in accomplishing real-world tasks through natural conversation, Suna combines powerful capabilities with an intuitive interface. It serves as a digital companion for research, data analysis, and everyday challenges, integrating tools like browser automation, file management, web crawling, command-line execution, website deployment, and API integration. Suna's architecture comprises a FastAPI-based backend, a Next.js/React frontend, an agent Docker environment, and a Supabase database for state management. This modular design allows for seamless interaction and task execution through simple conversations.

Downloads: 5 This Week

Last Update: 2026-04-17
21

LakeSoul

An end-to-end, realtime and cloud native Lakehouse framework

LakeSoul is a high-performance, unified table storage framework for big data lakes, supporting both streaming and batch data in a single format. Built on top of Apache Spark and leveraging Apache Arrow and Parquet, LakeSoul provides ACID transactions, schema evolution, and time travel. It is designed for large-scale data lake architectures that require consistency, efficiency, and easy integration with modern data stacks.

Downloads: 0 This Week

Last Update: 2025-09-26
22

OpenDataMCP

Connect any Open Data to any LLM with Model Context Protocol

An initiative aimed at connecting open datasets to Large Language Models (LLMs) using the Model Context Protocol, facilitating seamless access and integration of public data into AI applications.

Downloads: 0 This Week

Last Update: 2025-04-07
23

Calculus.jl

Calculus functions in Julia

The Calculus package provides tools for working with the basic calculus operations of differentiation and integration. You can use the Calculus package to produce approximate derivatives by several forms of finite differencing or to produce exact derivatives using symbolic differentiation. You can also compute definite integrals by different numerical methods.

Downloads: 0 This Week

Last Update: 2024-10-20
24

Nelmio Alice

Expressive fixtures generator

Nelmio Alice is a PHP library designed to generate complex data fixtures for testing and development environments. It uses YAML, XML, or PHP files to define fixture templates, making it easy to create realistic and varied data sets. Alice integrates well with Doctrine ORM, allowing developers to quickly populate databases with test data, making it especially useful for automated testing and staging environments.

Downloads: 0 This Week

Last Update: 2026-02-24
25

keycloak-config-cli

Import YAML/JSON-formatted configuration files into Keycloak

keycloak-config-cli is a Keycloak utility to ensure the desired configuration state for a realm based on a JSON/YAML file. The format of the JSON/YAML file is based on the export realm format. Store and handle the configuration files inside git just like normal code. A Keycloak restart isn't required to apply the configuration. The config files are based on the keycloak export files. You can use them to re-import your settings. But keep your files as small as possible. Remove all UUIDs and...

Downloads: 130 This Week

Last Update: 5 days ago


© 2026 Slashdot Media. All Rights Reserved.
