Search Results for "open source jigsaw" - Page 2

Showing 500 open source projects for "open source jigsaw"

1

AI Data Science Team

An AI-powered data science team of agents

AI Data Science Team is a Python library and agent ecosystem designed to accelerate and automate common data science workflows by modeling them as specialized AI “agents” that can be orchestrated to perform tasks like data cleaning, transformation, analysis, visualization, and machine learning. It provides a modular agent framework where each agent focuses on a step in the typical data science pipeline — for example, loading data from CSV/Excel files, cleaning and wrangling messy datasets,...

Downloads: 1 This Week

Last Update: 2026-01-26
See Project
2

Elementary

Open-source data observability for analytics engineers

Elementary is an open-source data observability solution for data & analytics engineers. Monitor your dbt project and data in minutes, and be the first to know of data issues. Gain immediate visibility, detect data issues, send actionable alerts, and understand the impact and root cause. Generate a data observability report, host it or share with your team.

Downloads: 0 This Week

Last Update: 2026-01-29
See Project
3

JILL.py

A cross-platform installer for the Julia programming language

The enhanced Python fork of JILL, Julia Installer for Linux (and every other platform), Light.

Downloads: 0 This Week

Last Update: 2025-06-07
See Project
4

PySR

High-Performance Symbolic Regression in Python and Julia

PySR is an open-source tool for Symbolic Regression: a machine learning task where the goal is to find an interpretable symbolic expression that optimizes some objective. Over a period of several years, PySR has been engineered from the ground up to be (1) as high-performance as possible, (2) as configurable as possible, and (3) easy to use. PySR is developed alongside the Julia library SymbolicRegression.jl, which forms the powerful search engine of PySR.

Downloads: 0 This Week

Last Update: 2025-07-15
See Project
5

ClearML

Streamline your ML workflow

...The ClearML Server stores experiment, model, and workflow data and supports the Web UI experiment manager and ML-Ops automation for reproducibility and tuning. It is available as a hosted service and as open source for you to deploy your own ClearML Server. The ClearML Agent provides ML-Ops orchestration, experiment and workflow reproducibility, and scalability.

Downloads: 0 This Week

Last Update: 2026-01-25
See Project
6

whylogs

The open standard for data logging

whylogs is an open-source library for logging any kind of data. With whylogs, users can generate summaries of their datasets (called whylogs profiles), which they can use to track changes in their datasets, create data constraints to know whether their data looks the way it should, and quickly visualize key summary statistics about their datasets. whylogs profiles are the core of the whylogs library.

Downloads: 0 This Week

Last Update: 2024-12-03
See Project
7

Lithops

A multi-cloud framework for big data analytics

Lithops is an open-source serverless computing framework that enables transparent execution of Python functions across multiple cloud providers and on-prem infrastructure. It abstracts cloud providers like IBM Cloud, AWS, Azure, and Google Cloud into a unified interface and turns your Python functions into scalable, event-driven workloads. Lithops is ideal for data processing, ML inference, and embarrassingly parallel workloads, giving you the power of FaaS (Function-as-a-Service) without vendor lock-in. ...

Downloads: 0 This Week

Last Update: 2026-02-01
See Project
8

Fondant

Production-ready data processing made easy and shareable

Fondant is a modular, pipeline-based framework designed to simplify the preparation of large-scale datasets for training machine learning models, especially foundation models. It offers an end-to-end system for ingesting raw data, applying transformations, filtering, and formatting outputs—all while remaining scalable and traceable. Fondant is designed with reproducibility in mind and supports containerized steps using Docker, making it easy to share and reuse data processing components....

Downloads: 0 This Week

Last Update: 2025-04-08
See Project
9

Dask

Parallel computing with task scheduling

Dask is a Python library for parallel and distributed computing, designed to scale analytics workloads from single machines to large clusters. It integrates with familiar tools like NumPy, Pandas, and scikit-learn while enabling execution across cores or nodes with minimal code changes. Dask excels at handling large datasets that don’t fit into memory and is widely used in data science, machine learning, and big data pipelines.

Downloads: 0 This Week

Last Update: 2026-01-30
See Project
10

Pathway

Python ETL framework for stream processing, real-time analytics, LLM

Pathway is an open-source framework designed for building real-time data applications using reactive and declarative paradigms. It enables seamless integration of live data streams and structured data into analytical pipelines with minimal latency. Pathway is especially well-suited for scenarios like financial analytics, IoT, fraud detection, and logistics, where high-velocity and continuously changing data is the norm.

Downloads: 0 This Week

Last Update: 2026-02-16
See Project
11

pydna

Clone with Python! Data structures for double stranded DNA

Clone with Python! Data structures for double stranded DNA & simulation of homologous recombination, Gibson assembly, cut & paste cloning. Planning genetic constructs with many parts and assembly steps, such as recombinant metabolic pathways, is often difficult to document properly, as is evident from the poor state of documentation in the scientific literature. The pydna Python package provides a human-readable formal description of cloning and genetic assembly strategies in Python which...

Downloads: 0 This Week

Last Update: 2026-02-06
See Project
12

Recap

Recap tracks and transforms schemas across your whole application

Recap is a schema language and multi-language toolkit to track and transform schemas across your whole application. Your data passes through web services, databases, message brokers, and object stores. Recap describes these schemas in a single language, regardless of which system your data passes through. Recap schemas can be defined in YAML, TOML, JSON, XML, or any other compatible language.

Downloads: 0 This Week

Last Update: 2025-12-30
See Project
13

Ethereum ETL

Python scripts for ETL (extract, transform and load) jobs for Ethereum

Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery. Ethereum ETL lets you convert blockchain data into convenient formats like CSVs and relational databases.

Downloads: 0 This Week

Last Update: 2024-04-11
See Project
14

Run Page

Make your own running home page

GitHub Actions manages automatic synchronization of runs and generation of new pages. Fast, Gatsby-generated static pages. Support for automated deployment to Vercel (recommended) and GitHub Pages. React Hooks. Mapbox for map display. Supports most sports apps, such as Nike and Strava. Automatically backs up GPX data for easy upload to other software.

Downloads: 0 This Week

Last Update: 2025-04-21
See Project
15

harmonypy

Integrate multiple high-dimensional datasets with fuzzy k-means

Harmony is an algorithm for integrating multiple high-dimensional datasets. harmonypy is a port of the harmony R package by Ilya Korsunsky. Harmony is a general-purpose R package with an efficient algorithm for integrating multiple data sets. It is especially useful for large single-cell datasets such as single-cell RNA-seq.

Downloads: 0 This Week

Last Update: 2026-01-09
See Project
16

Union Pandera

Light-weight, flexible, expressive statistical data testing library

The open-source framework for precision data testing for data scientists and ML engineers. Pandera provides a simple, flexible, and extensible data-testing framework for validating not only your data but also the functions that produce them. A simple, zero-configuration data testing framework for data scientists and ML engineers seeking correctness.

Downloads: 0 This Week

Last Update: 2026-01-29
See Project
17

miepython

Mie scattering of light by perfect spheres

miepython is a pure Python module to calculate light scattering for non-absorbing, partially-absorbing, or perfectly-conducting spheres. Mie theory is used, following the procedure described by Wiscombe. This code has been validated against his results. This code provides functions for calculating the extinction efficiency, scattering efficiency, backscattering, and scattering asymmetry. Moreover, a set of angles can be given to calculate the scattering for a sphere at each of those angles.

Downloads: 0 This Week

Last Update: 2026-02-08
See Project
18

Metacrafter

Metadata and data identification tool and Python library

Python command-line tool and Python engine to label table fields and fields in data files. It can help find meaningful data in your tables and data files, or find personally identifiable information (PII). Metacrafter is a rule-based tool that helps label the fields of tables in databases. It scans tables and finds person names, surnames, middle names, PII data, and basic identifiers like UUID/GUID. These rules are written as .yaml files and can be easily extended.

Downloads: 0 This Week

Last Update: 2024-06-14
See Project
19

CellTypist

A tool for semi-automatic cell type classification, harmonization

CellTypist is an automated tool for cell type classification, harmonization, and integration. Classification: transfer cell type labels from the reference to the query dataset. Harmonization: match and harmonize cell types defined by independent datasets. Integration: integrate cells and cell types with supervision from harmonization. CellTypist recapitulates cell type structure and biology of independent datasets. Regularised linear models with Stochastic Gradient Descent provide a fast and...

Downloads: 0 This Week

Last Update: 2025-06-25
See Project
20

leafmap

A Python package for interactive mapping and geospatial analysis

...However, not everyone in the geospatial community has access to the GEE cloud computing platform. Leafmap is designed to fill this gap for non-GEE users. It is a free and open-source Python package that enables users to analyze and visualize geospatial data with minimal coding in a Jupyter environment, such as Google Colab, Jupyter Notebook, and JupyterLab. Leafmap is built upon several open-source packages, such as folium and ipyleaflet (for creating interactive maps), WhiteboxTools and whiteboxgui (for analyzing geospatial data), and ipywidgets (for designing interactive graphical user interface [GUI]).

Downloads: 0 This Week

Last Update: 2026-02-19
See Project
21

Mage.ai

Build, run, and manage data pipelines for integrating data

Open-source data pipeline tool for transforming and integrating data. The modern replacement for Airflow. Effortlessly integrate and synchronize data from 3rd party sources. Build real-time and batch pipelines to transform data using Python, SQL, and R. Run, monitor, and orchestrate thousands of pipelines without losing sleep. Have you met anyone who said they loved developing in Airflow?

Downloads: 0 This Week

Last Update: 2026-01-20
See Project
22

NannyML

Detecting silent model failure. NannyML estimates performance

NannyML is an open-source Python library that allows you to estimate post-deployment model performance (without access to targets), detect data drift, and intelligently link data drift alerts back to changes in model performance. Built for data scientists, NannyML has an easy-to-use interface and interactive visualizations, is completely model-agnostic, and currently supports all tabular classification use cases.

Downloads: 0 This Week

Last Update: 2025-07-12
See Project
23

Pyper

Concurrent Python made simple

Pyper is a Python-native orchestration and scheduling framework designed for modern data workflows, machine learning pipelines, and any task that benefits from a lightweight DAG-based execution engine. Unlike heavier platforms like Airflow, Pyper aims to remain lean, modular, and developer-friendly, embracing Pythonic conventions and minimizing boilerplate. It focuses on local development ergonomics and seamless transition to production environments, making it ideal for small teams and...

Downloads: 0 This Week

Last Update: 2025-04-08
See Project
24

Positron

Positron, a next-generation data science IDE

...It aims to unify exploratory data analysis, production code, and data-app authoring in a single environment so that data scientists move from “question → insight → application” without switching tools. Built on the open-source Code-OSS foundation, Positron provides a familiar coding experience along with specialized panes and tooling for variable inspection, data-frame viewing, plotting previews, and interactive consoles designed for analytical work. The IDE supports notebook and script workflows, integration of data-app frameworks (such as Shiny, Streamlit, Dash), database and cloud connections, and built-in AI-assisted capabilities to help write code, explore data, and build models.

Downloads: 2 This Week

Last Update: 2026-02-10
See Project
25

atpbar

Progress bars for threading and multiprocessing tasks on terminal

Progress bars for threading and multiprocessing tasks on the terminal and Jupyter Notebook. atpbar can display multiple progress bars simultaneously, each growing to show the progress of loop iterations in threading or multiprocessing tasks. atpbar can be used with Mantichora. atpbar started its development in 2015 as part of Alphatwirl. atpbar prevented physicists from terminating their running analysis codes, which...

Downloads: 0 This Week

Last Update: 2025-11-09
See Project

© 2026 Slashdot Media. All Rights Reserved.
