linux is free download

Airbyte

Data integration platform for ELT pipelines from APIs, databases

We believe that only an open-source solution to data movement can cover the long tail of data sources while empowering data engineers to customize existing connectors. Our ultimate vision is to help you move data from any source to any destination. Airbyte already provides the largest catalog of 300+ connectors for APIs, databases, data warehouses, and data lakes. Moving critical data with Airbyte is as easy and reliable as flipping on a switch. Our teams process more than 300 billion rows...

Downloads: 19 This Week

Last Update: 2025-10-15

Dagster

An orchestration platform for the development, production

Dagster is an orchestration platform for the development, production, and observation of data assets. Dagster as a productivity platform: With Dagster, you can focus on running tasks, or you can identify the key assets you need to create using a declarative approach. Embrace CI/CD best practices from the get-go: build reusable components, spot data quality issues, and flag bugs early. Dagster as a robust orchestration engine: Put your pipelines into production with a robust...

Downloads: 5 This Week

Last Update: 3 days ago

Recap

Recap tracks and transforms schemas across your whole application

Recap is a schema language and multi-language toolkit to track and transform schemas across your whole application. Your data passes through web services, databases, message brokers, and object stores. Recap describes these schemas in a single language, regardless of which system your data passes through. Recap schemas can be defined in YAML, TOML, JSON, XML, or any other compatible language.

Downloads: 2 This Week

Last Update: 2025-12-30

harmonypy

Integrate multiple high-dimensional datasets with fuzzy k-means

Harmony is an algorithm for integrating multiple high-dimensional datasets. harmonypy is a port of the harmony R package by Ilya Korsunsky. Harmony is a general-purpose R package with an efficient algorithm for integrating multiple data sets. It is especially useful for large single-cell datasets such as single-cell RNA-seq.

Downloads: 0 This Week

Last Update: 2026-01-09

CellTypist

A tool for semi-automatic cell type classification, harmonization

CellTypist is an automated tool for cell type classification, harmonization, and integration. Classification: transfer cell type labels from the reference to the query dataset. Harmonization: match and harmonize cell types defined by independent datasets. Integration: integrate cells and cell types with supervision from harmonization. CellTypist recapitulates cell type structure and biology of independent datasets. Regularised linear models with Stochastic Gradient Descent provide a fast and...

Downloads: 0 This Week

Last Update: 2025-06-25

Mara Pipelines

A lightweight opinionated ETL framework, halfway between plain scripts

This package contains a lightweight data transformation framework with a focus on transparency and complexity reduction. Data integration pipelines as code: pipelines, tasks and commands are created using declarative Python code. PostgreSQL as a data processing engine. Extensive web UI: the web browser is the main tool for inspecting, running and debugging pipelines. GNU make semantics: nodes depend on the completion of upstream nodes. No data dependencies or data flows. No in-app data...

Downloads: 0 This Week

Last Update: 2023-12-06

Pytente

A Computational Tool for Patent Analysis and Retrieval

Pytente is an advanced solution for automating the collection, storage, and processing of bibliographic patent data. The tool was designed to simplify the collection of large volumes of data from open-access repositories. Pytente ensures structured storage of the information, along with validation and removal of duplicate records. Among the various features the tool provides, notable ones include the customized extraction of subsets of...

Downloads: 0 This Week

Last Update: 2025-11-03

scArches

Reference mapping for single-cell genomics

Single-cell architecture surgery (scArches) is a package for reference-based analysis of single-cell data. scArches allows your single-cell query data to be analyzed by integrating it into a reference atlas. By mapping your data into an integrated reference, you can transfer cell-type annotations from reference to query, identify disease states by mapping to a healthy atlas, and pursue advanced applications such as imputing missing data modalities or spatial locations.

Downloads: 0 This Week

Last Update: 2023-06-13

openISI : topical data integration

A tool for autonomous and virtual topical data integration using the focused web-harvesting method.

Downloads: 0 This Week

Last Update: 2013-04-09

DataSync Suite

DataSync Suite is an open source platform for integrating tools like Zimbra, SugarCRM, and Drupal. The tool focuses on single sign-on, application data integration, and fast, flexible deployment.

Downloads: 0 This Week

Last Update: 2015-12-21

SnapLogic

SnapLogic is an Open Source Data Integration framework that combines the power of state-of-the-art dynamic programming languages with standard Web interfaces to solve today's most pressing problems in data integration.

Downloads: 3 This Week

Last Update: 2013-04-16

Python Data Integration (PyDI)

A lightweight, browsing-based, 100% Python, federated data integration framework. Users may create custom schemas for disparate sources, then query and expand results across sources to find related data. Intended for fields such as bioinformatics and data mining.

Downloads: 3 This Week

Last Update: 2013-04-23

Search Results for "linux is"

Showing 12 open source projects for "linux is"
