Build, run, and manage data pipelines for integrating data
Python ETL framework for stream processing, real-time analytics, LLM
A Python package for interactive geospatial analysis and visualization
The power of Chart.js with Python
A cross-platform installer for the Julia programming language
Monitor the stability of a Pandas or Spark dataframe
Train machine learning models within Docker containers
Docker image used to run data processing workloads
Pythonic tool for running machine-learning/high-performance workflows
Metadata and data identification tool and Python library
Making DAG construction easier
Data science on data without acquiring a copy
High-Performance Symbolic Regression in Python and Julia
Main repository for Vispy
Benchmarking synthetic data generation methods
Detecting silent model failure. NannyML estimates performance
An orchestration platform for the development, production
CKAN is an open-source DMS for powering data hubs
Python module that helps you build complex pipelines of batch jobs
Scale your Pandas workflows by changing a single line of code
Uncover insights, surface problems, monitor, and fine-tune your LLM
Production-ready data processing made easy and shareable
Best practices on recommendation systems
Efficiently diff rows across two different databases
DXF2GCODE: converting 2D DXF drawings to CNC machine-compatible G-code