Data integration platform for ELT pipelines from APIs, databases
A tool for semi-automatic cell type classification, harmonization
Positron, a next-generation data science IDE
Recap tracks and transform schemas across your whole application
Integrate multiple high-dimensional datasets with fuzzy k-means
Python ETL framework for stream processing, real-time analytics, LLM
An orchestration platform for the development, production
A multi-cloud framework for big data analytics
Burp Suite extension for JavaScript static analysis
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Main repository for Vispy
An AI-powered data science team of agents
WebGL-based viewer for volumetric data
3D plotting and mesh analysis through a streamlined interface
Great Expectations Airflow operator
Diagram generation for understanding codebases and system architecture
Dataset Management Framework, a Python library and a CLI tool to build
Clean Jupyter notebooks of outputs, metadata, and empty cells
Scale your Pandas workflows by changing a single line of code
Python module that helps you build complex pipelines of batch jobs
DXF2GCODE: converting 2D dxf drawings to CNC machine compatible G-Code
Uma Ferramenta Computacional para Análise e Recuperação de Patentes
Real-time, incremental ETL library for ML with record-level depend
A lightweight opinionated ETL framework, halfway between plain scripts