Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Data integration platform for ELT pipelines from APIs, databases
An AI-powered data science team of agents
A tool for semi-automatic cell type classification, harmonization
Recap tracks and transform schemas across your whole application
An orchestration platform for the development, production
Positron, a next-generation data science IDE
Integrate multiple high-dimensional datasets with fuzzy k-means
Python ETL framework for stream processing, real-time analytics, LLM
Burp Suite extension for JavaScript static analysis
A multi-cloud framework for big data analytics
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
Main repository for Vispy
WebGL-based viewer for volumetric data
3D plotting and mesh analysis through a streamlined interface
Great Expectations Airflow operator
Dataset Management Framework, a Python library and a CLI tool to build
Diagram generation for understanding codebases and system architecture
Scale your Pandas workflows by changing a single line of code
Python module that helps you build complex pipelines of batch jobs
Clean Jupyter notebooks of outputs, metadata, and empty cells
DXF2GCODE: converting 2D dxf drawings to CNC machine compatible G-Code
Real-time, incremental ETL library for ML with record-level depend
Uma Ferramenta Computacional para Análise e Recuperação de Patentes
A lightweight opinionated ETL framework, halfway between plain scripts