Parallel computing with task scheduling
Build, run, and manage data pipelines for integrating data
Python ETL framework for stream processing, real-time analytics, LLM
A Python toolbox for gaining geometric insights
CKAN is an open-source DMS for powering data hubs
3D plotting and mesh analysis through a streamlined interface
Mie scattering of light by perfect spheres
A cross-platform installer for the Julia programming language
A tool for semi-automatic cell type classification, harmonization
AI-data warehouse to enrich, transform and analyze unstructured data
Progress bars for threading and multiprocessing tasks on terminal
Docker image used to run data processing workloads
Recap tracks and transform schemas across your whole application
Data integration platform for ELT pipelines from APIs, databases
Uncover insights, surface problems, monitor, and fine tune your LLM
WebGL-based viewer for volumetric data
An orchestration platform for the development, production
Python module that helps you build complex pipelines of batch jobs
Pythonic tool for running machine-learning/high performance workflows
https://github.com/JuliaPy/Conda.jl
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
A curated list of data mining papers about fraud detection
An interactive Formula 1 race visualisation and data analysis tool
Integrate multiple high-dimensional datasets with fuzzy k-means
Automatically find issues in image datasets