Metadata/data identification Java library
A curated list of data mining papers about fraud detection
Toolkit for making machine learning and data analysis applications
Automatically find issues in image datasets
Parse text and tables from PDF files.
Metadata and data identification tool and Python library
Linux system exploration and troubleshooting tool
Open-source data observability for analytics engineers
Create HTML profiling reports from pandas DataFrame objects
Python ETL framework for stream processing, real-time analytics, LLM
AI-data warehouse to enrich, transform and analyze unstructured data
Kubernetes-native platform to run massively parallel data/streaming
Climate science package for Julia
The standard data-centric AI package for data quality and ML
Graph theory library for visualization and analysis
Community-curated list of software packages and data resources
A graph database that supports more than 100+ billion data
Astronomical object/structure detection from 1D and 2D data sets.
Rapid, unbiased, reproducible analysis of synaptic events
Big Data Stream Analytics Framework.
A real-time 3D Engine written in Ada
Massive parallel data platform for analytics, machine learning and AI
Social Network Analysis and Visualization software
All-in-one text de-duplication