An orchestration platform for the development, production
Uncover insights, surface problems, monitor, and fine-tune your LLM
CKAN is an open-source DMS for powering data hubs
Fast, flexible and powerful Python data analysis toolkit
Open-source data observability for analytics engineers
Orange: Interactive data analysis
Benchmarking synthetic data generation methods
Recap tracks and transforms schemas across your whole application
Machine learning in Python
Python module that helps you build complex pipelines of batch jobs
Diagram generation for understanding codebases and system architecture
Visualize and compare datasets, target values and associations
Collaborative forensic timeline analysis
matplotlib: plotting with Python
Data science on data without acquiring a copy
A cross-platform installer for the Julia programming language
Dataset Management Framework, a Python library and a CLI tool to build
Panda-Helper: Data profiling utility for Pandas DataFrames and Series
Python ETL framework for stream processing, real-time analytics, LLM
Create HTML profiling reports from pandas DataFrame objects
Project structure for doing and sharing data science work
Parallel computing with task scheduling
AI-data warehouse to enrich, transform and analyze unstructured data
Data integration platform for ELT pipelines from APIs, databases
Convert a Python notebook to a web app and share it with non-technical users