ydata-profiling primary goal is to provide a one-line Exploratory Data Analysis (EDA) experience in a consistent and fast solution. Like pandas df.describe() function, that is so handy, ydata-profiling delivers an extended analysis of a DataFrame while allowing the data analysis to be exported in different formats such as html and json.

Features

  • Automatic detection of columns’ data types (Categorical, Numerical, Date, etc.)
  • A summary of the problems/challenges in the data that you might need to work on (missing data, inaccuracies, skewness, etc.)
  • Descriptive statistics (mean, median, mode, etc) and informative visualizations such as distribution histograms
  • Correlations, a detailed analysis of missing data, duplicate rows, and visual support for variables pairwise interaction
  • Different statistical information relative to time dependent data such as auto-correlation and seasonality, along ACF and PACF plots
  • Most common categories (uppercase, lowercase, separator), scripts (Latin, Cyrillic) and blocks (ASCII, Cyrilic)

Project Samples

Project Activity

See All Activity >

Categories

Data Quality

License

MIT License

Follow ydata-profiling

ydata-profiling Web Site

You Might Also Like
Business Continuity Solutions | ConnectWise BCDR Icon
Business Continuity Solutions | ConnectWise BCDR

Build a foundation for data security and disaster recovery to fit your clients’ needs no matter the budget.

Whether natural disaster, cyberattack, or plain-old human error, data can disappear in the blink of an eye. ConnectWise BCDR (formerly Recover) delivers reliable and secure backup and disaster recovery backed by powerful automation and a 24/7 NOC to get your clients back to work in minutes, not days.
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of ydata-profiling!

Additional Project Details

Programming Language

Python

Related Categories

Python Data Quality Tool

Registered

2023-06-12