ydata-profiling primary goal is to provide a one-line Exploratory Data Analysis (EDA) experience in a consistent and fast solution. Like pandas df.describe() function, that is so handy, ydata-profiling delivers an extended analysis of a DataFrame while allowing the data analysis to be exported in different formats such as html and json.

Features

  • Automatic detection of columns’ data types (Categorical, Numerical, Date, etc.)
  • A summary of the problems/challenges in the data that you might need to work on (missing data, inaccuracies, skewness, etc.)
  • Descriptive statistics (mean, median, mode, etc) and informative visualizations such as distribution histograms
  • Correlations, a detailed analysis of missing data, duplicate rows, and visual support for variables pairwise interaction
  • Different statistical information relative to time dependent data such as auto-correlation and seasonality, along ACF and PACF plots
  • Most common categories (uppercase, lowercase, separator), scripts (Latin, Cyrillic) and blocks (ASCII, Cyrilic)

Project Samples

Project Activity

See All Activity >

Categories

Data Quality

License

MIT License

Follow ydata-profiling

ydata-profiling Web Site

You Might Also Like
Find out just how much your login box can do for your customer | Auth0 Icon
Find out just how much your login box can do for your customer | Auth0

With over 53 social login options, you can fast-track the signup and login experience for users.

From improving customer experience through seamless sign-on to making MFA as easy as a click of a button – your login box must find the right balance between user convenience, privacy and security.
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of ydata-profiling!

Additional Project Details

Programming Language

Python

Related Categories

Python Data Quality Tool

Registered

2023-06-12