This repository explores household energy usage over time using the “Individual household electric power consumption” dataset from the UC Irvine Machine Learning Repository. The dataset covers nearly four years of minute-level measurements, including power consumption, voltage, current intensity, and sub-metering values for different areas of the household. The analysis focuses on a two-day period in February 2007 to highlight short-term consumption trends. The data needs careful handling because it contains more than 2 million rows and marks missing values with a coded placeholder. Once the date and time fields are converted into proper date-time formats, the data can be drawn as clear time-series plots of energy usage. The repository demonstrates exploratory data analysis practices in R, with a reproducible workflow that turns raw data into visual insights.
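As a rough illustration of the date and time handling described above, the sketch below parses a tiny inline sample instead of the real file. The semicolon separator, the `?` missing-value code, the `d/m/yyyy` date layout, and the column names are assumptions about the raw file, not details confirmed by this README, and the variable names are hypothetical.

```r
# Minimal sketch: a few rows in the ASSUMED layout of the raw file
# (semicolon-separated, "?" for missing values, dates as d/m/yyyy).
sample_text <- "Date;Time;Global_active_power;Voltage
1/2/2007;00:00:00;0.326;243.150
1/2/2007;00:01:00;?;?
2/2/2007;23:59:00;3.680;240.290"

# na.strings turns the coded placeholder into NA so the
# measurement columns are read as numeric rather than character.
power <- read.table(text = sample_text, header = TRUE, sep = ";",
                    na.strings = "?", stringsAsFactors = FALSE)

# Combine the separate date and time fields into one POSIXct column,
# which base R plotting can use directly as a time axis.
power$DateTime <- as.POSIXct(paste(power$Date, power$Time),
                             format = "%d/%m/%Y %H:%M:%S", tz = "UTC")

str(power)
```

For the full file, the same `read.table()` arguments would apply with a file path in place of `text =`. Fixing the time zone explicitly keeps the parsed timestamps independent of the machine's locale settings.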
Features
- Generates plots from the Individual household electric power consumption dataset
- Provides four dedicated R scripts, each producing a different graph
- Demonstrates subsetting and handling of large datasets in R
- Uses base R plotting functions for reproducible graphics
- Offers a clear example for Coursera’s Exploratory Data Analysis course project
- Focuses on exploring energy usage data, including time-series visualizations
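
The subsetting and base R plotting steps listed above might look roughly like the following sketch. It is not one of the repository's four scripts. The inline sample data, the exact two days chosen (1 and 2 February 2007), the output file name, and the 480 by 480 pixel size are all assumptions made for illustration.

```r
# Minimal sketch: inline rows in the ASSUMED raw layout, including
# rows just outside the two-day window so the subset has an effect.
raw_text <- "Date;Time;Global_active_power
31/1/2007;23:59:00;0.500
1/2/2007;00:00:00;0.326
1/2/2007;12:00:00;?
2/2/2007;12:00:00;1.250
2/2/2007;23:59:00;3.680
3/2/2007;00:00:00;3.662"

power <- read.table(text = raw_text, header = TRUE, sep = ";",
                    na.strings = "?", stringsAsFactors = FALSE)

# Convert the date field once, then keep only the assumed two-day window.
power$Date <- as.Date(power$Date, format = "%d/%m/%Y")
in_window <- power$Date >= as.Date("2007-02-01") &
             power$Date <= as.Date("2007-02-02")
two_days <- power[in_window, ]

# Build a date-time column for the x axis.
two_days$DateTime <- as.POSIXct(paste(two_days$Date, two_days$Time),
                                format = "%Y-%m-%d %H:%M:%S", tz = "UTC")

# Draw a line plot with base graphics into a PNG file (hypothetical name).
out_file <- file.path(tempdir(), "plot_sketch.png")
png(out_file, width = 480, height = 480)
plot(two_days$DateTime, two_days$Global_active_power, type = "l",
     xlab = "", ylab = "Global Active Power (kilowatts)")
dev.off()
```

Filtering before building the date-time column keeps the more expensive conversion limited to the rows that are actually plotted, which matters more on the full dataset than on this sample.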