Wednesday, 7 September 2011

The Reference Energy Disaggregation Data Set

Accurately comparing the performance of energy disaggregation (Non-intrusive Appliance Load Monitoring) methods is almost impossible unless the same test data is used. For this reason, a team at MIT have built The Reference Energy Disaggregation Data Set (REDD). A few weeks ago, an initial (v1.0) snapshot of this data set was released for use in academic work. The paper describing the equipment used for data collection is:

Kolter JZ, Johnson MJ. REDD: A Public Data Set for Energy Disaggregation Research. In: Workshop on Data Mining Applications in Sustainability (SIGKDD). San Diego, CA; 2011.

The dataset contains data for 6 houses. For each house:
  • The current and voltage of the two mains circuits are monitored at 15 kHz. This data is also available for download as power readings downsampled to 1 Hz
  • The power of each circuit in the home is also monitored at a rate of one reading every 3-4 seconds. Each house contains 11-26 circuits, with large appliances often operating on their own dedicated circuit
  • The power drawn by individual appliances is also being monitored, although not enough data had been collected to warrant its release in v1.0
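
As a rough sketch of how the per-circuit readings might be loaded for analysis, the snippet below parses plain-text lines of the form "timestamp power". This layout (a UTC timestamp in seconds followed by a power value in watts, one reading per line) is an assumption about the low-frequency files, and the function name `read_channel` and the sample values are made up for illustration.

```python
import io


def read_channel(lines):
    """Parse 'timestamp power' lines into a list of (int, float) pairs.

    Assumed layout (not confirmed by the post): one reading per line,
    a UTC timestamp in seconds, whitespace, then power in watts.
    """
    readings = []
    for line in lines:
        parts = line.split()
        if len(parts) != 2:
            continue  # skip blank or malformed lines
        readings.append((int(parts[0]), float(parts[1])))
    return readings


# Made-up sample standing in for a real channel file.
sample = io.StringIO("1303132929 6.0\n1303132933 6.0\n\n1303132936 187.5\n")
fridge = read_channel(sample)
```

Passing an open file object instead of the `io.StringIO` sample should work the same way, since the function only iterates over lines.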
Collecting energy consumption data is a time-consuming and expensive process. The lack of available data about appliance behaviour and energy consumption often limits the understanding necessary to design energy disaggregation methods. Through the study of this dataset, I hope to provide some insight into how energy is consumed in the home and how energy disaggregation methods can take advantage of this.

Energy Breakdown

The aim of energy disaggregation methods is often to maximise the disaggregation accuracy, that is, to minimise the difference between the predicted and actual energy consumption of each appliance. The penalty for failing to recognise that an appliance has consumed energy therefore grows with that appliance's energy consumption. Consequently, it is more important for a method to correctly disaggregate an appliance that consumes a lot of energy than one that consumes very little. So which appliances are most important? The figure below shows the relative contributions of each household circuit for house 1 of REDD.
Energy breakdown by circuit of house 1
The above figure shows that just 7 circuits collectively contribute over 75% of the house's total energy consumption. Admittedly, not all circuits contain a single appliance, although I doubt any circuit contains more than a handful of appliances. I'm a little surprised that the fridge comes out as the highest energy consumer, but I don't think it's unusual that it would contribute a fair portion of a house's energy consumption.
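
A breakdown like this can be sketched by integrating each circuit's power over time and ranking the circuits by their share of the total. The code below is only an illustration under stated assumptions: readings are (timestamp in seconds, watts) pairs, gaps longer than a hypothetical `max_gap` are treated as missing data, and the three circuits and their constant loads are invented, not taken from REDD.

```python
def energy_wh(readings, max_gap=60):
    """Trapezoidal energy in watt-hours from (timestamp_s, watts) pairs.

    Intervals longer than max_gap seconds are skipped, on the assumption
    that they are gaps in the recording rather than real consumption.
    """
    joules = 0.0
    for (t0, p0), (t1, p1) in zip(readings, readings[1:]):
        dt = t1 - t0
        if 0 < dt <= max_gap:
            joules += 0.5 * (p0 + p1) * dt
    return joules / 3600.0


def breakdown(circuits):
    """Return (name, share, cumulative_share) tuples, largest consumer first."""
    energies = {name: energy_wh(r) for name, r in circuits.items()}
    total = sum(energies.values())
    ranked = sorted(energies.items(), key=lambda kv: kv[1], reverse=True)
    rows, cumulative = [], 0.0
    for name, energy in ranked:
        share = energy / total if total else 0.0
        cumulative += share
        rows.append((name, share, cumulative))
    return rows


# Invented constant loads sampled every 4 seconds for one hour.
times = range(0, 3601, 4)
circuits = {
    "lighting": [(t, 60.0) for t in times],
    "fridge": [(t, 100.0) for t in times],
    "microwave": [(t, 40.0) for t in times],
}
result = breakdown(circuits)
for name, share, cumulative in result:
    print(f"{name:10s} {share:6.1%}  cumulative {cumulative:6.1%}")
```

Reading down the cumulative column shows how many circuits are needed to reach a given fraction of the total, which is the kind of statement made about the figure above.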

Appliance Behaviour

In order to disaggregate a household's energy consumption into the contributing appliances, it is often necessary to build models of such appliances. Analysing the power demand of each appliance is often a good place to start when deciding what information such a model should contain. Since the fridge came out as the highest consumer, I've chosen to take a closer look at the fridge's power demand in 5 of the REDD houses. Below is a plot of the power demand over a 45-minute period.
The fridges differ in their power demand, duration of use and periodicity. Interestingly, however, they all share the same general pattern of usage, in that they all exhibit similar cyclic behaviour. For each fridge, the cycle begins with a single sharp increase in power, after which the power gradually decreases over the cycle before finally dropping off sharply. Although not ideal, this is encouraging for the purpose of general appliance models. Such a general model of fridges could be built, and would only need to be tweaked in order to match a specific fridge's signature.
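
One simple way to extract the per-fridge parameters mentioned above (power demand, duration of use and periodicity) is to threshold the power signal and measure each "on" run. This is a minimal sketch, not a method from the REDD paper: the threshold value, the function name `find_cycles` and the synthetic fridge signal are all assumptions chosen for illustration.

```python
def find_cycles(readings, on_threshold=50.0):
    """Return (start, end, mean_watts) for each run of samples above on_threshold.

    'end' is the timestamp of the first sample back below the threshold.
    A run still in progress at the last sample is dropped, and a run
    already in progress at the first sample is counted from that sample.
    """
    cycles = []
    start = None
    samples = []
    for t, p in readings:
        if p > on_threshold:
            if start is None:
                start = t
                samples = []
            samples.append(p)
        elif start is not None:
            cycles.append((start, t, sum(samples) / len(samples)))
            start = None
    return cycles


def synthetic_fridge(t):
    """Invented signal: 800 s off at 5 W, then 400 s on, decaying 200 W -> 150 W."""
    phase = t % 1200
    if phase < 800:
        return 5.0
    return 200.0 - 50.0 * (phase - 800) / 400.0


readings = [(t, synthetic_fridge(t)) for t in range(0, 3600, 4)]
cycles = find_cycles(readings)

durations = [end - start for start, end, _ in cycles]
periods = [b[0] - a[0] for a, b in zip(cycles, cycles[1:])]
print("durations (s):", durations)
print("periods (s):", periods)
print("mean on-power (W):", [round(m, 1) for _, _, m in cycles])
```

A general fridge model could then be specialised to a particular fridge by plugging in its measured duration, period and mean on-power, though a fixed threshold would need adjusting for fridges with very different power levels.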

4 comments:

  1. hello sir. currently im doing my final year project in my university. can i know how to do the plotting of Voltage-Current Trajectory? i already get the REDD from MIT but i dont know how to do the next step. really hope that you can help me. this is my email : zackzaim_mml14@yahoo.com

    Replies
    1. Have you downloaded the high-frequency version of the REDD data set? You might also want to look into the PLAID data set, which includes high-frequency voltage and current data: http://plaidplug.com/

      For an overview of V / I data, you might also want to read this paper: http://ieeexplore.ieee.org/document/7418189/ and this presentation: https://www.sigport.org/sites/default/files/A%20feasibility%20study%20of%20automated%20plug-load%20identification%20from%20high-frequency%20measurements.pdf

    2. thank you sir. by the way, can i know how you obtain the graph of power versus time for refrigerator? the 45 minutes one. is it just simply plot using the data in the low frequency directory? thank you and sorry for inconvenience.

    3. Yes - I used data from the low frequency directory, selecting the fridge from 5 of the houses.
