Wednesday, 25 June 2014

WikiEnergy Data Set Statistics

I recently wrote a post about the WikiEnergy data set released by Pecan Street Inc, and have since written a downloader and converter for the data set as part of NILMTK. In total, the data set contains 71 feed feeds monitored across 239 buildings over the period 1 Jan 2014 - 31 May 2014. However, only a subset of feeds were monitored for each building, and many buildings were not monitored for the full 5 months. This post provides a bit more insight into the content of the data set at the time of writing.

Feeds per building:


The histogram below shows the number of feeds monitored in each of the 239 buildings. It can be seen that the mode of the distribution is around 12 feeds per building, and therefore most of these buildings will be useful for evaluation energy disaggregation approaches.


Duration per building:


The histogram below shows the number of months for which each of the 239 buildings were monitored. It can be seen that vast majority of buildings were monitored for the full 5 months, while the remaining buildings were distributed between 1-4 months. However, this distribution will change dramatically once data for 2012-2013 is released.


Percentage energy sub-metered:


The histogram below shows the percentage of energy sub-metered in 235 of the 239 buildings. The remaining 4 buildings appeared to have energy sub-metered greater than 100%, and were therefore excluded from this plot. This distribution has two distinct peaks; one centred around 70% and another which peaks around 5%. The 63 buildings for which less than 40% of the energy was sub-metered are likely to be of limited use for evaluating energy disaggregation methods.



Buildings per feed:


The table below shows the number of buildings in which each of the 71 feeds were present. A description of each of the feeds is available from the Wiki-Energy Knowledge Base. It can be seen that the presence of feeds in buildings is quite sparse. However, the following feeds are present in the majority of buildings: the household aggregate power (use), air conditioning (air1), washing machine (clotheswasher1), dishwasher (dishwasher1), clothes dryer (drye1), electric heating (furnace1) and refrigerator (refrigerator1).

Feed Buildings
use 239
air1 224
air2 38
air3 5
airwindowunit1 3
aquarium1 1
bathroom1 57
bathroom2 7
bedroom1 65
bedroom2 30
bedroom3 4
bedroom4 0
bedroom5 0
car1 62
clotheswasher1 133
clotheswasher_dryg1 28
diningroom1 20
diningroom2 1
dishwasher1 150
disposal1 85
drye1 141
dryg1 29
freezer1 13
furnace1 184
furnace2 29
garage1 25
garage2 3
gen 116
grid 0
heater1 2
housefan1 2
icemaker1 1
jacuzzi1 13
kitchen1 46
kitchen2 17
kitchenapp1 103
kitchenapp2 73
lights_plugs1 79
lights_plugs2 40
lights_plugs3 16
lights_plugs4 4
lights_plugs5 2
lights_plugs6 0
livingroom1 64
livingroom2 10
microwave1 113
office1 31
outsidelights_plugs1 16
outsidelights_plugs2 3
oven1 89
oven2 3
pool1 4
pool2 0
poollight1 2
poolpump1 17
pump1 3
range1 61
refrigerator1 164
refrigerator2 14
security1 7
shed1 3
sprinkler1 9
unknown1 16
unknown2 6
unknown3 1
unknown4 1
utilityroom1 5
venthood1 19
waterheater1 21
waterheater2 2
winecooler1 4

2 comments:

  1. Hi Oliver
    How can access the database of wikienergy?

    Thanks

    ReplyDelete
    Replies
    1. Hi,

      WikiEnergy became the Dataport project, see this website for full details:
      https://dataport.cloud/

      Oli

      Delete