laurimyllyvirta
Feb 17, 2012 12:12 PM
Duplicate entries in the E-PRTR database
I was trying to work with the downloadable E-PRTR database, but ran into a problem where the same facility has been entered with two or more different Facility IDs, causing double counting of some of the emissions. The online search and the EEA report "Revealing the costs of air pollution from industrial facilities in Europe" do not appear to suffer from this problem. Is there a systematic way to clear up the double entries, or is it possible to obtain a list of all the most recent Facility IDs to filter them from the database? I am working with way too many entries for any manual approach to be viable.
Thank you so much for your help,
Lauri Myllyvirta
I was trying to work with the downloadable E-PRTR database, but ran into a problem where the same facility has been entered with two or more different Facility IDs, causing double counting of some of the emissions. The online search and the EEA report "Revealing the costs of air pollution from industrial facilities in Europe" do not appear to suffer from this problem. Is there a systematic way to clear up the double entries, or is it possible to obtain a list of all the most recent Facility IDs to filter them from the database? I am working with way too many entries for any manual approach to be viable.
Thank you so much for your help,
Lauri Myllyvirta
Blog
Status Log
Wiki
Indeed, there was an issue with facilityIDs in the E-PRTR database and this has now been corrected. The updated dataset can be obtained from the EEA data service here:
http://www.eea.europa.eu/[…]/617DD46F-1162-40DF-9B15-0FE7AFB9C5F9
Hope this helps!
Otherwise an example may help to clarify if the issue is a mistake in the database. Thank you!