edoc

Development of Europe-wide models for particle elemental composition using supervised linear regression and random forest

Chen, J. and de Hoogh, K. and Gulliver, J. and Hoffmann, B. and Hertel, O. and Ketzel, M. and Weinmayr, G. and Bauwelinck, M. and van Donkelaar, A. and Hvidtfeldt, U. A. and Atkinson, R. and Janssen, N. A. H. and Martin, R. V. and Samoli, E. and Andersen, Z. J. and Oftedal, B. M. and Stafoggia, M. and Bellander, T. and Strak, M. and Wolf, K. and Vienneau, D. and Brunekreef, B. and Hoek, G.. (2020) Development of Europe-wide models for particle elemental composition using supervised linear regression and random forest. Environmental science & technology, 54 (24). pp. 15698-15709.

[img] PDF - Published Version
Available under License CC BY-NC-ND (Attribution-NonCommercial-NoDerivatives).

5Mb

Official URL: https://edoc.unibas.ch/91047/

Downloads: Statistics Overview

Abstract

We developed Europe-wide models of long-term exposure to eight elements (copper, iron, potassium, nickel, sulfur, silicon, vanadium, and zinc) in particulate matter with diameter <2.5 mum (PM2.5) using standardized measurements for one-year periods between October 2008 and April 2011 in 19 study areas across Europe, with supervised linear regression (SLR) and random forest (RF) algorithms. Potential predictor variables were obtained from satellites, chemical transport models, land-use, traffic, and industrial point source databases to represent different sources. Overall model performance across Europe was moderate to good for all elements with hold-out-validation R-squared ranging from 0.41 to 0.90. RF consistently outperformed SLR. Models explained within-area variation much less than the overall variation, with similar performance for RF and SLR. Maps proved a useful additional model evaluation tool. Models differed substantially between elements regarding major predictor variables, broadly reflecting known sources. Agreement between the two algorithm predictions was generally high at the overall European level and varied substantially at the national level. Applying the two models in epidemiological studies could lead to different associations with health. If both between- and within-area exposure variability are exploited, RF may be preferred. If only within-area variability is used, both methods should be interpreted equally.
Faculties and Departments:09 Associated Institutions > Swiss Tropical and Public Health Institute (Swiss TPH)
09 Associated Institutions > Swiss Tropical and Public Health Institute (Swiss TPH) > Department of Epidemiology and Public Health (EPH) > Environmental Exposures and Health Systems Research > Physical Hazards and Health (Röösli)
09 Associated Institutions > Swiss Tropical and Public Health Institute (Swiss TPH) > Department of Epidemiology and Public Health (EPH) > Environmental Exposures and Health Systems Research > Environmental Exposure Modelling (Vienneau)
UniBasel Contributors:de Hoogh, Kees and Vienneau, Danielle
Item Type:Article, refereed
Article Subtype:Research Article
ISSN:0013-936X
Note:Publication type according to Uni Basel Research Database: Journal article
Language:English
Related URLs:
Identification Number:
edoc DOI:
Last Modified:28 Dec 2022 10:08
Deposited On:28 Dec 2022 10:08

Repository Staff Only: item control page