Applying machine learning to model radon using topsoil geochemistry
Citation:
Banr�on, M., Cobelli, M., Crowley, Q.G, Applying machine learning to model radon using topsoil geochemistry, Applied Geochemistry, 158, 2023, 105790-Download Item:
Abstract:
Radon is classified as a Class 1 carcinogen, being the leading cause of lung cancer in non-smokers. Understanding the prominent sources of radon helps to mitigate against the adverse effects of radon exposure. Considering soil-gas radon is the main contributor to indoor radon, it is possible that soil geochemistry can be used as a proxy for the soil radon emanation potential or geogenic radon classes for a particular location. This paper investigates the relationship between soil geochemistry and geogenic radon. A large area of 17,983 km2 from the West, Midlands and East of Ireland was selected to represent a range of geology types and radon categories. A rigorous assessment is presented to investigate the relationship of geogenic radon and topsoil geochemistry; using univariate processes (i.e. r2, Pearson r and heatmaps) and multivariate techniques (i.e. principle component analysis (PCA) and machine learning (ML) algorithms including Gaussian process regression, logistic regression and random forest). Here, PCA and ML techniques were used to test the utility of soil geochemistry to predict geogenic radon classes. Gaussian Process Regression yielded the highest accuracy (74%) and f1-score (0.74) of all models. The feature importance (i.e. highest ranking elements for predicting geogenic radon class) from the ML models outputs elements including [Y, Tl, Mn, Cr, Co, Be, Sc and Rb]. The PCA biplot demonstrates that these elements cluster in conjunction with higher geogenic radon categories. Multivariate data analysis reveals that certain elements important for predicting higher geogenic radon classes, also covary together within topsoil samples; here these are termed “radon-prone elements”. Spatial covariance of radon-prone elements permits soil geochemistry to be used as a tool for understanding the distribution of geogenic radon. The methodology presented in this paper provides a comprehensive geo-statistical approach to investigate the relation between topsoil geochemistry and geogenic radon. This approach could be applied as a diagnostic tool to assist radon mitigation measures, hence adding value to legacy soil geochemistry datasets.
Author's Homepage:
http://people.tcd.ie/crowleyqDescription:
PUBLISHED
Author: Crowley, Quentin
Type of material:
Journal ArticleCollections
Series/Report no:
Applied Geochemistry158
Availability:
Full text availableKeywords:
Geogenic radon potential, Radon, Topsoil geochemistry, Machine learning, Principal component analysisSubject (TCD):
Cancer , Smart & Sustainable Planet , GEOCHEMISTRY , MACHINE LEARNING , RADON , SOIL CHEMISTRYDOI:
https://doi.org/10.1016/j.apgeochem.2023.105790Metadata
Show full item recordLicences: