Skip to main content

Robust predictive modelling of water pollution using biomarker data.

Budka, M., Gabrys, B. and Ravagnan, E., 2010. Robust predictive modelling of water pollution using biomarker data. Water Research, 44 (10), 3294-3308.

Full text available as:

WaterResearch_Budka_Gabrys_Ravagnan_Robust_predictive_modelling_of_water_pollution_using_biomarker_data_2009.pdf - Submitted Version


DOI: 10.1016/j.watres.2010.03.006


This paper describes the methodology of building a predictive model for the purpose of marine pollution monitoring, based on low quality biomarker data. A step–by–step, systematic data analysis approach is presented, resulting in design of a purely data–driven model, able to accurately discriminate between various coastal water pollution levels. The environmental scientists often try to apply various machine learning techniques to their data without much success, mostly because of the lack of experience with different methods and required ‘under the hood’ knowledge. Thus this paper is a result of a collaboration between the machine learning and environmental science communities, presenting a predictive model development workflow, as well as discussing and addressing potential pitfalls and difficulties. The novelty of the modelling approach presented lays in successful application of machine learning techniques to high dimensional, incomplete biomarker data, which to our knowledge has not been done before and is the result of close collaboration between machine learning and environmental science communities.

Item Type:Article
Uncontrolled Keywords:biomarkers, water quality monitoring, marine pollution, ensemble classification, missing data, predictive modelling
Group:Faculty of Science & Technology
ID Code:11967
Deposited By: Professor Bogdan Gabrys LEFT
Deposited On:27 Oct 2009 00:20
Last Modified:14 Mar 2022 13:26


Downloads per month over past year

More statistics for this item...
Repository Staff Only -