The emergence of Machine learning (ML) algorithms have shown competency in a variety of fields and are growing in popularity in their application to geospatial science issues. Most recently and notably, ML algorithms have been applied to flood susceptibility (FS) mapping. Leveraging high power computing systems and existing ML algorithms with national datasets of Canada, this project has explored methods to create a national FS layer across a geographically large and diverse country with limited training data. First, approaches were considered on how to generate a map of FS for Canada, (i) national, which combined all training data into one model, and (ii) regional, where multiple models were created, based on regional similarities, and the results were mosaicked to generate a FS map. The second experiment explored the predictive capability of several ML algorithms across the geographically large and diverse landscape. Results indicate that the national approach provides a better prediction of FS, with 95.7 % of the test points, 91.5% of the pixels in the training sites, and 89.6% of the pixels across the country correctly predicted as flooded, compared to 65.5%, 80.6% and 75.6% respectively in the regional approach. ML models applied across the country found that support vector machine (svmRadial) and Neural Network (nnet) performed poorly in areas away from the training sites, while random forest (parRF) and Multivariate Adaptive Regression Spline (earth) performed better. A national ensemble model was ultimately selected as this blend of models compensated for the biases found in the individual models in geographic areas far removed training sites.
- 13 Reads