Enhancing Streamflow Modelling in Data-Scarce Catchments with Similarity-Guided Source Selection and Transfer Learning
Accepted version
Peer-reviewed
Abstract
Accurate streamflow modelling in data-scarce catchments remains a significant challenge due to the limited availability of historical records. Transfer Learning (TL), increasingly applied in hydrology, leverages knowledge from data-rich catchments (sources) to enhance predictions in data-scarce catchments (targets), opening new possibilities for hydrological prediction. Most existing TL approaches pre-train models on large-scale meteoro-hydrological datasets and show good generalizability across multiple target catchments. However, for a specific target catchment, it remains unclear which source catchments contribute most effectively to accurate prediction, and including many irrelevant sources may even degrade model performance. In this study, we investigate how source catchment selection affects TL performance by employing similarity-guided strategies based on three key factors: spatial distance, physical attributes, and flow regime characteristics. Using the CAMELS-GB dataset, we conduct comparative experiments by pre-training the networks on differently ranked groups of source catchments and fine-tuning them on three target catchments representing distinct hydrological environments. Results show that carefully selected small subsets (fewer than 40, or even as few as 10) of highly similar catchments can achieve comparable or better TL performance than using all 668 available source catchments. All three target catchments achieved higher Nash-Sutcliffe efficiency (NSE) values when the source catchments were spatially closer and had more consistent flow regimes. The TL performance of physical attribute similarity-based selection varied with the attribute combination, with those related to land cover, climate, and soil properties leading to superior performance. These findings highlight the importance of similarity-guided source selection in hydrological TL. In addition, they demonstrate ways to reduce computational costs while improving modelling accuracy in data-scarce regions.
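The source-selection idea summarised in the abstract can be illustrated with a minimal Python sketch. The abstract does not state which similarity measures were used, so the choices here are assumptions: great-circle (haversine) distance for spatial proximity and Euclidean distance on z-scored attributes for physical similarity. The function names (`rank_by_distance`, `rank_by_attributes`, `nse`) and the toy coordinates are hypothetical and are not taken from the paper or from CAMELS-GB.

```python
import numpy as np

def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between points given in degrees."""
    lat1, lon1, lat2, lon2 = map(np.radians, (lat1, lon1, lat2, lon2))
    a = (np.sin((lat2 - lat1) / 2) ** 2
         + np.cos(lat1) * np.cos(lat2) * np.sin((lon2 - lon1) / 2) ** 2)
    return 2 * 6371.0 * np.arcsin(np.sqrt(a))

def rank_by_distance(target_latlon, source_latlons):
    """Return source indices ordered from nearest to farthest (assumed metric)."""
    src = np.asarray(source_latlons, dtype=float)
    d = haversine_km(target_latlon[0], target_latlon[1], src[:, 0], src[:, 1])
    return np.argsort(d, kind="stable")

def rank_by_attributes(target_attrs, source_attrs):
    """Rank sources by Euclidean distance in z-scored attribute space (assumed metric)."""
    src = np.asarray(source_attrs, dtype=float)
    tgt = np.asarray(target_attrs, dtype=float)
    mean = src.mean(axis=0)
    std = src.std(axis=0)
    std[std == 0] = 1.0  # avoid division by zero for constant attributes
    z_src = (src - mean) / std
    z_tgt = (tgt - mean) / std
    return np.argsort(np.linalg.norm(z_src - z_tgt, axis=1), kind="stable")

def nse(observed, simulated):
    """Nash-Sutcliffe efficiency: 1 is a perfect fit, 0 matches the observed mean."""
    obs = np.asarray(observed, dtype=float)
    sim = np.asarray(simulated, dtype=float)
    return 1.0 - np.sum((obs - sim) ** 2) / np.sum((obs - obs.mean()) ** 2)

# Toy example: one target and three candidate sources (made-up values).
target = (52.0, 0.0)
sources = [(55.0, -3.0), (52.1, 0.1), (51.0, 1.0)]
order = rank_by_distance(target, sources)
top_k = order[:2]  # keep only the k most similar sources for pre-training
```

In this sketch, the indices in `top_k` would identify the catchments used for pre-training before fine-tuning on the target; the same pattern applies to `rank_by_attributes` with any chosen attribute combination.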
Sponsorship
EPSRC (EP/Y034643/1)