A decision-led evaluation approach for flood forecasting system developments: An application to the Global Flood Awareness System in Bangladesh
This study aims to evaluate changes in the simulated behaviour of floods and the forecast skill of the two Global Flood Awareness System (GloFAS) versions based on different decision criteria for early action. The authors evaluate GloFAS reforecasts for the Brahmaputra and the Ganges Rivers in Bangladesh for the period 1999–2018. For the Brahmaputra River, the old GloFAS 2.1 version performs better than the 3.1 version, especially in predicting low- (90th percentile) and medium-level (95th percentile) floods. For the Ganges, GloFAS 3.1 shows improved probability of detection of low- to medium-level floods compared to version 2.1, especially for lead times longer than 10 days.
Both versions show limited skill for more extreme floods (99th percentile) but results are less robust for these less frequent floods given the lower number of events. Using lead-time dependent thresholds improves the false alarm ratio while reducing the probability of detection. The changes in model structures influence the model performance in a complex and varied way and forecast skill needs further investigation across regions and decision-making criteria. Understanding the skill changes between different model versions is important for decision-makers; however, focused case studies such as this should also be used by model developers to guide future changes to the system to ensure that they lead to improvements in decision-making ability.