Extreme hydrological events, as one of the most common and destructive disasters in the realm of natural calamities, exert an impact on infrastructure development and socio-economic progress at a global scale (Salman and Li, 2018). In order to mitigate the consequences resulting from extreme hydrological events, individuals often resort to the utilization of historical and monitoring data to establish statistical models or meteorological and hydrological models for the purpose of analysis and prediction (Intergovernmental Panel on Climate, 2023). Currently, the analysis of extreme hydrological events predominantly relies on statistical models built upon historical data, with this approach mainly drawing inspiration from the theory of extremes (Katz et al., 2002).
In 1941, the statistician Gumbel first applied extreme value theory to calculate the return period of floods, which has since been widely developed and applied (Gumbel, 1941; Khaliq et al., 2006; Pan and Rahman, 2022). Currently, the application of extreme value theory mainly involves two steps: 1. Selecting extreme value sequences; 2. Finding the appropriate probability distribution (Gomes and Guillou, 2015). The most common methods for selecting extreme value sequences are Block Maxima (BM), which samples based on fixed intervals, and Peaks Over Thresholds (POT), in which sample values exceed a certain threshold. Although POT is more difficult to apply than BM, the results of POT are more reliable (Bezak et al., 2014; Nguyen et al., 2017; Tabari, 2021). Once the extreme value sequences are selected, distribution fitting can be performed. Then, the fitting results can be evaluated using non-parametric testing methods to determine the optimal distribution and establish a statistical model. In traditional extreme value theory, stationarity is generally regarded as a fundamental assumption. However, in recent years, scholars have raised doubts about the stationarity of hydrological extreme value sequences, considering factors such as climate change and human activities (Salas and Obeysekera, 2014).
In general, stationarity refers to the distribution and statistical parameters of a time series not changing over time, and it is difficult for actual sequences to strictly meet the requirements of stationarity. Broadly speaking, the stationarity of extreme hydrological events refers to the relatively stable range of distribution changes (Slater et al., 2021). The question of whether hydrological sequences are stationary has sparked widespread scientific debate since the assertion of "stationarity is dead" by Milly et al. in 2008 (Milly et al., 2008; Serinaldi and Kilsby, 2015; Salas et al., 2018). The current majority of studies indicate that when a sequence becomes non-stationary due to driving factors that can be reasonably explained by physical mechanisms, non-stationary methods are recommended for analysis; while in other cases, it is advisable to prioritize the use of stationary methods (Serinaldi and Kilsby, 2015; Koutsoyiannis and Montanari, 2015; Harrigan et al., 2018). Therefore, determining whether hydrological sequences exhibit non-stationary characteristics and providing a reasonable explanation based on physical mechanisms remains a significant challenge faced by hydrologists in the present era (Ribes et al., 2017; Slater et al., 2021; Yu et al., 2023).
Currently, commonly used methods for identifying non-stationarity primarily employ statistical tests to determine its presence, aiming to achieve this by detecting changes in statistical parameters (Rahmani et al., 2016; Marston and Ellis, 2021). This approach essentially transforms the identification of non-stationarity in a sequence into the detection of parameter changes within the sequence and indirectly infers non-stationarity through these changes. Research has indicated that employing parameter changes for identifying non-stationarity may introduce uncertainty and errors (Beven, 2016; Wang et al., 2020).
Many studies have examined the non-stationarity of extreme precipitation in the United States using point or gridded precipitation data (Jiang et al., 2016; Um et al., 2017; Swain et al., 2020; Davenport et al., 2021). These studies have indicated the potential non-stationarity of extreme precipitation at daily and sub-daily scales in the US region (Carvalho, 2020; Kirchmeier-Young and Zhang, 2020; Coelho et al., 2022). Currently, the prevailing research approach focuses on assessing the non-stationarity of extreme precipitation through changes in statistical parameters (Um et al., 2017; Alashan, 2018; Zhang et al., 2023). However, it is essential to note that the non-stationarity of extreme hydrological sequences fundamentally refers to the deviation from the assumption of a homogeneous distribution. Surprisingly, few scholars have explored non-stationarity from the perspective of distributional changes.
We propose a new method, the Cumulative Distribution Function Change Indicator (CDCI), to estimate the non-stationarity of hydrological extreme value sequences. We apply this method to precipitation data from 102 meteorological stations in the United States. Based on extreme value theory, we first use the Peaks Over Threshold (POT) method to select extreme precipitation sequences. Then, we assess the most suitable distribution for each station and calculate the distribution change by dividing the sequence into two segments (Rahmani et al., 2016; Iliopoulou and Koutsoyiannis, 2020; Coelho et al., 2022). To validate the reliability of our method, we conduct parallel experiments at 24-time scales. Finally, using CDCI, we investigate the relationship between the non-stationarity, distribution change, and return period variation of extreme precipitation. The main objective of this study is to explore the feasibility of using distribution change to measure sequence non-stationarity and investigate the response of sequence distribution to non-stationarity, providing a foundation for attributing non-stationarity to physical mechanisms.