In this study, we begin by gathering precipitation data from the Indian Meteorological Department (IMD) 0.25°×0.25° grid rainfall dataset, extracting it from a Comma-separated value (CSV) file for analytical purposes. The primary objective was to conduct homogeneity tests on this rainfall data to determine whether any significant changes or trends exist in the precipitation pattern over a the 1960–2020 period. Homogeneity testing is crucial in hydrology and climate studies as it ensures the reliability of rainfall data, which is fundamental for various applications including water resource management, flood forecasting, and climate change analysis (Machiwal and Jha, 2006; Mohanty et al., 2018). XLSTAT, a statistical analysis (www.xlstat.com), offers a suite of tools and functions specifically designed for conducting various statistical tests, including homogeneity tests tailored for rainfall data (Kang and Yusof, 2012). These tests typically involve scrutinizing the statistical properties of the data series over time to detect any systematic changes or discontinuities. Subsequently, we employ the Mann-Kendall (later discussed in Eq. 1, 2, 3 & 4, respectively) (Mann 1945; Kendall 1975) and Sen slope (later discussed in Eq. 5, 6, & 7, respectively) (Sen, 1968) tests within the R statistical package (www.r-project.org) to gain further insights into the behavior of the rainfall data over time. These tests allow us to identify trends, evaluate their significance, and estimate the rate of change in precipitation patterns. R, along with packages like R-ClimDEX (www.acmad.net), provides powerful functionalities for calculating a wide range of climate indices, including extreme rainfall indices. These indices offer valuable information for understanding the impact of climate change on precipitation patterns (Thompson, 1984). Additionally, in ArcGIS v 10.8.2, spatial-temporal analysis by working with datasets that encompass both spatial (geographic) and temporal (time) components was performed. This involves tasks such as georeferencing, spatial interpolation, and temporal analysis to comprehend how spatial patterns evolve over time (Ansari et al., 2020). ArcGIS serves as a comprehensive platform for managing, visualizing, and analyzing spatial-temporal data, thereby enhancing our understanding of the spatial distribution and temporal dynamics of precipitation (Lafreniere and Gilliland, 2015).
This study utilizes a multi-faceted approach, integrating statistical analysis software like XLSTAT and R, along with GIS software such as ArcGIS, to comprehensively analyze precipitation data. Through homogeneity testing, trend analysis, and calculation of climate indices, the aim was to gain deeper insights into the behavior of rainfall patterns, which is crucial for informed decision-making in various fields related to hydrology, climate studies, and environmental management. The workflow diagram illustrating the study is presented in Fig. 2.
3.1. Homogeneity Test
The homogeneity test for rainfall data serves to evaluate the presence of significant changes or trends in rainfall patterns over time (Kang and Yusof, 2012; Das et al., 2018; Marengo, 2006). This test holds particular importance in hydrological and climatological studies, where it verifies the consistency and reliability of rainfall data for diverse applications such as water resource management, flood forecasting, and climate change analysis (Thompson, 1984; Troin et al.2021). Some common homogeneity tests used for rainfall data analysis include:
i. Pettitt test: This non-parametric test is used to detect a single change point in a time series.
ii. Buishand range test: This test is based on the range of the data series and detects multiple change points.
iii. Standard normal homogeneity test (SNHT): This test is based on the comparison of the mean of different segments of the data series to detect abrupt changes.
iv. Von-Neumann test: This test is used for assesses the equality of variances among groups, critical for valid statistical analyses.
3.2 Mann Kandell Test
The Mann-Kendall (MK) test (Mann 1945; Kendall 1975) stands as a robust statistical tool, particularly valuable in the analysis of time series (Douglas et al., 2000; Partal and Kahya, 2006 Tabari and Marofi, 2011; Singh et al., 2023; Abdullahi et al., 2015) data like rainfall records. Its non-parametric nature makes it versatile, capable of assessing trends without demanding strict adherence to data distribution assumptions (Sulaiman et al., 2015; Panda and Sahu, 2019). Initially, the test assigns rank to each data point, accommodating ties by averaging ranks. Subsequently, it computes Kendall's Tau (τ), measuring correlation across different time lags. With τ determination, the test statistic (S) is derived (Eqs. 1 and 2) from the count of concordant and discordant pairs within the dataset, thereby gauging the trend's strength. Following this, the variance (Var(S)) is calculated (Eq. 3) to ascertain the significance of the trend. This, in turn, enables the computation of a Z-score (Eq. 4), crucial for assessing statistical significance. By comparing this Z-score against critical values from the standard normal distribution, the test elucidates whether a statistically significant trend exists within the data, thus aiding in discerning patterns in rainfall behavior over time (Hamed and Rao, 1998; McLeod, 2005; Alhaji et al., 2018).
The MK test statistic S is calculated using the following formula:
S = \(\:\sum\:_{i=1}^{n-1}\sum\:_{j=i+1}^{n}\:sign({x}_{i}-{x}_{j})\) (1)
Where,
n is the number of data points; xi and xj are the data values in time series i and j, respectively
and \(\:sign({x}_{i}-{x}_{j})\) is the sign function as:
$$\:sign({x}_{i}-{x}_{j})\:=\:\left\{\begin{array}{c}+1,\:\:\:\:\:\:\:if\:\:({x}_{i}-{x}_{j})>o\\\:0,\:\:\:\:\:\:\:\:\:\:if\:\:\:({x}_{i}-{x}_{j})=0\\\:-1,\:\:\:\:\:\:\:\:if\:\:({x}_{i}-{x}_{j})<0\end{array}\right.$$
2
The variance is computed as:
Var (S) = \(\:\frac{n\left(n-1\right)\left(2n+5\right)-\:\sum\:_{i=1}^{m}{t}_{i}({t}_{i}-1)(2{t}_{i}+5)}{18}\) (3)
Where,
n is the number of data points; m is the number of tied groups; and \(\:{t}_{i}\) denotes the number of ties of extent.
The standard normal test statistic Zs is computed as:
$$\:{Z}_{S}=\left\{\begin{array}{c}\frac{S-1}{\surd\:Var\left(S\right)},\:\:\:if\:\:S>0\\\:0,\:\:\:\:\:\:\:\:\:\:\:\:\:if\:\:S=0\\\:\frac{S+1}{\surd\:Var\left(S\right)},\:\:if\:\:S<0\end{array}\right.$$
4
The assessment of a statistically significant trend relies on the Z-value. A positive Z-value signifies an upward trend, while a negative value indicates a downward trend.
3.3. Sen’s Slope Estimator
The Sen's Slope test (Sen, 1968) is a valuable tool in the analysis of time series data, including rainfall records, offering a non-parametric approach to detect trends (Trivedi and Gautam, 2022). This method is particularly advantageous when working with datasets of varying sizes and distributions. The test involves calculating the differences and slopes between pairs of data points in the time series, ultimately deriving the median slope (Eq. 5) (Sen, 1968), which represents the overall trend in the data. Unlike some other trend analysis methods, Sen's Slope test is robust against outliers and can effectively handle irregularly spaced data points. Its significance can be determined through hypothesis testing or by establishing confidence intervals around the median slope. When applied to rainfall data, Sen's Slope test aids in identifying significant trends in precipitation patterns over time (Hollander and Wolfe, 1973, Gilbert, 1987), providing insight into whether rainfall is increasing, decreasing, or remaining relatively stable throughout the analyzed period (Lettenmaier et al., 1994; Yue and Hashino, 2003; Yunling and Yiping, 2005; Tabari and Marofi 2011).
Sen’s Slope = \(\:Median\:\{\:\frac{{x}_{j}-{x}_{k}}{j-k}\::i<j\}\) (5)
A, 1– α confidence interval for Sen’s slope can be calculated as (lower, upper)
Where,
N = C (n, 2) k = Var (S) ⋅ \(\:{Z}_{crit}\) (6)
Lower = \(\:{m}_{(\text{N}-\text{k})/2}\) Upper = \(\:{\text{m}}_{(\text{N}+\text{k})/2+1}\) (7)
Here, N = the number of pairs of time series elements (\(\:{x}_{i}\:\),\(\:\:{x}_{j}\)) where i < j and Var (S) = the standard error for the Mann-Kendall Test.
Also,
\(\:{m}_{h}\) = the hth smallest in the set {(\(\:{x}_{j}\)–\(\:{x}_{j}\))/(j–i): i < j} and \(\:{z}_{\text{c}\text{r}\text{i}\text{t}}\) = the 1–α/2 critical value for the normal distribution.
3.4 Extreme Rainfall Indices
The Expert Team on Climate Change Detection and Indices (ETCCDI) has endorsed a compilation of 27 extreme rainfall and temperature indices designed to assess extreme weather events (Singh et al, 2022). These indices rely on daily temperature and precipitation data. In recent years, numerous studies have extensively examined these indices at both regional and global levels. (Sharma et al., 2018; Pradhan et al., 2019; Sillmann et al., 2013; Kim et al., 2020). This study delves into seven key precipitation indices, selected from a larger set of 27. These indices include Consecutive Dry Days (CDD), Consecutive Wet Days (CWD), R10 (the count of heavy precipitation days), R20 (the count of very heavy precipitation days), RX1 day (maximum precipitation in a single day), RX5 days (maximum precipitation over five consecutive days), and Simple Daily Intensity Index (SDII). CDD measures the uninterrupted duration of days without precipitation, indicating prolonged dry conditions which can lead to droughts impacting agriculture, water availability, and ecosystems. Conversely, CWD quantifies the uninterrupted duration of days with precipitation, signaling prolonged wet periods that heighten risks of flooding, soil erosion, and landslides, affecting infrastructure and agriculture. R10 and R20 signify the frequency of heavy and very heavy precipitation events respectively, indicating increased risks of flash floods and urban drainage issues. RX1 day highlights the intensity of extreme rainfall events, offering insights into potential hazards like flash floods and landslides, while RX5 days assess the persistence of heavy rainfall, affecting agriculture, water management, and ecosystems. SDII provides information on the average daily precipitation intensity, crucial for understanding runoff, erosion, and flooding risks in both urban and rural areas. These indices collectively offer valuable insights into precipitation dynamics and their impacts on various sectors, aiding in risk assessment and management strategies. These indices were computed using Climate Data Operators (CDO) software. Table 1 provides the list of precipitation indices utilized, along with their definitions and units.
Table 1
The list of precipitation indices used, and the definition and units.
Indices
|
Definition
|
Unit
|
CDD
|
Maximum number of consecutive dry days with RR < 1 mm
|
Days
|
CWD
|
Maximum number of consecutive wet days with RR > 1 mm
|
Days
|
R10
|
Annual count of days when PRCP ≥ 10 mm
|
Days
|
R20
|
Annual count of days when PRCP ≥ 20 mm
|
Days
|
RX1
|
Monthly maximum 1-day rainfall
|
mm
|
RX5
|
Monthly maximum 5-days Rainfall
|
mm
|
SDII
|
Annual total precipitation divided by the number of wet days
(defined as PRCP ≥ 1.0 mm in the year)
|
mm/year
|