3.1.1 Selection Criteria
RCAnMax (Ability of model to simulate long term annual maximum): This criterion is used to capture interannual variability of rainfall over the all India monsoon region. First, the maximum of each year is calculated which is called annual maximum after that, average of the maximum value is calculated then we compared with the observed value, whichever run value was near to the observed value, a highest rank was assigned to that run.
RCMSE (Mean square error of daily precipitation): This criterion is used to capture the daily variability of all India monsoon rainfall. First, mean square error of both observed data and model data are calculated then deviation from the observed data is calculated, whichever run value was near to the observed value, a highest rank was given to that run. Mean square error is calculated by the following formula (1).
$$MSE= \frac{1}{n}\sum _{i=1}^{n}{{(Y}_{oi}-{Y}_{mi})}^{2} \left(1\right)$$
\(\)where, Yoi = observed rainfall data
Ymi = model rainfall data
n = total no. of data
RCRMS (Root mean square of daily precipitation):- This criterion is used to capture the daily variability of all India monsoon rainfall. First, the root means square of both observed data and model data are calculated then deviation from the observed data is calculated, whichever run value was near to the observed value, a highest rank was given to that run. Root mean square is calculated by the following formula (2).
$$RMS= \sqrt{\frac{\sum _{i=1}^{n}{{(Y}_{oi}-{Y}_{mi})}^{2}}{n}} \left(2\right)$$
where, Yoi = observed rainfall data
Ymi = model rainfall data
n = total no. of data
RCCMZCorr (Correlation of JJAS daily precipitation of core monsoon zone): This criterion is used to capture grid wise seasonal variability of core monsoon zone rainfall. First, the correlation coefficient of both the observed data and model data is calculated then deviation from the observed data is calculated, whichever run value was near to the observed value, a highest rank was assigned to that run. Correlation coefficient is calculated by the following formula (3). (Rajeevan et al., 2010)
$$r= \frac{\sum _{i=1}^{n}({X}_{i}-\stackrel{-}{X})({Y}_{i}-\stackrel{-}{Y})}{\sqrt{\sum _{i=1}^{n}{({X}_{i}-\stackrel{-}{X})}^{2}}\sqrt{\sum _{i=1}^{n}{({Y}_{i}-\stackrel{-}{Y})}^{2}}}$$
3
where, Xi = observed rainfall dataset
\(\stackrel{-}{X }\) = mean of observed rainfall
Xi = observed rainfall dataset
Yi = Model rainfall dataset
\(\stackrel{-}{Y}\) = mean of model rainfall
\(r=\) Correlation coefficient
Figure 5 shows the correlation between the core monsoon zone and rainfall over all India nodes in IMD and run060. Pink color shows the minimum value of the correlation coefficient and dark purple color shows the maximum value of the correlation coefficient. Maximum correlation coefficient is observed over the core monsoon zone. Therefore, we can confirm that there is a good correlation between observed data and run060. So we are selecting run number 060 because it shows a good correlation with observed data.
RCRXAday (Skill score of JJAS daily precipitation): This criterion is used to capture the daily variability of all India monsoon rainfall. First, the probability distribution of both the observed and model data is calculated for a given bin then the cumulative minimum value of the observed and model data is calculated which is called skill score value (Change and Frontiers, 2007). Whichever runs skill score value is observed maximum that run is given the highest rank. If a model simulates observed condition perfectly it will have a skill score is equal to one, it means that there is a perfect overlap between the observed data and model data. If a model simulates observed condition poorly it will have zero skill score with negligible overlap between the observed data and model data. S score is calculated by the following equation (4). (Perkins et al., 2007)
$$S score= \sum _{1}^{n}min({Z}_{m},{Z}_{o}) \left(4\right)$$
Where, n = no. of bins used to calculate Probability Density Function (PDF)
Zm = Probability of model data in a given bin
Zo = Probability of observed data in a given bin
Figure 6 shows the comparison of skill score between observed data and run078. In this figure, the y-axis represents the probability, and the x-axis represents precipitation in mm/day. Blue colour histogram represents observed value and orange colour histogram represents run078 value. Maroon colour histogram represents overlapping between observed data and run078. The graph shows that there is a 74% overlap between observed data and run078. So we are selecting run 078 because it shows the good overlap with the observed data.
RCR95P (Annual JJAS daily precipitation > 95th percentile of JJAS wet days): This criterion is used to capture spatial and extreme variability of all India monsoon rainfall. Wet days are defined as the annual count of JJAS days when daily precipitation is ≥ 1mm/day. Ninety-five percentile of JJAS wet days is calculated and we are taken only that value of annual JJAS daily precipitation whose value is greater than 95 percentile of wet days, similarly for the observed datasets. After that, we calculated the skill score for both the observed data and model data. Whichever runs skill score value is maximum that run is given the highest rank. (Diaconescu et al., 2017)
RCRX5Day (Annual JJAS maximum of 5-day accumulated precipitation): This criterion is used to capture the spatial variability of all India monsoon rainfall. Five-day accumulated precipitation is calculated by the moving average method. From the five-day accumulated precipitation, we are taken the annual maximum of each JJAS, similarly for the observed datasets. After that, we calculated the skill score for both the observed data and model data. Whoever skill score value was maximum that value is given the highest rank. (Diaconescu et al., 2017)
Table 2
Sr.
No.
|
Criteria
Code
|
Description
|
Formula
|
Run No.
|
1
|
RCAnMax
|
Long term annual maximum
|
|
052
|
2
|
RCRX5Day
|
JJAS max of 5-day accumulated precipitation
|
\(S score= \sum _{1}^{n}min({Z}_{m},{Z}_{o})\)
|
056
|
3
|
RCCMZCorr
|
Correlation of JJAS daily precipitation of CMZ
|
\(r= \frac{\sum _{i=1}^{n}({X}_{i}-\stackrel{-}{X})({Y}_{i}-\stackrel{-}{Y})}{\sqrt{\sum _{i=1}^{n}{({X}_{i}-\stackrel{-}{X})}^{2}}\sqrt{\sum _{i=1}^{n}{({Y}_{i}-\stackrel{-}{Y})}^{2}}}\)
|
060
|
4
|
RCR95P
|
Annual JJAS precipitation > 95th percentile
|
\(S score= \sum _{1}^{n}min({Z}_{m},{Z}_{o})\)
|
070
|
5
|
RCRMS
|
Root mean square of daily precipitation
|
\(RMS= \sqrt{\frac{\sum _{i=1}^{n}{(Yoi-{Y}_{mi})}^{2}}{n}}\)
|
073
|
6
|
RCMSE
|
Mean square error of daily precipitation
|
\(MSE= \frac{1}{n}\sum _{i=1}^{n}{(Yoi-{Y}_{mi})}^{2}\)
|
074
|
7
|
RCRXADay
|
S score of JJAS daily precipitation
|
\(S score= \sum _{1}^{n}min({Z}_{m},{Z}_{o})\)
|
078
|
Figure 7 shows the selection of the best runs based on different criteria. In this figure, the rows represents seven criteria and the column represents runs. The value represented with red colour shows the highest rank (minimum value) and shades of grey colour shows the rank for each of these criteria. It is clearly visible from the graph that for RCAnMax, RCRX5Day, RCCMZCorr, RCR95P, RCRMS, RCMSE, and RCRXADay criteria, run052, run056, run060, run070, run073, run074, and run078 is the best run. This is also summarized in Table 2.