Delineating and characterizing changes in heat wave events across the United States climate regions

Exposure to extreme heat, or heat waves, represents a public health threat as well as an environmental health threat. With projections for further increases in temperature in some regions, resulting from global environmental change, it is important to understand the spatiotemporal impacts of heat waves in order to minimize risk. To understand heat wave impacts, one cannot look solely at the spatial and temporal temperature regimes but much also consider key heat wave characteristics: duration, size, magnitude, frequency, and region of impact. To understand the consequences of heat wave events, it is critical to analyze such extreme events based on the cumulative impacts of these characteristics. This study, which is conducted across the whole of the continental United States, looks to map and analyze such cumulative impacts of heat wave characteristics. Heat waves were spatially defined for the period of May–September of each year. Using persistence analyses and the development of a new index (combined heat wave characteristics index (CHCI)), we can define regions of consistent heat wave exposure and impact. Results show that there are large differences across the United States in terms of heat wave exposure and impacting heat wave characteristics. Across much of the analysis, a clear east versus west difference in patterns is seen. Overall, such work shows how and where differing but covarying heat wave characteristics impact the United States. Information from this work can be combined with demographic and health metrics to better pinpoint susceptibility to heat waves—and therefore inform better management decisions.


Introduction
A single extreme event (e.g., flood, drought, and heat wave) can, without a doubt, have a large impact on the socioeconomic condition as well as environmental and human health (Kravchenko et al. 2013;Keellings & Waylen 2014a, b;Chen & Li 2017). When an area is repeatedly impacted by extreme events, especially those as impactful as heat waves, there are longer term socioenvironmental impacts to consider. The frequency, intensity, and duration of heat wave events have increased over time as seen in regions such as Europe, China, Australia, and the United States (US) (Field et al. 2012;Seneviratne et al. 2014;Keellings & Moradkhani 2020). Additionally, these events are projected to increase as climate change progresses leading to more frequent, more intense, and longer-lasting events in the future (Meehl & Tebaldi 2004;Barros et al. 2014;Lopez et al. 2018). Recent work suggests that around half of the world's population will be annually exposed to deadly heat waves by 2100 and that even in the US there will likely be a four-to-six-fold increase in exposure to heat waves by 2070 (Jones et al. 2015;Mora et al. 2017). Despite the severity of these projections, heat waves are not universally defined thus hindering inter-regional transferability of knowledge about these events.
Heat waves are typically defined as a sequence of days with temperature exceeding a high threshold of daily maximum or mean temperatures. The heat wave threshold is often between the 90 th and 99 th percentiles of the local temperature distribution, and it may or may not include some measure of humidity. Numerous heat wave metrics have been developed (Meehl & Tebaldi 2004;Hajat et al. 2006;Tan et al. 2007;Anderson & Bell 2009;Peng et al. 2011;Keellings & Waylen 2014a, b;Photiadou et al. 2014;Russo et al. 2014), yet there is still much debate as how best to define heat waves, and no definition or metric has been widely accepted (Perkins-Kirkpatrick et al. 2016). In practice, the US National Weather Service (NWS) issues excessive heat watches and warnings based on elevated heat index (combination of temperature and relative humidity) temperatures. However, the NWS-defined heat index temperature threshold for excessive heat varies across forecast offices and is based on a collaboration between local NWS offices and local partners. Unlike other extremes, such as hurricanes and tornadoes, heat waves have no universally accepted definition, and this complicates comparisons between heat wave events and the analysis of their impacts. This lack of an accepted definition of heat waves highlights the need for exploration of how best to conceptualize these events and how to develop a more unified definition. There is substantial knowledge on physical drivers of heat waves, but there is need for further quantification of the relationships between drivers and heat wave characteristics across larger regions and more events (Perkins 2015).
Extensive research has been conducted to understand, characterize, and predict heat waves (Baldwin et al. 2019). Recent research has found that heat waves in North America are increasing in number and spatial extent at the continental scale, but they are becoming smaller and more fragmented at the local level (Keellings et al. 2018). Adverse impacts of these events have been reported, including declines in agricultural yields (Mishra & Cherkauer 2010;Wreford and Neil 2010), elevated human mortality (Guo et al. 2017;World Meteorological Organization 2015;Merte 2017;Singh et al. 2021), failure of critical infrastructure (Clark et al. 2019), and increased energy demands (Miller et al. 2008;Zhao et al. 2018). The level of vulnerability of a system to the effects of heat waves varies by location, dictated by the system's physical, social, and demographic characteristics. For example, urban residents are more vulnerable to the impacts of heat waves due to the compounding effect of urban heat island effect (Habeeb et al. 2015;Zhao et al. 2018), and even in these areas, vulnerability is highest in inner cities compared to peripheral locations (Fenner et al. 2019). Yet, lacking in current literature is an understanding of the spatial dynamics and what drives aspects like areal extent and intensity of heat waves as changes in these spatial characteristics could have serious implications for socioecological impacts (Hattis et al. 2012;Kloog et al. 2015;Lee et al. 2016;Keellings & Moradkhani 2020). In addition, only a few heat wave characteristics have been studied (especially frequency, magnitude, and/or intensity), leaving out other important characteristics that potentially drive observed heat wave patterns. The current study presents a novel approach for characterizing spatial dynamics in heat waves across the contiguous US by assessing spatiotemporal patterns in the most persistent and longest/shortest-duration heat wave events.
In this study, we present a methodology for identifying regions impacted by differing and/ or compounding heat wave characteristics. We use Geographic Information Systems (GIS) and overlay analysis to identify regions where temperatures are persistently extreme and compute at multiple scales (monthly, seasonal, and decadal) various characteristics associated with the heat wave regions.
This study looks to highlight new approaches to studying heat wave impacts that are both multivariate in nature and spatially explicit. It is widely recognized that heat waves will intensify as the climate changes. Hence, effective tools are necessary for quantifying and assessing heat wave incidences for better planning and management (Das et al. 2020). Indices have commonly been used as they reveal how multiple variables interact, aggregate, and affect a given location (Defne et al. 2020). In heat vulnerability studies, several indices have been proposed including heat wave duration index (HWDI) (Frich et al. 2002), heat wave magnitude index (HWMI) (Russo et al. 2014), and excess heat factor (EHF) (Nairn & Fawcett 2014), among others. Over time, the combined effects of heat wave characteristics have been recognized and necessitated the need for indices that integrate different aspects of heat waves. Recently, the combined heat wave index (CHI) was proposed (Das et al. 2020). By integrating heat wave magnitude, duration, and extent, CHI allows for greater discrimination among events, thus enabling one to chronicle fine scale impacts on the landscape. Elsewhere, the Heat Severity and Coverage Index (HSCI) combines magnitude, frequency, areal extent, and duration to characterize heat waves more comprehensively (Keellings & Moradkhani 2020). Two metrics are developed in this study: persistence and the combined heat wave characteristics index (CHCI). At its simplest, the persistence metric highlights how many times a heat wave, over time, has consistently impacted the same area. Further, we breakdown the heat wave data based on duration of events, therefore further identifying whether a region has been consistently impacted by long-or short-duration events. CHCI, the other metric develop in this study, looks to combine all heat wave characteristic variables into a single index. This is different from the aforementioned indices related to heat waves in that CHCI also include an output called CHCI components that helps the index users to interpret what heat wave characteristic variable is driving the CHCI trends.
Analysis in this study was performed per climate region in the contiguous US, and a comparative analysis was performed to reveal patterns and any significant differences in the occurrence, duration, region of impact, and severity of heat waves in the regions.

Data
The Parameter elevation Regression on Independent Slopes Model (PRISM) daily temperature is utilized to identify heat waves (see Keellings & Moradkhani 2020). These data are a combination of meteorological station observations with a digital elevation model that yields a gridded surface for the continental US with a spatial resolution of 4 km from 1981 through 2018 (Daly et al. 2008;Thornton et al. 2014). PRISM has gone through quality control, been validated against heat wave metrics from four reanalysis products, and is widely used in climatological studies (Schoof et al. 2017).
This study was conducted across the continental US, though analysis was subdivided by the NOAA climate regions (Karl & Koss 1984) (Fig. 1). There are nine climate regions in total: Northwest, Southwest, West, West North Central, South, East North Central, Central, Northeast, and Southeast. While the analysis was done at the regional level, the comparison and statistical analysis highlights patterns across the continental US. NOAA climate regions were selected as the unit of analysis given that climate patterns extend beyond state borders.
Heat wave data were delineated using the methodology outlined by Keellings & Moradkhani (2020). A threshold equal to the 95 th percentile of local (grid cell) daily maximum temperature for each extended summer month (May through September) in each year is used to define heat waves. Heat wave events must also last for at least three consecutive days, and events are considered temporally independent if separated by four or more days of below threshold temperatures. A hierarchical clustering algorithm works in combination with indicator semivariograms to identify clusters of heat and delineates heat wave events by Euclidean distance between clusters. A bounding polygon is then drawn around all heat clusters belonging to a single heat wave, and the polygon is tracked spatially at the daily time-step. As the heat wave is tracked from genesis to lysis, metrics are calculated daily to determine the heat wave area, magnitude, duration, and fragmentation of heat within the bounding heat wave polygon. To perform this, spatial analysis of heat waves requires fine-resolution gridded data, and as such, the PRISM dataset (https:// prism. orego nstate. edu) was chosen as it has been utilized extensively in applied climatological studies. Numerous studies have made use of PRISM to analyze extreme temperatures or heat waves using similar event definitions and similar or shorter lengths of record (see, for example, Lu and Kueppers 2015;Schoof et al. 2017;Keellings and Moradkhani 2020). It should also be noted that our analysis is undertaken at a regional scale and is based on spatially identifying events rather than examining individual time series at single locations. Therefore, a single event encompasses potentially thousands of grid cells that have crossed their local heat wave threshold and the bounding event polygon will also encompass grid cells that have not crossed their local heat wave threshold, but are nonetheless within the identified heat wave area.

Methods
This section describes methods used in this study. These include systematic data thresholding, polygon flattening, and variable compositing applied to various heat wave characteristics.

Computing heat wave persistence
A GIS model was developed to compute heat wave persistence. This model, developed in ArcPy and implemented in ArcGIS Pro, was generally used to define and count the number of overlapping heat wave polygons at a location. The initial heat wave polygons can be thought of as a set of rings A, B, and C with overlaps (Fig. 2). This tool will flatten the rings and produce polygons with no overlap. The output includes more features than the input, and each feature has a count equivalent to the number of polygon overlaps. The counts represent heat wave persistence. The methodology for computing heat wave persistence is as follows: initially, the model creates an empty point feature class. It then creates a feature class containing polygons generated from areas enclosed by input heat wave polygons, while using the empty point feature class to label the polygons. The output polygons are then converted to points. Finally, attributes from the point feature class are spatially joined with the polygons. At this step, only important heat wave characteristics (see Table 1) are recalculated and retained in the output feature class.

Identifying regions of most and least extreme heat waves
This analysis was initially conducted per climate region. To identify areas with the most (least) extreme heat waves, we selected areas in the upper (lower) quartile based on the heat wave duration variable. This data reduction step allowed us to compare between the most (TOP25) and least (BOTTOM25) extreme heat wave event durations within each climate region. With such a breakdown of the data, we can not only look at extreme long duration events but also the impact of events which might be short in nature but extreme in other aspects, say magnitude or size. One key aspect of this paper is an analysis of multiple heat wave characteristics on the landscape, and this is done via the breakdown between the top 25 and bottom 25 percent duration events. Next, we computed heat wave persistence using the model described in Fig. 2. A second round of thresholding was performed by identifying regions in the upper quartile based on heat wave persistence. The outputs were then merged to generate regions across the US on which the next part of the study focused. This was done on both TOP25 and BOTTOM25 polygons. Descriptive statistics were computed to characterize average temperatures, area, duration, fragmentation, persistence, magnitude, and number of patches for the identified heat wave regions.

Combined heat wave characteristics
Since multiple heat wave variables were assessed in this study, it was necessary to generate composites, with the aim of identifying patterns in the ways these variables covaried across the climate regions. To achieve this, heat wave fragmentation, mean magnitude, number of patches, and mean area were each standardized and then combined by multiplication to compute a combined heat wave characteristics index (CHCI). See Eq. 1 below.
where CHCI is the combined heat wave characteristics index, Mean Area is the mean size of heat wave events during the set time scale by pixel, Mean Mag is the mean magnitude of heat wave events during the set time scale by pixel, Mean NP is the mean number of patches of heat wave events during the set time scale by pixel, and Mean Frag is the mean fragmentation score of heat wave events during the set time scale by pixel. The fragmentation score was developed and presented by Keelings and Moradkhani 2020.
The outputs here would therefore represent the spatiotemporal variability of heat waves driven by these four variables at locations with the longest (or shortest) and most persistent heat wave events. Another composite was generated to indicate categorized values for each of the four variables at a location using Eq. 2 below. To do this, we first reclassified the standardized variables into three classes defined below.
1. Low-values from 0 to the lower quartile 2. Mid-values from the lower to the upper quartiles 3. High-values from the upper quartile to 1 (1) CHCI = MeanArea * MeanMag * MeanNP * MeanFrag where Comp is the component of CHCI, Mean Mag is the mean magnitude, Mean NP is the mean number of patches, and Mean Frag is the mean fragmentation score. All four heat wave variables in this equation were reclassified into the three classes above.
The results from this computation were maps indicating the categorized components of heat wave events at each location. For instance, a pixel with a Comp value of 3213 indicates that the heat wave at the pixel had high mean area, mid mean magnitude, low mean fragmentation, and high mean number of patches. This composite would therefore help in explaining the observed patterns in the CHCI.
To ensure a detailed characterization of heat wave events in the US, we performed these analyses over several temporal scales including the entire 38 years , by decade (1981-1990, 1991-2000, 2001-2010, and 2011-2018), and by month (May, June, July, August, and September). The 38-year time span was determined based on data availability. This would provide better insights into how most (and least) extreme heat wave events have changed over time in the US. The flowchart (Fig. 3) shows the main steps and analyses performed.

Results
This study delineates (using GIS overlay, thresholding, and data compositing methodologies) and characterizes long-term seasonal, decadal, and monthly heat wave patterns in US climate regions.

Heat wave extent per climate region
This study found that, generally, longer heat wave events (TOP25) covered vastly more of the climate regions compared to the shortest events (BOTTOM25). TOP25 heat wave regions constituted as much as 57% of the land area in Northwest, 51% of the Southwest, and 47% of the area in Eastern North Central (refer to Fig. 4). Top three climate regions by area include the Southwest (> 557,000 km 2 ), South (> 524,000 km 2 ), and Northwest (> 368,000 km 2 ). Proportions of area with BOTTOM25 events, on the other hand, ranged from 11% in Central US to 41% in the Southwest. Based on land area, Southwest, South, and Western North Central regions were top ranked for BOTTOM25 events, with the shortest-duration heat waves extending over > 452,000 km 2 , 262,000 km 2 , and > 229,000 km 2 , respectively. Regions affected by BOTTOM25 and TOP25 events were similar (in size and location) for Western North Central (19% vs. 22%), Western (32% vs. 39%), and the Southwest (41% vs. 51%).
Different patterns were observed in heat waves over the decade scale (Fig. 5). Separability tests show statistically significant trends between TOP25 and BOTTOM25 both by region and across decades; however, the n for such statistics is too small to present. Additionally, statistics work is being developed that looks at the statistical difference for both persistence and the CHCI index at the pixel rather than event scale. In the 1980s, TOP25 events were commonest in the Northeast (> 73% of land area), Northwest (> 64% of area), and Central US (> 53%). In the 1990s, the Northwest topped with over 70% of its area experiencing TOP25 events, followed by the Southwest (> 65%) and the Northeast (> 55%). Different patterns were observed in the 2000s, in which the West region was ranked second (having TOP25 events in over 70% of land area) after the Northwest (> 72%). The Southeast was ranked third in this decade with over 67% of its land area experiencing these extreme heat events. Northwest, West, and Northeast regions had the highest proportions of area under TOP25 heat waves (approx. 67%, 62%, and 59%, respectively). Proportions of area under BOTTOM25 events were always lower than those for TOP25 events, except for West North Central (in the 1980s), Southwest (1980s), Central (in the 1990s), East North Central (in the 1990s and 2010s), and South (in the 2000s). Proportions of land under heat waves generally increased from a decade to the next in Northwest and Southeast, except for the 2000s to 2010s. Consistent expansion of area under heat waves was observed in the West North Central region. The study found that, for some climate regions, heat waves extended over more land in the months when extremely high temperatures were expected to subside (Fig. 6A, B). For instance, the highest proportions of area under TOP25 heat waves were observed in the month of May for the Southwest, Southeast, Northeast, and South (84%, 78%, 71%, and 60%, respectively, see Fig. 6A). Similar patterns were observed in Northwest and Central regions for the month of September (Fig. 6A, B). Here, heat waves were respectively identified in approx. 96% and 81% of the land area (Fig. 6A). This indicates that the severe heat wave period is expanding for these climate regions. Proportions of land under TOP25 heat waves generally exceeded proportions of land under BOTTOM25 events, except for Northwest in May (0% vs. 22%) and West North Central in August (16% vs. 17%).

Long-term heat wave characteristics
Heat wave characteristics generally showed similar patterns among the shortest-and longest-duration events. While area of heat waves was significantly higher for the TOP25 group compared to BOTTOM25 (US averages of 606,000 vs 481,000 km 2 , respectively), there were similar spatial patterns in distribution (Fig. 7A). In both groups, heat wave events were highly extensive in the South and Southwest regions. Events in these regions, respectively, had average areas of 887,000 km 2 and 758,000 km 2 (TOP25) and 597,000 km 2 and 667,000 km 2 (BOTTOM25).
The least expansive heat wave events of all were found in Northeast in both TOP25 and BOTTOM25 groups (average area of 312,000 km 2 and 254,000 km 2 , respectively) and East North Central (265,000 km 2 ) in the BOTTOM25 group. Average temperatures ranged between 28 and 38 °C for the shortest-duration events (US average of 32.7 °C) and 30 to 38 °C for the longest-duration events (US average of 34.6 °C). BOTTOM25 events in the Northwest and Northeast regions experienced comparatively lower temperatures (Fig. 7B). Temperatures averaged 28.4 and 30.1 °C in these regions, respectively.
West region events in both groups (TOP25 and BOTTOM25) were found to have the highest mean magnitude (Fig. 7F)-averaging 1.83 (Celsius above T95 threshold) for BOTTOM25 events and 1.94 (Celsius above T95 threshold) for TOP25 events. TOP25 events in this region were also the most persistent of all (Fig. 7E), meaning there were large numbers of overlapping heat wave events over time. Heat wave persistence averaged 2000 events for this region compared to a US average of 475. The least persistent heat wave events (i.e., lower number of overlapping heat wave events over time) were found mainly in the East North Central, Central, and Northeast regions (mean persistence < 183 events). Further, heat wave events experienced in the eastern half of the US were more cohesive than the western ones as shown by the fragmentation statistic (Fig. 7D). Events in the East North Central were most cohesive of all with an average fragmentation statistic of 0.69 and 0.76 for the shortest-duration and longest-duration events, respectively. The Southwest had the most fragmented events, with the fragmentation statistic for the shortest-duration and longest-duration events averaging 0.18 and 0.2, respectively. The study also found that the shortest-duration events were generally less patchy (ranging from 2200 to 12,000 patches or individual heat wave event clusters) compared to the longest-duration events which ranged from 4200 to over 19,700 patches (Fig.7G). The longest-duration events in the South, East North Central, and Central had the greatest number of patches (> 15,000). TOP25 events exhibited significantly different durations compared to BOT-TOM25 events. The former events lasted longer (14-99 days) than the latter (2.8-9.6 days) on average (Fig. 7C). However, this results from the initial use of this variable in defining BOTTOM25 and TOP25 heat wave events. Clear inter-region differences were observable in this variable especially in the TOP25 events. The West and South events lasted the longest (more than 98 and 71 days, respectively).

Decadal heat wave characteristics
Spatial patterns observed in the decadal distribution of heat wave characteristics were similar to those found in 1981-2018. For example, heat wave events in both groups were most cohesive in East North Central and generally most fragmented in Southwest (compare Figs. 8 and 7D). However, the study found that the higher-than-average fragmentation statistic in East North Central was influenced greatly by events in the 1980s and 2000s (average fragmentation statistic was 0.77 and 0.73, respectively). Further, the highest magnitude (for both groups) and most persistent events (for the longest-duration group) were consistently observed in the West region. In this region, mean magnitude was > 1.79 overall while mean persistence ranged 399-620 (for the longest-duration events). However, by splitting the data into individual decades, the study found that while the shortest-duration events occurred in each decade for all climate regions, the longest-duration events were absent in East North Central (in the 1990s and 2010s) and In addition, the study quantified changes in average heat wave characteristics per climate region. Results indicate that, generally, average heat wave characteristics varied more for the longest-duration events compared to the shortest-duration events. Among the characteristics, number of patches and heat wave persistence showed greater variability across the decades (Fig. 9). The highest increases in average heat wave persistence occurred in TOP25 events in West North Central (+ 377% in the 1990s-2000s), Southwest (+ 306% in the 1990s-2000s), Southeast (+ 279% in the 1980s-1990s), and South (+ 251% in the 1980s-1990s and + 125% in the 2000s-2010s). The greatest changes in number of patches were found in West North Central (+ 318% in the 1990s-2000s) and South (+ 128% and + 176% in the 1980s-1990s and 2000s-2010s, respectively). Other TOP25 heat wave characteristics that saw major increases include mean area (+ 75% in South in the 2000s-2010s, + 71% in West North Central in the 1990s-2000s), average fragmentation (increase of 67% in West North Central in the 1990s-2000s), and mean magnitude (58% increase in South in the 2000s-2010s). The greatest changes in BOTTOM25 events were found in number of patches in the Southwest, West (respective increases of 96% and 70% in the 1990s-2000s), and Central (+ 72% in the 2000s-2010s) regions. Average temperatures showed the least variability across decades for both event groups. Rates of change for most of the variables in the South were lower for 1990s-2000s compared to the other two periods. Contrasting patterns were seen for most heat wave characteristics in West North Central, as well as some elsewhere (e.g., number of patches and persistence in the Northwest, Southwest, and West). These patterns are more detectable in TOP25 events.

Monthly heat wave characteristics
The temporal heat wave characteristic results fall in line with seasonal temperature trends common in the U.S. during the summer months. As previously mentioned, heat wave events were defined based on crossing the T95 threshold which is determined for each Meaning that as we progress throughout the summer season and temperature patterns increase, we see a similar trend of increasing heat wave persistence. Specifically, across this study period, heat wave persistence, for both the top and bottom 25% events, shows a distinct pattern that varies both temporally and spatially (Fig. 10). The highest persistence is seen in events in July and August (for the TOP25 group) and in May and September (for the BOTTOM25 group). For the top 25%, in May, the average persistence across the US was 20. However, the South and Southeast regions were above this average with 41 and 25, respectively. In June, the average US-wide persistence for the top 25% was 71, 255% higher than May. This average was greatly influenced by persistence in events occurring in West, Southwest, and South which, respectively, had persistence values of 333 (+ 1981% from May), 78 (+ 457% from May), and 71 (+ 73% from May). In July, the West persistence values further increased by 77% compared to June. Overall, in July, the US-wide persistence averaged 168 but the West, Southwest, and parts of the Northwest were above this average. August had a shockingly similar spatial pattern to the July persistence with a US-wide average of 170. Lastly, in September, the persistence dropped to 58 across the US. All regions exhibited a sharp decline in the persistence values at this time. Overall, in September, the US pattern of persistence was homogenous, except for events in the West (average persistence of 339).
A higher persistence of heat wave events was seen in the beginning of the season for the bottom 25% events but a switch was observed as summer progressed, and the persistence numbers were higher for the top 25%. In May, the average persistence value for the bottom 25% was 144, almost six times greater than during top 25%. Average persistence in June reached 107 for the US which was 49% higher than the top 25%. These values continued dropping to 87 in July. However, in August, the average persistence switched to 94 and continued to increase to 128 in September. For the bottom 25%, there was a distinct difference in the persistence between the eastern and western portions of the country. In May, the western regions (West, Southwest, and Northwest) had the highest persistence across the whole country. In June, this pattern was observed in the same western regions and West North Central in addition. It was also observed that bottom 25% events were generally homogenous across the summer months compared to top 25% (which had many extremely persistent events especially in West).
Conversely, to the persistence metric, the heat wave patch counts do not align perfectly with summer seasonal temperature trends (Fig. 11). Overall, we see that heat waves are more patchy at the start and end of the summer season and less patchy (i.e., more cohesive) during the middle most extreme portion of the summer season. More specifically, within the top 25% group, the maximum number of patches, generally, was seen in May (US mean of 16,100), June (US mean of 14,100), and September (US mean of 15,000), a result in contrast to the persistence results which exhibited higher values during the months of July and August. Further, for the top 25%, the South region consistently had the highest number of patches compared to all other regions (ranging 12,500-29,600). The West coast regions, Northwest and West, had some of the lowest number of patches, for the top 25%, across May, July, and August when compared with the other regions. In May, for instance, number of patches for heat wave events in the West averaged 5042 compared to the US average of 16,058. The number of patches for the lower 25% heat waves was significantly lower than the top 25% (e.g., US average of 5249 and 14,084 patches for bottom 25% and top 25% events, respectively, over the month of June). Additionally, there is not a large difference in number of patches across the summer months for the shortest-duration events. The highest numbers were observed in the West North Central region (average range of 5664-13,075 across the summer months) compared to the rest of the study area.
Other major changes include significant areal coverage increases in mean magnitude (+ 72%), fragmentation (+ 57%), and mean area (+ 47%), in the West North Central during the August-September period. More specifically, in this region, larger swaths of land are being impacted by longer duration and higher magnitude, area, and fragmented heat wave events. Mean magnitude also increased consistently in the South region from a − 26% change in June-July to a + 84% in August-September. Events in East North Central also experienced a similar change in mean magnitude with a 30% decrease in June-July and a 51% increase in July-August.

Combined heat wave characteristics
The CHCI values ranged from 0 to about 0.145 (Fig.12A). The top 25% version of the CHCI exhibited the highest values overall. For this group, CHCI values ranged from 0 to 0.145 and averaged 0.031 across the US. The highest CHCI averages occurred in the East North Central (0.086), South (0.085), and Central (0.063) regions. Ironically, while the three regions with higher CHCI values are adjacent to one another, the variables driving the index pattern differ (Fig. 12B). Within the South region, the CHCI components were either 3223 or 3233. These index values show that heat waves in this region were of mid (average) magnitude and higher-than-average area and number of patches. What differs across the South region is the fragmentation of the heat waves. Regions in yellow (Fig. 12B) had heat waves that were significantly more cohesive (higher fragmentation statistic). Within the Central region, all the CHCI component values were the same (2233). This value indicates an average area and magnitude but a significantly above average fragmentation pattern (more cohesive events) and number of patches. Lastly, within the East North Central, the CHCI component values included 1233 and 1333. This indicates that heat wave events around the Great Lakes and into Wisconsin and parts of Minnesota were highly cohesive and had more than average number of patches and lower than average area, while their magnitude varied from average in some locations to greater than average in others. Across the entirety of the region, most of the heat waves had a larger than average fragmentation pattern and number of patches.
In terms of the bottom 25% CHCI, a completely different pattern and distribution of CHCI is exhibited (Fig. 12A)

Spatial patterns of heat waves across US climate regions
This study comprehensively characterized spatial patterns in the longest/shortest-duration and highly persistent heat waves in US climate regions. Varying spatial patterns were observed per heat wave characteristic and climate region, and this variability influences the level of heat wave impacts on socioecological systems. For instance, both the longest and shortest-duration events in the West region had the highest magnitude. The highest persistence was observed in the longest-duration events in West, while the longest-duration events in the South and Southwest were the most expansive. Events with the highest number of patches were found in the South, East North Central, and Central regions, mostly in the longest-duration group. Highly cohesive events were observed in the East North Central region. In addition, significant variability was found in these events over decades and summer months, indicating the dynamic Prior studies have found that, generally, heat waves in the US are intensifying in duration, intensity, and frequency (Habeeb et al. 2015;Schoof et al. 2017). While an investigation of large-scale drivers of heat wave trends is beyond the scope of this paper, it should be noted that past studies have linked upward trends in heat wave characteristics to modes of natural climate variability and the impact of long-term human-induced climate change (DeGaetano and Allen 2002;Meehl and Tebaldi 2004;Hoerling et al. 2013;Keellings & Waylen 2014b;Jones et al. 2015;Mora et al. 2017). Due to the enhanced soil moisture-temperature coupling, warming temperatures have been found to amplify the intensity of heat waves in dry areas of southern and southwestern U.S. but not in energy-limited regions of northern and northeastern U.S. (Cheng et al. 2019). In addition, spring precipitation deficits limit soil moisture, reduce evapotranspiration, and increase sensible heating that is then amplified by the presence of anticyclonic blocking (Lau & Nath 2012). Indeed, exceptional heat waves across the Great Plains of the U.S., including those of the 1930s Dust Bowl era, have been found to be preconditioned by springtime drought and strengthened by continentalscale anticyclonic flow and warm air advection (Cowan et al. 2017). Synoptic systems are also associated with heat waves, as stationary high pressure sits adjacent to affected areas for an extended period, resulting in advection of warm and dry air (Dole et al. 2011;Marshall et al. 2014;Miralles et al. 2014;Schumacher et al. 2019). Moist tropical and dry tropical are the synoptic weather types most associated with heat events (Sheridan & Kalkstein 2004), and the frequency of such weather types has increased in several regions (Senkbeil et al. 2017;Vanos et al. 2015). Such conclusions underscore the importance of characterizing spatial dynamics in heat waves per climate region and calls for region-specific measures, given differing climate regimes (e.g., winds and humidity), to deal with the impacts of heat waves.

Novelty and meaning of persistence
Persistence analysis was developed to follow trends in environmental health and/or productivity using the Normalized Difference Vegetation Index (NDVI) Waylen et al. 2014;Bunting et al. 2018). Within the original context of persistence, one can see the landscape impacts over time, even being able to pinpoint the time in which landscape change occurred and its duration. However, the NDVI version of persistence does not give context as to the drivers of change, such as climate extremes. With our application of persistence, we look to further define the spatial dimensions of heat waves as it can impact socioenvironmental health. This metric maps regions consistently impacted by heat waves of intensities which, when overlaid with information on human demographics, health, and environmental health, can be used to pinpoint not only triggers and timeframe of change but also rates of and susceptibility to change. For instance, the highly persistent 2013-2015 North Pacific marine heat wave (Di Lorenzo & Mantua 2016) has been linked to significant alterations in the biological composition and structure of the ocean and coastal ecosystems (Cavole et al. 2016). Changes in heat wave persistence may be more relevant to society than changes caused by mean temperature changes alone (Lorenz et al. 2010).

Novelty and meaning of CHCI index
The CHCI index presented in our study looks to build on such work by highlighting the combined heat wave impacts in a cumulative fashion that is both spatially explicit and representative of temporal variations. First, through quartile thresholding of heat wave events based on duration and persistence and finally computing CHCI using magnitude, number of patches, fragmentation, and areal extent parameters, heat wave events were comprehensively characterized using all six variables. Secondly, standardized variables were used in computing CHCI, thus permitting a comparative assessment of heat waves across climate regions in the contiguous US. Lastly, by exploring the components of CHCI for each pixel, important information was revealed about the main characteristics driving heat wave impacts in each climate region. For instance, the high CHCI values in the South were related to high mean area and number of patches, average magnitude, and at least average fragmentation statistic. In the East North Central region, on the other hand, CHCI was associated with events that were highly cohesive, had more-than-average number of patches, lower-than-average area, and at least average magnitude. CHCI is calculated at the finest spatial resolution possible, as dictated by the data. The CHCI index represents a novel means to understand the impact of heat waves across the US.

Key contributions of this work
Heat waves impact a wide variety of processes including human health, environmental health, and management of systems. CHCI provides another context to study and develop policy within these arenas as the index highlights both the cumulative pattern of impact, spatial dimensions, and driving factors. The CHCI index and component are key contributions to the literature for several reasons. Other heat wave indices have not highlighted the driving characteristic behind the index value. Our study shows (1) the driving heat wave characteristics that contribute to the CHCI value and (2) that across the U.S. there is not one driving heat wave variable. Section 3.5 highlights the spatial diversity in heat wave impacts both within and across the differing climate regions. Therefore, it is important to study more than just the maximum temperature associated with heat wave events, as commonly done by forecasters and emergency managers (Bunting et al. IN PREP).
Beyond CHCI, the heat wave persistence data is a large contribution to the field as we can quantify human and environmental health impacts from long-term repeated heat wave exposure. Such data can be analyzed statistically in tandem with multiple datasets including demographics, soil moisture, urban extent, population, NDVI, and many more to fully develop our understanding of heat wave impacts. For instance, major single heat wave events across the U.S. and Europe have been linked to human health and mortality events. Now, with heat wave persistence, we can add to this body of literature and see not only how short-term exposure to extreme events impacts human health but how long-term repeated exposure impacts a system. Our results, in Section 3.4, highlight extreme spatial and temporal diversity in heat wave persistence. Heat-related deaths occur when a rapid rise in temperatures outpaces the body's ability to cool itself (Habeeb et al. 2015). All heat wave variables have been studied in relation to their impacts on human health. For instance, heat wave duration has been shown to modify mortality rates (Kalkstein & Smoyer 1993). Longer heat waves have been noted to increase dangerous thermal exposure, particularly in urban areas (Hajat et al. 2002;Anderson & Bell 2009). Longer heat waves have also been shown to impact mortality rates in major cities such as Madrid and St. Louis (Smoyer 1998;Díaz et al. 2002). Timing of the event has shown to impact overall risk (Anderson & Bell 2011). Lastly, heat-related mortality increases during heat waves with greater intensity (Anderson & Bell 2009). Though all these studies highlight the impact of a single heat wave variable on mortality, they also note that one variable could not fully explain catastrophic heat wave impacts. There is no doubt that susceptibility to heat varies across the population and from region to region; therefore, with the CHCI index, we can more wholistically look at heat wave-related mortality rates and better pinpoint susceptibility by combining this data with demographic and health metrics. Additionally, with persistence, we can understand repeated exposure and its impacts.

Drawbacks in heat wave studies
The study of heat waves faces numerous challenges. In general, heat waves are defined as extended periods of extreme heat, though no consistent definition exists regarding the actual temperature threshold, metric, and event duration that definitively defines a heat wave. In this study, we utilized a standard definition for heat waves, using the climatological literature to guide us. A relative location-specific percentile-based threshold has been widely utilized across previous heat wave studies. However, differing heat wave definitions can result in inconsistent data and results. For example, research indicates that significant positive associations between heat waves and health outcomes (preterm birth and nonaccidental death) are more common when relative (rather than absolute) heat wave indices are used to define exposure (Kent et al. 2014). However, the approach employed in this study, using a percentile-based threshold of temperature, is common within climatological extreme studies, and therefore, the results highlight realistic heat wave patterns and impacts. We also do not consider humidity in our definition of heat waves. Humidity is known to negatively impact human comfort and amplify heat wave health impacts.