Phylogenetic tree
We used a recently published Lepidoptera phylogeny that was time-calibrated with genomic and fossil data1. Because our study focused on family-level morphological variation, we dropped tips representing lower taxonomic ranks (e.g., subfamily and genus) using the ‘drop.tip’ function in the R package ‘ape’76. The resulting Lepidoptera tree contained 83 tips, including 74 Lepidoptera families (containing 145,623 described species77) and 9 outgroups.
Estimating diversification rates
To estimate diversification rates through time across the phylogeny, we applied the Bayesian Analysis of Macroevolutionary Mixtures (BAMM) model78 to the trimmed phylogeny using the R package ‘BAMMtools’79. The sampling fraction was adjusted based on the species numbers within each family77 before the BAMM analysis. The priors for the BAMM analysis were determined by the setBAMMpriors function with the modified tree and the estimated number of Lepidoptera species. To avoid difficulty in initial likelihood estimation due to the extremely low sampling fraction, we specified extinctionProbMax as 0.99999 in the control file. A total of 10^9 generations of MCMC searches were launched for the BAMM speciation-extinction analysis, with samples saved every 10^5 generations.
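In BAMM's control-file syntax, the settings described above correspond to entries like the following. This is a sketch: the prior values shown are placeholders that setBAMMpriors would supply, and the sampling-fraction file name is hypothetical.

```
# MCMC settings
numberOfGenerations = 1000000000     # 10^9 generations
mcmcWriteFreq = 100000               # sample every 10^5 generations
eventDataWriteFreq = 100000

# Likelihood settings
extinctionProbMax = 0.99999          # avoid initial-likelihood failures
                                     # under very low sampling fractions

# Clade-specific sampling fractions (hypothetical file name)
useGlobalSamplingProbability = 0
sampleProbsFilename = sampling_fractions.txt

# Priors (placeholder values; use the output of setBAMMpriors)
expectedNumberOfShifts = 1.0
lambdaInitPrior = 1.0
lambdaShiftPrior = 0.05
muInitPrior = 1.0
```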
We used the ‘plot.bammdata’ function to visualize diversification rates on the phylogenetic tree. The first 50% of samples were discarded as burn-in. We then used the ‘getBestShiftConfiguration’ function to identify the best-supported configuration of diversification-rate shifts. The locations of the inferred rate shifts were added to the visualized tree with the ‘addBAMMshifts’ function. Next, we used the ‘credibleShiftSet’ function to estimate the 95% credible set of distinct rate-shift configurations. Diversification rate through time, with an associated confidence interval, was plotted using the ‘plotRateThroughTime’ function for Lepidoptera only (i.e., the outgroups were excluded).
Ancestral state estimation
We dropped tips (families) lacking morphological data from the Lepidoptera phylogeny using the ‘drop.tip’ function. Fifty-two tips remained in the resulting phylogeny for morphological disparity analysis. We treated the six morphogroups (“Similarity tree and morphogroups” section) as six discrete character states, and we applied three different evolutionary models (equal rates (ER), symmetrical (SYM), and all-rates-different (ARD)) to the phylogeny to describe discrete-state evolution and estimate its likelihood using the ‘ace’ function in the R package ‘ape.’ The best-fit model was chosen using a chi-square test. We then applied stochastic trait mapping simulations80 under the chosen model to estimate the ancestral states at nodes, as well as the time spent in each state along the edges of the phylogeny, using the ‘make.simmap’ and ‘describe.simmap’ functions in the R package ‘phytools’81.
Morphological evolution based on continuous phenotypic states
To understand the evolutionary history across the morphospace, we analyzed the evolutionary dynamics of the morphological data in continuous states. Specifically, we applied PCA to the mean key features of families (“Identifying key image features” section) to further reduce the dimensions of interest. The resulting axes PC1, PC2, and PC3 explained 28.6%, 10.1%, and 8.5%, respectively, of the family-level morphological variation (47.2% in total; Extended Table 1). The values of PC1, PC2, and PC3 were used as input trait values for the following analyses.
We performed a disparity-through-time (DTT) analysis and plotted the results using the ‘dtt’ function in the R package ‘geiger’82. The DTT analysis compared the trends of morphological diversification to the expectations of a Brownian motion (BM) model constructed with 1000 random simulations. We further estimated morphological evolutionary rate shifts along lineages using the ‘PhyloEM’ function implemented in the R package ‘PhylogeneticEM’83. We determined whether the evolutionary rates of any of the three PC axes deviated from BM or Ornstein-Uhlenbeck (OU) expectations, and when such deviations occurred.
We employed three functions, namely ‘fitContinuous’ from geiger84, ‘fitDiversityModel’ from phytools81, and ‘OUwie’ from OUwie85, to assess the statistical adequacy of several models explaining the values of PC1, PC2, and PC3. The models we tested were: rate trend (trend), early burst (EB), diversity-dependent (DD), an OU model with a single regime (OU1), and an OU model with multiple regimes (OUM). For OUM, we estimated multiple regimes, or adaptive peaks, for the different morphogroups (morphogroups 1–6, as detailed in the “Similarity tree and morphogroups” section). To compare the candidate models of continuous character evolution, we determined the best-fitting models using the AIC weight, calculated with the ‘aicw’ function in geiger84.
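Akaike weights follow a standard formula: each model's AIC is rescaled relative to the best (lowest) score, and the relative likelihoods are normalized to sum to one. A minimal stdlib sketch (the AIC scores below are illustrative, not from the study):

```python
import math

def aic_weights(aics):
    """Akaike weights: w_i = exp(-delta_i / 2) / sum_j exp(-delta_j / 2),
    where delta_i = AIC_i - min(AIC)."""
    best = min(aics)
    rel = [math.exp(-(a - best) / 2.0) for a in aics]
    total = sum(rel)
    return [r / total for r in rel]

# Illustrative AIC scores for the five candidate models
scores = {"trend": 120.4, "EB": 118.9, "DD": 121.7, "OU1": 115.2, "OUM": 112.8}
weights = dict(zip(scores, aic_weights(list(scores.values()))))
best_model = max(weights, key=weights.get)   # model with the highest weight
```

With these toy scores, the lowest-AIC model (OUM) receives the largest weight, which is how the best-fitting model would be selected.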
Image collection
The specimen images of Lepidoptera were searched within the open database of GBIF and downloaded from their registered institutional websites (accessed on 2021-02-03; the full citations and rights are declared at https://github.com/yuhinas/LepMorphAI). To cover the widest range of taxa (i.e., the greatest number of species) and photographic conventions (e.g., lighting environments and backgrounds) while keeping the total number of images low, we downloaded one image per species from each registered institutional website. That is, if different institutions had photos of the same species, there would be multiple photos of that species, but from different institutions. We trained an object detection model, YOLO v449, for locating and cropping specimens from the original images. The anchors were tweaked for Lepidoptera specimen images, and the bounding-box loss functions were modified to penalize more heavily predicted boxes that were too small and would crop off the edges of specimens. The cropped images were further resized and padded to a resolution of 256x256. The padded areas were filled with the mean RGB of pixels sampled from the edges. In total, we ended up with 32,262 standardized specimen images of 17,186 species. Our modification of YOLO v4 was based on https://github.com/bubbliiiing/yolov4-pytorch.
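The pad-and-fill step can be illustrated on a toy RGB image represented as nested lists. This is a stdlib sketch of the idea only; the actual pipeline resized crops to 256x256 arrays, and the exact edge-sampling scheme is our assumption.

```python
def pad_to_square(img):
    """Pad an H x W image of (R, G, B) tuples to max(H, W) on each side,
    filling the new pixels with the integer mean RGB of the border pixels."""
    h, w = len(img), len(img[0])
    side = max(h, w)
    # Sample border pixels: first/last rows plus first/last columns
    border = img[0] + img[-1] + [row[0] for row in img] + [row[-1] for row in img]
    fill = tuple(sum(px[c] for px in border) // len(border) for c in range(3))
    top, left = (side - h) // 2, (side - w) // 2   # center the original image
    out = [[fill] * side for _ in range(side)]
    for y in range(h):
        for x in range(w):
            out[top + y][left + x] = tuple(img[y][x])
    return out

# Toy 1x3 image padded to 3x3; the fill color equals the (uniform) border color
padded = pad_to_square([[(10, 20, 30)] * 3])
```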
Data augmentation
We applied two sets of random augmentations for variational autoencoder training. To prevent potential learning biases, the first set simulated the variation among photographic conventions, including RGB adjustments, independent treatment of RGB channels for contrast adjustments, and slight variations in zoom levels. The second set was treated as noise, including xy translations, rotations, and obscuring a random percentage of pixels with three different patch sizes. The model was forced to learn the variations introduced by the first set and to ignore the noise introduced by the second set. The model needed to capture as many details as possible to compensate for the information lost to obscured pixels.
Model and training
We used a customized variational autoencoder called DFC-VSC for self-supervised feature extraction (Fig. 1a) and image generation. DFC-VSC was composed of the variational sparse coding (VSC) autoencoder48 and the deep feature consistency module from DFC-VAE50. The former was for extracting sparse and disentangled representations for better interpretability, and the latter guided the model in the training process to focus more on relevant foregrounds (the specimens of Lepidoptera) over the various backgrounds.
The encoder of our VSC implementation digested noise-introduced images of 256x256x3 dimensions with layers composed of residual blocks and encoded them into 512-dimension latent vectors. The latent vectors were regularized to follow spike-and-slab prior distributions with alpha=0.01, as recommended by VSC48. The decoder was symmetric to the encoder and reconstructed denoised images from the latent vectors in the training process. The decoder could later generate specimen images from any given vectors of the same dimensions for feature exploration, visualization, or random creation. Pixel-wise MSE reconstruction loss (L-rec) and spike-and-slab prior distribution loss (L-prior; the KL divergence between two spike-and-slab distributions, instead of between two Gaussians as in a standard VAE) were adopted in the VSC part.
The DFC module played the role of supervisor and defined the deep features on which the VSC module should focus. Our DFC implementation used a ResNet50 model trained from scratch to classify Lepidoptera subfamilies. We chose subfamilies as the classification target because the images of each subfamily covered a sufficient variety of photographic conventions, such as various backgrounds and lighting conditions. In addition, the number of subfamilies (~300) was more appropriate for the classifier than the number of genera (~5000) or families (~90), given the sample size requirements per class for training a classifier. The DFC module could therefore learn to ignore backgrounds irrelevant to the classification task, as shown with the Relevance-weighted Class Activation Mapping (Relevance-CAM)86 visualization (Extended Fig. 9).
In the main DFC-VSC training process, the parameters of the DFC module were locked, and the DFC module worked only as a feature extractor. The feature maps of the major layers, extracted from the original and reconstructed images, were compared and used to calculate a deep-feature MSE loss (L-dfc). This loss was then propagated back to the VSC model to update its parameters. The process made the VSC learn to reconstruct images with high deep-feature consistency with the original images, so the VSC model parameters were balanced toward the relevant foregrounds. Ultimately, DFC-VSC was trained for 40,000 epochs (learning rate=0.0002, Adam optimizer), by which point all three losses had reached a plateau.
Identifying key image features
Features of 512 dimensions from all 32,262 standardized specimen images (a 32262x512 matrix) were extracted with the trained DFC-VSC encoder. To further reduce dimensions for easier analysis and interpretation, we identified the key features related to subfamilies. We applied the back-propagation algorithm to a trained simple classifier targeting subfamilies to determine the feature dimensions with the steepest gradients. We chose subfamily instead of family as the learning target because we wanted an adequate number of key features while maintaining a finer resolution of features than the family level provides. The key features related to subfamilies were then aggregated to the family level for all following analyses.
To avoid the difficulty of interpreting interactions between signed feature values and signed gradients, we extended and converted the 512-D features into 1024-D vectors with non-negative values (referred to as the non-negative features). To do this, we concatenated the original 512-D features with their sign-reversed clones into 1024-D vectors and then set the values at locations that were originally negative to zero. The new 1024-D non-negative vectors were then used to train the simple classifier. The classifier consisted of only two linear layers, each with 512 neurons and followed by 50% dropout and leaky ReLU activations. It achieved 75% top-1 accuracy (i.e., the percentage of cases in which the highest-ranked predicted class label was correct) for classifying 297 subfamilies, each of which contained at least three species. After applying back-propagation, the top-N key features of each subfamily were voted for by its member species (at least three sampled) as the dimension locations with the largest N mean gradients. We tested N from 1 to 20. For each N, we took the corresponding N-union features (i.e., the union of the top-N features of each subfamily) and calculated three clustering indices, the silhouette score, the Davies-Bouldin score, and the Calinski-Harabasz score, to evaluate and select the appropriate N according to subfamily clustering. The overall evaluations showed that the optimal scenario occurred with N=2 and N-union=59 (Extended Fig. 10). Finally, converting the 59 non-negative features back to the 512-D feature space yielded 40 unique key features.
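The sign-splitting conversion described above can be written compactly; a stdlib sketch on a toy 4-D feature vector (the study used 512-D vectors):

```python
def to_nonnegative(features):
    """Concatenate a feature vector with its sign-reversed clone,
    then zero out negative entries, doubling the dimensionality.
    Positive values survive in the first half, the magnitudes of
    negative values in the second half."""
    doubled = list(features) + [-v for v in features]
    return [v if v > 0 else 0.0 for v in doubled]

vec = [1.5, -2.0, 0.0, 0.25]       # toy 4-D feature vector
nonneg = to_nonnegative(vec)       # 8-D, all values >= 0
```

Note that each non-negative dimension maps back to an original dimension by index modulo the original length, which is why the 59 selected non-negative features collapse to fewer unique features in the original space.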
We could see that these 40 features were highly representative of the original 512 features, as shown by the high similarity between typical forms of subfamilies reconstructed with the 40 features and those with all 512 features (Extended Fig. 11).
Similarity tree and morphogroups
We used the mean of the key features of each species to calculate the morphological similarity between any two families (i.e., the mean pairwise distances between any two species from different families). The resulting distance matrix was then used to build a similarity tree with the neighbor-joining algorithm implemented in the ‘scikit-bio’ package in python3. We set the root of the similarity tree to the family Eriocraniidae, the earliest-evolved family with enough data sampled in our dataset. The branches were ladderized with morphological similarity to the root.
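The family-to-family distance described above, the mean pairwise distance between species from two different families, can be sketched with the stdlib (toy 2-D feature vectors stand in for the key features):

```python
import math
from itertools import product

def family_distance(species_a, species_b):
    """Mean pairwise Euclidean distance between the feature vectors
    of species drawn from two different families."""
    dists = [math.dist(u, v) for u, v in product(species_a, species_b)]
    return sum(dists) / len(dists)

fam1 = [(0.0, 0.0), (0.0, 2.0)]    # two species of family 1
fam2 = [(3.0, 0.0)]                # one species of family 2
d = family_distance(fam1, fam2)
```

Computing this for every pair of families fills the distance matrix fed to neighbor joining.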
The tree was then split into K morphogroups for categorical analysis, and each group was required to be composed of one or several adjacent clades. The best K was decided using the following process. First, based on a general definition of clustering87, we attempted to maximize the similarity within each cluster and minimize the similarity among clusters. For any given K, we enumerated all possible combinations of K groups on our similarity tree and calculated the total variation within groups, the ratio of intragroup variation to intergroup variation, and the variation among group characteristics. The sum of these quantities was defined as a "the lower the better" grouping index. Next, we evaluated the grouping indices for each K from 3 to 10. Finally, we found the optimal result (the minimum index) when K = 6 (Extended Fig. 2c).
Wing shape and color patterning analysis
To understand the basic flight type of each family and morphogroup, we first generated each family’s typical form from its mean key features. We then isolated the forewings, with the background removed, from the reconstructed images to calculate the aspect ratios (AR) and the second moment of area (2nd MoA) following the approach of Roy et al.67. For each image, we selected only the left or right forewing, depending on the clarity of the reconstructed wing boundaries. We followed the procedures and equations of the Matlab tool ‘WingImageProcessor’ (https://biomech.web.unc.edu/wing-image-analysis/) to reimplement the calculations of AR and 2nd MoA in Python.
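On a binary wing mask, these two quantities reduce to simple pixel sums. The stdlib sketch below uses the textbook definitions (AR as span squared over area, 2nd MoA as the area-normalized second moment about the wing base); these simplified forms are our assumption and are not the exact WingImageProcessor equations the study reimplemented.

```python
def aspect_ratio(mask):
    """AR of a single wing from a binary mask (rows of 0/1 pixels),
    using the simplified definition span**2 / area."""
    area = sum(sum(row) for row in mask)
    span = sum(1 for x in range(len(mask[0])) if any(row[x] for row in mask))
    return span ** 2 / area

def second_moment_of_area(mask):
    """Second moment of the wing area about the wing base (column 0):
    the mean of x**2 over wing pixels (a simplified, unit-free sketch)."""
    area = sum(sum(row) for row in mask)
    moment = sum(x * x for row in mask for x, v in enumerate(row) if v)
    return moment / area

wing = [[1, 1, 1, 1],
        [1, 1, 1, 1]]            # toy 2x4 rectangular "wing"
ar = aspect_ratio(wing)          # span 4, area 8 -> AR = 2.0
```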
For the other features of shape and color patterning, we measured both the isolated, background-removed forewings and hindwings of each family. We used the built-in functions of the ‘opencv-python’ package to find bounding boxes, perimeters, and areas. The aspect ratio of the bounding box was defined as the box width divided by the box height. The perimeter-area ratio was defined as the wing perimeter divided by the wing area. The mean brightness was defined as the mean grayscale value of the pixels of a wing. We applied the BIRCH clustering algorithm (implemented in the ‘scikit-learn’ python package, with n_clusters=None and threshold=16) to the RGB values across all forewings and all hindwings to obtain non-arbitrary numbers of color groups, K-forewing and K-hindwing. The wing color richness was defined as the number k (1 ≤ k ≤ K) of color groups a wing has. We calculated color evenness from the number of color groups and the pixel counts of each color group, using the same formula as for species evenness.
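Species evenness is conventionally Pielou's index, Shannon entropy divided by its maximum; assuming that formula carries over to color groups (our reading of "the same formula"), a stdlib sketch:

```python
import math

def color_evenness(pixel_counts):
    """Pielou-style evenness over color groups: H / ln(k), where H is the
    Shannon entropy of pixel proportions and k the number of color groups
    present on the wing."""
    counts = [c for c in pixel_counts if c > 0]
    k = len(counts)
    if k < 2:
        return 1.0  # a single color group is trivially even
    total = sum(counts)
    h = -sum((c / total) * math.log(c / total) for c in counts)
    return h / math.log(k)

even = color_evenness([100, 100, 100])   # equal pixel counts -> evenness 1.0
skewed = color_evenness([298, 1, 1])     # one dominant color -> low evenness
```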
We defined a measurement named the Pattern Distinctiveness-Distance Index (PDDI) to indicate the mean horizontal distance of distinctive color regions (such as belts and spots) from the wing base, using the following procedure. First, we calculated pairwise color distances (RGB Euclidean distances) between any two pixels on a wing and then took the mean for each pixel (Extended Fig. 12a). A higher mean color distance indicates higher distinctiveness (i.e., the color differed more from the others and occupied less of the full wing) (Extended Fig. 12b). We truncated mean values above the 95th percentile to suppress noise. Second, we aggregated the 2-dimensional wing distinctiveness maps into a 1-dimensional vector by taking the mean of the highest 50% of each pixel column (Extended Fig. 12c) to highlight the occurrence of distinctiveness along the horizontal axis. Third, we extracted the relative distances from the wing base of the highest 50% of the aggregated distinctiveness values, and calculated the mean distance normalized by the variance as the final PDDI (Extended Fig. 12d). A higher PDDI indicated that the distinctive color regions were located farther from the wing base (i.e., closer to the outer edges of the wing) in the horizontal direction. Finally, we used bootstrapping (resampled 10,000 times with replacement) to estimate and compare the means and variances of wing color patterning between the primitive and recent morphogroups, with the “boot” library in R.
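The second step, collapsing a 2-D distinctiveness map to a 1-D horizontal profile, can be sketched with the stdlib (a toy 2x4 map stands in for a real per-pixel distinctiveness map):

```python
from statistics import mean

def aggregate_columns(dist_map):
    """Collapse a 2-D distinctiveness map (rows x cols) to a 1-D profile
    by averaging the highest 50% of values in each pixel column."""
    n_rows = len(dist_map)
    half = max(1, n_rows // 2)
    profile = []
    for x in range(len(dist_map[0])):
        col = sorted((row[x] for row in dist_map), reverse=True)
        profile.append(mean(col[:half]))
    return profile

# Toy map with a distinctive region near the outer edge (last column);
# the wing base is assumed to be at x = 0
toy = [[0.1, 0.1, 0.2, 0.9],
       [0.1, 0.2, 0.1, 0.8]]
profile = aggregate_columns(toy)   # one aggregated value per pixel column
```

The profile peaks at the column holding the distinctive region, so downstream steps measuring peak positions relative to the base would yield a high PDDI for this toy wing.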
PCA axis visualization and analysis
We applied PCA to the mean key features of families with at least three sampled species (68 families) to investigate morphological disparity through time. For an intuitive understanding of the synergistic effects of the PC axes, instead of using biplots, we visualized the planes constructed from the PC axes through the following steps. We first sampled a set of 4*4 points whose coordinates were the unique combinations of (x, y) for x, y ∈ {-3, -1, 1, 3} on the 2-D planes constructed from any two PC axes. We then inverse-transformed these points back to the morphospace of key features and generated the corresponding 16 images with the decoder of DFC-VSC. By rearranging these images into 4*4 grids according to their coordinates on the PC axes, we could easily see how wing shapes and color patterns vary across the PC planes.
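The grid-sampling step amounts to enumerating coordinate combinations; a stdlib sketch (the `pca` and `decoder` names in the comment are hypothetical stand-ins for the fitted PCA and the DFC-VSC decoder):

```python
from itertools import product

# 4*4 grid of sample points on a PC plane: x, y ∈ {-3, -1, 1, 3}
grid = list(product([-3, -1, 1, 3], repeat=2))

# Each point would then be inverse-transformed to key-feature space and
# decoded into an image, conceptually:
#   image = decoder(pca.inverse_transform(point))   # hypothetical names
```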
We further quantified wing shape and color patterning to look for potential trends along the PC axes. The left forewings and hindwings were isolated, with the background removed, from the images generated on a grid (g-x, g-y) for g-x, g-y ∈ {-3, 0, 3}. Their characters were then measured, including the aspect ratios of the bounding boxes of the wings, mean saturation, mean brightness, color richness and evenness, the perimeter-area ratios, and the PDDI. The characters related to flight capability (i.e., wing AR and 2nd MoA) were measured only on the forewings (see the “Wing shape and color patterning analysis” section for further details).
Hypervolume and trait probability density
We first projected the mean key features of each species onto the PC axes of families and then calculated the trait hypervolumes, performed their set operations, and calculated the trait probability density (TPD) indices of the primitive and recent morphogroups. Only those families that were included in the ancestral state estimation were included in the following analyses. The total number of families used for these analyses was 52 (containing 16,849 species), including 25 families from the primitive morphogroup (containing 3,300 species) and 27 families from the recent morphogroup (containing 13,549 species).
The hypervolume calculations were performed with the R package ‘hypervolume’ v3.0, using the Gaussian kernel and the ‘cross-validation’ method for bandwidth estimation. Other parameters were set to the default values of the package. The resulting hypervolume of the primitive morphogroup was 47.78 with a uniqueness of 25.93, and the hypervolume of the recent morphogroup was 28.91 with a uniqueness of 7.06. Their intersection was 21.85, and their union was 54.84. The recent morphogroup increased the total hypervolume by 14.8% (7.06 / (54.84 - 7.06)).
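These set quantities obey simple inclusion-exclusion identities, which the reported values (rounded to two decimals) can be checked against directly:

```python
# Hypervolumes reported in the text
prim, recent, inter = 47.78, 28.91, 21.85

union = prim + recent - inter            # inclusion-exclusion -> 54.84
uniq_prim = prim - inter                 # primitive-only volume -> 25.93
uniq_recent = recent - inter             # recent-only volume -> 7.06

# Fraction of the total hypervolume contributed by the recent morphogroup
added_fraction = uniq_recent / (union - uniq_recent)   # ~0.148, i.e., 14.8%
```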
The TPD indices were calculated with the R package ‘TPD’ v1.1. We changed the bandwidth estimation method from ‘plug-in’ to ‘smooth cross-validation’ to match our hypervolume calculation. The TPDc was applied at the assemblage level, and the TPDs at the family level. The TPDs required each family to have more observations (i.e., sampled species) than the number of trait dimensions (3 PC axes), so we took two approaches to meet this requirement. First, we removed small families, which decreased the number of families from 52 to 47. Second, we added one dummy observation imputed with median trait values for each of the five smallest families. The currently known species numbers of each family77 were used as weights for calibrating the redundancy estimation. Importantly, both approaches produced qualitatively similar results, with the main difference being in the relative redundancy due to the difference in the number of families. Finally, the difference in distribution between families of the primitive and recent morphogroups in the PCA morphospace was analyzed with Permutational Multivariate Analysis of Variance (PERMANOVA), implemented with the `adonis` function of the `vegan` library, using distance method=`euclidean` and permutations=1000000.