Nowadays, involving machines to reduce digitization errors is becoming popular. The repeatability and reproducibility help estimate digitization precision (Fruciano et al., 2017). Using the Burnaby method (1996), many studies removed digitization errors (Julien et al., 2021; Fruciano et al., 2017). However, one should remember that Burnaby designed the method to remove variation due to the size factor from a set of linear distance measurements so that the remaining variation would be size-independent shape descriptors. Gharaibeh (2005) removed even errors due to object orientation in historical images by a single vector eliminated from the sample using orthogonal projection. In the present study, we quantified the digitization error, ensuring it was almost negligible rather than eliminating them through such methods.
Where X' is aligned shape coordinates of specimens (icons), Xi is original specimens, ρ is a scale to unit size, H is a rotation matrix, and τ is a translation (Rohlf 1999).
Where Xzero° is the specimens oriented along the image plane (0°) in which all their landmarks are coplanar (Fig. 1a), whereas, Xob° represents the specimens oriented away from the image plane with an angle θ ∈ {5°≥ θ ≤ 80°}(Fig. 1b).
It is important to note that the matrix [H of equation (1)] is often constrained to effect only rigid rotations of the configurations about their centroids to obtain the best possible fit between corresponding landmark positions. The reference (Ri) is rotated to an orientation of maximum shape correspondence to the target (Ti). Sneath (1967) provided equation (3) to calculate the optimal angle to rotate the Ri configuration to match the Ti configuration. It involves rigidly rotating the landmark configurations about their centroids to obtain the best possible fit (Boas 1905; Sneath 1967; Rohlf & Slice 1990; Goodall 1991; Dryden & Mardia 2016). The core of geometric morphometrics defines Procrustes distance, the metric used to quantify differences between shapes, and determines Kendall's shape space (S), which forms the theoretical foundation for statistical shape analysis (Kendall 1984; Bookstein 1996; Kendall et al., 1999). In GLS Superimposition, the criterion that gets minimized is the sum of squared distances (Procrustes) between all the landmarks of the configuration (equation 4).
GPA estimates the μ of equation (4) using a least-square approach. It repeats the rotation and scaling of equation (1) until the Procrustes sum of squares of equation (4) cannot be reduced further (i.e., the difference in the sum of squares is less than another tolerance parameter). Where μ is the shape of the population mean with an 'average' shape from the sample.
MacLeod (2010) aptly described the method of Superimposition alignment and elaborated on the advantages and limitations of using (μ) as a reference shape (2010). A careful examination of the landmark positions across CMF (icons) photographed at different orientations/angles undergoes compression and shear, which is referred uniform transformation that creates shape differences (Zelditch et al., 2012). Landmarks near distal ends of icons along the axis of object orientation get perceptible displacement than those along the proximal ends. Such displacements of landmarks could be reasonably adjustable with 3D data sets. However, it is inevitable in 2D data sets, where some parts of affine transformation (compression/shear) of landmarks remain in the configurations that form the difference in shapes.
We conducted an exploratory principal component analysis (PCA) on the GPA-aligned shape coordinates X' (equation 1). It provides a graphical output that indicates the differences in orientations ranging from 0° to 80° (Fig. 4). Further, we tested the differences in the Object-orientation with a confirmatory test using Procrustes ANOVA that showed a significant shape difference between 0° and ≥ 20°. Therefore, we noted a lower residual sum of square values from 1.69 to 1.71 for icons with Object-orientations (up to ≤20°), whereas higher residual values for others in the Procrustes ANOVA (>20° Object-orientation and mirror icons) (Table 1; Figs. 3 & 4).
When displaced to non-Euclidean or hyperbolic spaces, the landmarks originally in the Euclidean space equally produce differences in icons. An alternative algorithm, Procrustes Superposition-Resistant Fit Theta-Rho Analysis (RFTRA), has been used in the case of Pinocchio Effects (where a relative displacement of one/few/group/any landmarks) (Siegel & Benson 1982; Rohlf & Slice 1990; Hallgrimsson et al., 2015; Klingenberg 2021). The irony is that never a single algorithm eliminates all residual errors; if so, it also over-fits. Klingenberg (2021) argued that some aspects of the information contained in each configuration of landmarks, particularly the position (of specimens relative to the camera or digitizing equipment) and orientation (the direction specimen points towards during the data collection) of each configuration, are irrelevant from a biological perspective (Klingenberg 2021). It might be true in the case of 3D data, but it requires scrutiny when dealing with 2D icons representing specimens or other physical objects.
Filter Criteria
In the Procrustes GLS method of optimization, variance gets distributed equally among landmarks by which the shape variation due to Object-orientation issues up to ≤ ±20° gets adjusted; therefore, differences were not statistically significant up to 20°, which aids us in proposing the filter criteria. The PCA plot shows the variation in the first principal axis where the icons up to 20° angle clustered together than the others (Fig. 4). Given these GLS properties, we have profitably developed an algorithm for the filter criteria (Figs. 4 & 5). Although the correction by the GLS Superimposition is limited to a range of icons, it assumes significance because it encourages profitable use of digital images, with Object-orientation issues, taken in field conditions for shape analysis. With no filter criteria, in earlier times, the object-oriented digital images were discarded or used in many studies without quantifying their effects, as in the case of free-ranging horses (Gmel et al., 2018), macaques (Kurita et al., 2012), birds (Foster et al., 2008), its postural effects (position of body parts viz., head or tail towards the camera). Taking pictures without orientation is often rare under field conditions, which is inevitable in 2D images. Again, shape deformation owing to Object-orientation is unavoidable in fossil specimens, for they have been subjected to geological pressure for many years. Object-orientation seems analogous to the articulation function but is quite different. With R tools, one can eliminate angle distortions with (fixed. angle) R functions of Geomorph and ShapeRotator (Vidal-García et al., 2018).
Transfer Learning
In taxonomic studies, subtle variations matter a lot when compared with holotypes. It dramatically impacts results when the phenotypic variation is in the same direction as other errors, viz., measurement errors and distortion of specimens. It is not easy to tease out the difference under GPA. It was clear that between 0° and 5° data sets (Table 1) showed non-significant results in Procrustes ANOVA; however, the higher Z values indicate the subtle differences (Table 1). Therefore, we classified the subtle orientations > 5 of the icons directly from the images (without GPA) via standard classifying algorithms using the transfer learning approach (Table 2).
Transfer learning is a paradigm to prevent overfitting. It involves copying the weights learned from a base model and refining it on a target model, improving model accuracy, and reducing model training time and the amount of labeled data required (Willi et al., 2019; Shorten & Khoshgoftaar 2019). We used transfer learning through the Weka and Google teachable machine learning. Google teachable machine achieved excellent accuracies, whereas, in Weka, we got lesser accuracy due to limited pre-trained models. It also took considerable time to download the pre-trained weights and run various zooModel. The output of the Transfer Learning version of MobileNet and PoseNet in Google Teachable Machine has the upper hand in discriminating even the subtle shape differences (Table 2) with pose angles (Laborde 2021).
Augmentation of digital images is essential for machine learning problems to create various images with manipulation functions such as rotating the pictures in different axes. Augmentor (0.2.6), a Python package, allows the augmentation of digital photos by invariably rotating the pictures. However, it fails to mention specific ranges for image manipulation by rotational functions. Thus, the proposed filter criteria would be a guiding framework for augmentation using image manipulation functions that increase the sample sizes and ensure the non-distortion of specimens' shapes. Therefore, in future studies, digital images that require annotations (Gerum et al., 2017), and augmentation (Willi et al., 2019), our suggested filter criteria would form a first step that would meaningfully augment images through machine learning. Our results closely follow the digit recognition tasks such as MNIST (Shorten & Khoshgoftaar 2019).
We prefer Transfer Learning as an add-on tool when dealing with subtle variations using Google teachable machine and WEKA. This work encourages the usage of images collected in different ways (Webcam, Digital cameras, Camera traps, Unmanned Vehicles) in the shape analysis by in silico methods. Our results and pose angle usage through PoseNet have broader applications in e-commerce platforms.