Field-measured canopy height may not be as accurate and heritable as believed – Evidence from advanced 3D sensing

doi:10.21203/rs.3.rs-2431189/v1

Download PDF

Research Article

Field-measured canopy height may not be as accurate and heritable as believed – Evidence from advanced 3D sensing

https://doi.org/10.21203/rs.3.rs-2431189/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 02 Apr, 2023

Read the published version in Plant Methods →

You are reading this latest preprint version

Canopy height (CH) is an important trait for crop breeding and production. The rapid development of 3D sensing technologies shed new light on high-throughput height measurement. However, a systematic comparison of the accuracy and heritability of different 3D sensing technologies is seriously lacking. Moreover, it is questionable whether the field-measured height is as reliable as believed. This study uncovered these issues by comparing traditional height measurement with four advanced 3D sensing technologies, including terrestrial laser scanning (TLS), backpack laser scanning (BLS), gantry laser scanning (GLS), and digital areal photogrammetry (DAP). A total of 1920 plots covering 120 varieties were selected for comparison. Cross-comparisons of different data sources were performed to evaluate their performances in CH estimation concerning different CH, leaf area index (LAI), and growth stage (GS) groups. Results showed that 1) All 3D sensing data sources had high correlations with field measurement (r>0.82), while the correlations between different 3D sensing data sources were even better (r>0.87). 2) The prediction accuracy between different data sources decreased in subgroups of CH, LAI, and GS. 3) Canopy height showed high heritability from all datasets, and 3D sensing datasets had even higher heritability (H²=0.79-0.89) than FM (H²=0.77). Finally, outliers of different datasets are analyzed. The results provide novel insights into different methods for canopy height measurement that may ensure the high-quality application of this important trait.

canopy height

comparison

field measurement

terrestrial laser scanning

backpack laser scanning

gantry laser scanning

digital areal photogrammetry

The effects of canopy height, leaf area index, and growth stage on the accurate monitoring of canopy height with different 3D sensors were systematically evaluated.
Field-measured canopy height may not be as accurate as believed, especially in the plots with higher canopy height and at later growth stages.
3D sensing methods achieved higher heritable canopy height estimation than field measurement.

Canopy height (CH) is an important and heritable agronomic trait for breeding and field management (Li et al., 2022a). Breeders have paid much effort to selecting the ideal plant height to maximize light interception, increase yield (Madec et al., 2017), enhance logging resistance (Su et al., 2019; Zhou et al., 2020), and facilitate mechanical harvesting. Agronomists often use CH to indicate the growth of other complicated and difficultly accessible traits, such as phenology (Wang et al., 2020), leaf area index (LAI) (Gong et al., 2021), and biomass (Ravi et al., 2018). Therefore, high-throughput and accurate evaluation (e.g., ensuring high heritability) of CH are critical for accelerating crop breeding and production.

Traditional CH estimation methods mainly use rulers by selecting a few representative positions within a canopy. Manual measurement is time-consuming, labor-intensive, tedious, and error-prone due to subjective selection and visual observation. However, it is still the most widely adopted way due to its visibility and reliability during the past decades. Recently, many studies have demonstrated CH can be efficiently acquired from advanced three-dimensional (3D) sensing techniques (Jin et al., 2021b; Sun et al., 2017; Madec et al., 2017; Walter et al., 2019; Jin et al., 2021a). It brings us naturally to a fundamental and essential question: are 3D sensing techniques as accurate as field measurement?

Recent studies have explored the applicability of some mainstream 3D sensing techniques for CH measurements in agriculture, including LiDAR (light detection and management) and multi-view images (Jin et al., 2018). LiDAR is an active sensing technology that records 3D structure information of objects by measuring the distance with the laser (Eitel et al., 2016; Jin et al., 2021b). LiDAR has many advantages, including 1) strong penetration ability that can characterize the inner structure of the canopy, 2) real and direct 3D characterization of an object without a complicated reconstruction process, and 3) insensitive to illumination. According to different mounting platforms, LiDAR systems used for crop height measurement mainly include terrestrial laser scanning (TLS) (Tilly et al., 2014b; Guo et al., 2019), backpack laser scanning (BLS) (Zhu et al., 2021), gantry laser scanning (GLS) (Li et al., 2022c; Sun et al., 2022), and unmanned-aerial-vehicle laser scanning (ULS) (Zhou et al., 2020; Luo et al., 2021; Sofonia et al., 2019). In contrast to the active LiDAR sensing technologies, passive sensing-based methods (e.g., Multi-view images) can also measure 3D structure through methods like structure from motion (SFM) (Holman et al., 2016; Malambo et al., 2018). Among the passive sensing-based techniques, digital areal photogrammetry (DAP) is one of the most popular ways for field CH estimation due to its low cost, high efficiency, and high accuracy comparable to ULS (Hartley et al., 2020; Zhang et al., 2021b; Zhang et al., 2021a). These 3D sensing techniques have been successfully applied to CH measurement, including the adoption of TLS for accurate height measurement of maize (R² = 0.93) (Tilly et al., 2014b), cotton (R² = 0.97) (Sun et al., 2018), rice (R² = 0.91) (Tilly et al., 2014a), barley (R² = 0.95), pea (R² = 0.93), and bean (R² = 0.91) (El-Naggar et al., 2021); the use of BLS for efficient height measurement of large-scale wheat (Zhu et al., 2021) and forest (Niu et al., 2019; Hyyppä et al., 2020; Su et al., 2021); the exploration of ULS for estimating CH of sugar beet (R² = 0.70), wheat (R² = 0.78), and potato (R² = 0.50) (Jelle ten Harkel et al., 2019), and DAP for measuring corn CH (R² = 0.78) (Su et al., 2019). In all, current studies demonstrated that TLS and BLS usually performed better than ULS and DAP due to their close range of sensing, and the accuracy of DAP was comparable to ULS.

In addition to the exploration of high estimation accuracy, more and more studies are attempting to explore the genetic bases (e.g., heritability) of high-throughput phenotype (Song et al., 2021; Xiao et al., 2022; Tao et al., 2022). CH is a high heritability trait, as effective as yield (Singh et al., 2005). Higher heritability indicates that the environment has less influence on the trait, and further describes the value of breeding (Oumata et al., 2022; Chandana et al., 2022). Several studies have already verified the potential of CH from many 3D sensing platforms, including the use of LiDAR (Kronenberg et al., 2017; Walter et al., 2019) and UAV imagery (Volpato et al., 2021). Interestingly, recent studies declared 3D sensing-derived CH showed better heritability than field measurement. For example, Madec et al. (2017) proved high heritability values (H² > 0.90) of CH derived from both LiDAR and DAP; Volpato et al. (2021) compared the height heritability from UAV imagery (H² = 0.71–0.97) and field measurement (H² = 0.62–0.96) across four different growth stages (GS), which showed the UAV imagery had better heritability. These novel studies inspire us to rethink a questionable and challenging question: is field-measured CH as accurate and heritable as believed?

Some critical discussions about the accuracy of field-measured CH have been raised in recent years. On the one hand, field measurements are believed as accurate benchmarks. For example, Wang et al. (2018) found the heights measured by the LIDAR-Lite v2, the Kinect v2 camera, ultrasonic, and the imaging array sensors had high correlations (r ≥ 0.90) with manual measurements. They believed that the errors among sensors and field measurements come from the sensor's error. On the other hand, more and more studies emphasized that there may be systematic errors in the ground truth values. For example, Maesano et al. (2020) pointed out that LiDAR can detect more precise height differences than field measurement by comparing the accuracy of grass CH derived from ULS and field measurement. The inaccuracy of field-measured CH may be attributed to the variations of CH (Walter et al., 2019) and canopy structure (Zhou et al., 2020). Similarly, the heritability between 3D sensing and field measurement is also worth exploring.

This study aims to compare CH extraction accuracy and heritability from field measurements and four different proximal 3D sensing technologies, including TLS, BLS, GLS, and DAP in a wheat field of different varieties across different growth stages. Unlike previous studies, we make the following contributions: 1) systematically evaluating the accuracy of different data sources (TLS, BLS, GLS, DAP, and FM) in estimating CH, 2) exploring the variations of height measurement accuracy concerning different CH, LAI, and the GS groups, 3) deciphering the error sources of CH measurement among different data sources, and 4) exploring the heritability of 3D sensing data sources in estimating CH.

2.1 Study area and experimental design

The study area was located at the Baima Experimental Station (119°18′71″E, 31°62′00″N) of Nanjing Agricultural University, China. A total of 480 plots were cultivated with 120 wheat varieties, two treatments of nitrogen fertilization (0 and 240 kg/ha), and two replications. The plot size is 1m × 1m with a plot spacing of 0.5m, row spacing of 0.25 m, and sowing density of 300 seeds/m² (Fig. 1a). Different varieties, nitrogen treatments, and growth stages provided diverse canopy structure for further comparison of CH from different data sources.

2.2 Data collection

To make a systematic comparison of different height measurement methods, TLS, BLS, GLS, and DAP were selected to collect 3D data at four key growth stages that were jointing (134 days after seeding/DAS), heading (151 DAS), flowering (174 DAS), and maturity stages (188 DAS). Some important technical specifications used by the four 3D sensing systems are presented in Table 1. Meanwhile, field-measured CH and LAI were implemented with a ruler and the Sunscan Canopy Analyzer (Delta-T Devices Ltd, U.K.). Different data sources were collected within 1 to 2 days at each growth stage to ensure cross-comparability.

Table 1

Technical specifications of TLS, BLS, GLS, and DAP systems.
	TLS	BLS	GLS	DAP
System	FARO Focus^3D S70	LiBackpack D50	PlantEye F500	DJI Phantom4
Laser wavelength, nm	1550	905	940	RGB image
Field of view, º	H: 360º V: 300º	H: 360º V: -90º~ +90º	~ 53º	94º
Detection range, m	0.60–70 @10% ref.	100 @ 20% ref.	0.40–1.50	Flight altitude: 10-6000
Data resolution	0.30 mm @ 10m@ 90% ref.	30 mm	H: ~0.59 mm V: ~1.62 mm	4000$\times$3000 pixels
Weight, kg	4.20	8	8.30	0.138
Size, mm	240×200×100	960×300×318	440×210×99	Wheelbase: 350
Battery capacity, h	4.50	2	unlimited	0.50

2.2.1 TLS data

The TLS data was collected using the FARO Focus^3D S70 scanner (FARO Technology Inc, FL, USA). The sensor weight is 4.2 kg with a size of 240 mm × 200 mm × 100 mm. The field of view is 360°× 300°. The sensor emits lasers at a wavelength of 1550 nm and a pulse emitting rate of 244 kHz. The detection range is 0.6–70 m with upright incidence to a 10% reflective surface. The scanning accuracy is 0.3mm @10m @ 90% reflectance (Table 1).

The LiDAR sensor was mounted on a tripod (around 1.8 m above the ground) that was placed uniformly in the study area (Fig. 1b). The north-south and east-west distances between the two scanning locations were around 4 m and 7.5 m, respectively. The operating mode of the sensor was set as “Outdoor within 10 m Scanning Profile” without color information, which is suitable for acquiring detailed information with high efficiency (~ 5 min/scan) within a short distance (< 10 m) (Jin et al., 2020a). A total of 65 scans were implemented over the entire wheat field (Fig. 2b), taking around 6 hours.

2.2.2 BLS data

The BLS data was acquired using the LiBackpack D50 system (Green Valley International Ltd., Beijing, China) that was equipped with two Velodyne VLP-16E sensors (Velodyne Lidar Inc., San Joe, CA, USA). The system weight is about 8 kg with a size of 960 mm ⋅300 mm ⋅318 mm. The field of view is 360°× 180° (-90º~ +90º). The sensor emits lasers at a wavelength of 905 nm and a pulse emitting rate of 30 kHz. The detection range is 100 m with upright incidence to a 20% reflective surface. The scanning accuracy is ±3 cm (Table 1).

BLS was carried on the shoulder (Fig. 1c), enabling efficient and flexible mobile acquisition. Because BLS uses the SLAM (simultaneous location and mapping) algorithm for data acquisition, the moving trajectory was designed like a series of closed “triangles” (Fig. 2c). The collection time was around 20 min for the whole field.

2.2.3 GLS data

The GLS data was acquired by using the FieldScan Phenotyping Platform (Fig. 1c), which is equipped with four high-resolution 3D laser scanners, PlantEye F500 (Phenospex Inc, Heerlen, The Netherlands) (Fig. 1d). The sensor weight is around 8.3 kg with a size of 440 mm × 210 mm × 99 mm, and the field of view is around 53°. The sensor emits lasers with a wavelength of 940 nm and a pulse emitting rate of 50 XZ-profiles/s. The ranging distance is between 0.4–1.5 m. The sensors scanning accuracies will decrease with the increase of distance along the vertical height range. The average horizontal and vertical resolutions are around 0.59 mm and 1.62 mm, respectively (Table 1).

The sensor system was carried by a gantry at a height of 1.5 m. The FieldScan system traveled automatically in the field with a defined regular trajectory (Fig. 2e). The system repeatedly collected data day and night for the whole field. Each round of collection took around 4.5 hours, and then the system sleep 1.5 hours before the next round of collection. Notably, the system removed the lowest 0.28 m points to filter out ground points by default, which means the equipment can monitor a maximum height of 0.82 m.

2.2.4 DAP data

The DAP data was collected using the DJI Phantom4 drone (SZ DJI Technology Co., Shenzhen, China) by carrying an RGB camera (Fig. 1e). The camera has a resolution of 4000 pixels × 3000 pixels. The field of view is 94°. Flight missions were planned using the Pix4D Capture software (PIX4D S.A., Lausanne, Switzerland). To balance the problem of acquisition accuracy and efficiency (Kawamura et al., 2020), we carried out comparisons at different flight altitudes, including 10 m, 20 m, 30 m, and 40 m. The 20 meters was selected because its accuracy is comparable to 10 m and higher than 30 m and 40 m (Fig. S1). Oblique imageries were collected to ensure substantial overlap and reduced systematic errors (James and Robson, 2014). Meanwhile, the cross fight was set up, covering an east-west and a north-south flight trajectories, to improve 3D reconstruction accuracy from images (Fig. 2g). Specifically, the forward and side overlaps were both set as 80%. The camera angle during the flight was set to 80° by referring to Rosnell and Honkavaara (2012). Seven ground control points were set up for image quality control in the field. A total of 216 images were collected during a 20 min flight.

2.2.5 Field measurements

In this study, the field CH is defined as the vertical distance from the ground to the highest point of a canopy in the natural growth state. In each plot, CHs were measured with a ruler of mm precision at three locations that look uniform and representative. The three replicated measurements were averaged as the reference CH (Fig. 1f) (Walter et al., 2019). LAI was defined as half the total intercepting leaf area pre-unit ground area (Chen and Black, 1992). LAI was measured with a SunScan Canopy Analyzer (Delta-T Devices Ltd, Cambridge, U.K.) that has a 1-meter light-sensitive probe with 64 equally spaced photodiodes. The SunScan Canopy Analyzer estimate LAI by measuring the gap fraction (Potter et al., 1996). In each plot, the probe was inserted into the bottom of the canopy and parallel to the row direction (Sone et al., 2009; Ogunbadewa, 2012). Three replicated measurements were implemented and averaged as the reference LAI (Fig. 1g).

3.1 Data preprocessing

Different 3D sensing data need to be first processed into point clouds with different methods before sharing similar point processing methods. TLS data at different scanning locations were automatically registered to generate a point cloud using SCENE software (FARO Technology Inc, FL, USA). BLS was registered during data collection because the system used the SLAM algorithm (Su et al., 2021). GLS data registration was implemented according to the relative position of sensors and the point features using the commercial HortControl software (Phenospex Inc, Heerlen, The Netherlands). DAP images were used to reconstruct the 3D point cloud using the PiX4D mapper software (Pix4D, Lausanne, Switzerland). Once the 3D point clouds were generated, the following data processing processes were similar (Fig. 3).

The generated 3D point cloud data were further processed with a standard pipeline using the LiDAR360 software (Green Valley International Ltd., Beijing, China), including clipping, denoising, filtering, and normalization (Fig. 3). Clipping and denoising were manually implemented to ensure better accuracy, especially avoiding the loss of points in the sparse DAP and BLS point cloud. Filtering was first implemented using an integrated algorithm (i.e., improved progressive triangulated irregular network densification filtering algorithm), and the automatic results were carefully checked and revised to decrease process errors. Normalization was achieved by subtracting the height of each point from the height of its nearest ground point in the horizontal direction. Specifically, GLS data was filtered with a given height threshold of 0.28 m and normalized during data collection. The normalized 3D point clouds of TLS, BLS, GLS, and DAP were shown in Fig. 2b, d, f, g. Taking pre-processed data at the heading stage as an example, the point density of TLS data is the highest (929021.12 pts/m²), followed by GLS (697092.18 pts/m²), DAP (40051.30 pts/m²), and BLS (17761.30 pts/m²). Meanwhile, the final point resolution, denoted by the average adjacent point distance, from fine to coarse was GLS (1.07 mm), TLS (2.46 mm), DAP (12.73 mm), and BLS (15.02 mm) (Table 2).

Table 2

Key information about the data quality of the preprocessed point clouds (taking data at the heading stage as an example) and the roughly estimated platform cost and data cost.
Data sources	Point density (pts/m²)	Point resolution (mm)	Data volume, GB	Platform cost,$	Data Cost, h
Data sources	Point density (pts/m²)	Point resolution (mm)	Data volume, GB	Platform cost,$	Collection	Preprocessing	Total
TLS	929021.12	2.46	11.40	46010.00	5.00	79.20	84.20
BLS	17761.30	15.02	0.22	70515.00	0.30	2.30	2.60
GLS	697092.18	1.07	8.68	1567000.00	8.00	2.00	6.50
DAP	40051.30	12.73	0.48	1253.60	0.50	15.70	16.20

Plot extraction is the prerequisite for CH extraction of each plot. Because different sources of point clouds have their sensor coordinate systems, this study manually aligned these data into the same coordinate origin and north-south directions in LiDAR360 software. After that, 480 plots of different source data at each growth stage can be extracted using a shared plot bounding box map defined manually (Fig. 3).

3.2 Canopy height extraction

CH can be extracted from the normalized point cloud using different statistical metrics. In this study, Hmax, the maximum z value of all normalized points, was extracted. Meanwhile, difference height quantiles from 99% quantile height (i.e., H99) to 80% quantile height (i.e., H80) with an interval of 1% were also extracted (Jin et al., 2019). These different height representations are compared and the optimal one was selected for comparing different sensing technologies.

3.3 Cross-comparisons of canopy height estimates from field measurement and 3D sensing

The accuracies of the CH measured by different 3D sensing data were compared with the field measurement, and the cross-comparisons of different 3D sensing performances were also evaluated. Specifically, the comparisons between sensor data with field measurement include TLS vs.FM, BLS vs.FM, GLS vs.FM, DAP vs.FM, and the cross-comparisons include TLS vs. BLS, BLS vs. DAP, DAP vs. TLS, TLS vs. GLS, BLS vs. GLS, and DAP vs. GLS.

This study further evaluated the accuracy of different methods with respect to different field-measured CH groups, LAI groups, and GS groups, which are important indicators of canopy structure (Luo et al., 2015; Ma and Liang, 2022) and affect the accuracy of CH monitoring. Four CH groups were considered, including 0.3–0.6 m (CH1), 0.6–0.8 m (CH2), 0.8-1 m (CH3), and 1-1.4 m (CH4). Each height group contains 360, 918, 501, and 141 plots, respectively. Four LAI groups were separated at 0–2 m²/m² (LAI1), 2–4 m²/m² (LAI2), 4–6 m²/m² (LAI3), and 6–8 m²/m² (LAI4). Each group contains 874, 641, 340, and 65 plots, respectively. Four compared growth stages were jointing stages, heading stages, flowering stages, and maturity stages.

Specifically, considering the scanning range and height threshold setting in filtering, the effective maximum height of the GLS system is 0.82 m. Therefore, only the plots that have a maximum measured height lower than 0.82 m were selected for comparison with GLS (1365 plots) in this study. Because there are a few plots belonging to the CH3 group and no plots belonging to the CH4 group, we only evaluated the GLS accuracies of CH1 and CH2 (360 and 918 plots, respectively).

The accuracy between the two compared groups was evaluated by Pearson’s correlation coefficient (r), root mean square error (RMSE), relative RMSE (RMSE%), Bias, and relative Bias (Bias%).

$r=\sqrt{1-\frac{\sum {\left({y}_{i}-{\widehat{y}}_{i}\right)}^{2}}{\sum {\left({y}_{i}-\overline{{y}_{i}}\right)}^{2}}}$

(1)

$RMSE=\sqrt{\frac{1}{n}\sum _{i=1}^{n}{({y}_{i}-{\widehat{y}}_{i})}^{2}}$

(2)

$RMSE\%=\left(\frac{RMSE}{\overline{{y}_{i}}}\right)\times 100$

(3)

$Bias={\sum }_{i=1}^{n}\left({y}_{i}-{\widehat{y}}_{i}\right)/n$

(4)

$Bias\%=\left(\frac{Bias}{\overline{{y}_{i}}}\right)\times 100$

(5)

where i represents a sample index, n represents the number of samples, y_i represents reference measurements (e.g., FM), $\widehat{{y}_{i}}$ represents predicted CH from different 3D sensing datasets, and $\stackrel{-}{{y}_{i}}$ is the mean of y_i .

Moreover, the CHs of different data sources were compared in terms of broad-sense heritability (H²). Broad-sense heritability was defined as the proportion of heritability variance (Visscher et al., 2008). In this study, the interaction effect of different varieties and N treatments was considered, i.e., G by E.

$${H}^{2}=\frac{{\sigma }_{G}^{2}}{{\sigma }_{G}^{2}+\frac{{\sigma }_{GE}^{2}}{e}+\frac{{\sigma }_{\epsilon }^{2}}{re}}$$

where ${H}^{2}$ is broad-sense heritability, ${\sigma }_{G}^{2}$ is genetic variance,${{\sigma }}_{\text{G}\text{E}}^{2}$ is the gene-environment interaction variance, ${{\sigma }}_{{\epsilon }}^{2}$ is the error variance, e is the number of N treatments, and r is the number of replicates per genotype.

3.4 Error source analysis

As we know, CHs measured by different methods will not be exactly the same. This study analyzed which data source the error comes from by referring to the method of Wang et al. (2019). First, we calculate the relative residual between the 3D sensing estimated CHs and FM (Eq. 7). Then, screening out the plots where the above calculated relative residuals greater than 20% as the suspicious cases (S) (Eq. 8). The intersections of S_TLS, S_BLS, S_GLS, and S_DAP were defined as the errors due to FM (Error_FM) (Eq. 9). Based on Error_FM, the intersection of S_TLS, S_BLS, S_GLS, S_DAP, and non-Error_FM was defined as the errors due to TLS (Error_TLS), BLS (Error_BLS), GLS (Error_GLS), and DAP (Error_DAP), respectively (Eq. 10–13). Notably, when regarding TLS or any other 3D sensing datasets as the errors, it is not mean the other three 3D sensing datasets do not contain outliers because the conditions for Error_FM are very strict.

The relative residual ${\varDelta }_{\left(a,field\right)}^{i}$, S_a, and Error_TLS, Error_BLS, Error_GLS, and Error_DAP were defined as below:

${\varDelta }_{\left(a,field\right)}^{i}=\left|{H}_{a}^{i}-{H}_{filed}^{i}\right|/{H}_{field}^{i}$

(7)

${S}_{a}=\left\{{P}^{i}|{\varDelta }_{\left(a,field\right)}^{i}\ge 0.2\right\}$

(8)

$Error\_FM=\left\{{P}^{i}|{S}_{TLS}\cap {S}_{BLS}\cap {S}_{GLS}\cap {S}_{DAP}\right\}$

(9)

$Error\_TLS=\left\{{P}^{i}|{S}_{TLS}\cap (!Error\_field)\right\}$

(10)

$Error\_BLS=\left\{{P}^{i}|{S}_{BLS}\cap (!Error\_field)\right\}$

(11)

$Error\_GLS=\left\{{P}^{i}|{S}_{GLS}\cap (!Error\_field)\right\}$

(12)

$Error\_DAP=\left\{{P}^{i}|{S}_{DAP}\cap (!Error\_field)\right\}$

(13)

where i represents a sample index, ${\varDelta }_{\left(a,field\right)}^{i}$ is the relative residual, ${H}_{a}^{i}$ and S_a represents predicted CH and the suspicious cases from 3D sensing datasets, where a can be TLS, BLS, GLS, and DAP, Error_FM, Error_TLS, Error_BLS, Error_GLS, and Error_GLS represent the errors from FM, TLS, BLS, GLS, and DAP, respectively.

4.1 Canopy height from different 3D sensing datasets

To fairly compare different 3D sensing datasets for CH estimation, it is important to first explore which height representation metric is optimal according to their correlations with FM. In this study, the influences of different height quantiles for CH extraction were evaluated using point clouds of all stages. The results showed that the evaluation accuracy of TLS, BLS, and GLS were all high and stable when using different height quantiles (Fig. 4). By contrast, height estimation accuracy from DAP data was lower and more sensitive to the selection of height quantiles. According to the highest correlation (Fig. 4) and the lowest error metrics (Fig. S2), H99 was selected as the best representation of CH for TLS, GLS, and DAP, while H96 was the best for BLS. These best height quantiles (H99 or H96) for each data source was used for all subsequent analysis.

The best correlations of TLS vs. FM, BLS vs. FM, GLS vs. FM, and DAP vs. FM were 0.89, 0.89, 0.82, and 0.83, respectively (Fig. 5). The fitted lines of TLS, BLS, and DAP were close to the reference lines (1:1) except a little overestimation when CH was small (Fig. 5). In contrast, GLS showed an overall underestimation (Fig. 5c).

Cross-comparisons among different sensor datasets showed higher correlations (r) ranging from 0.87 to 0.97, which was much higher than the above comparisons with FM (0.82–0.89). The highest correlation value is 0.97 between TLS and BLS (Fig. 6a), followed by TLS vs. GLS (r = 0.94) (Fig. 6d), BLS vs. GLS (r = 0.93) (Fig. 6e), DAP vs. TLS (r = 0.90) (Fig. 6c), BLS vs. DAP (r = 0.90) (Fig. 6b), and DAP vs. GLS (r = 0.87) (Fig. 6f). Among them, DAP had a relative lager RMSE with other sensing datasets (RMSE > 0.05 m, Fig. 6b, c, f), especially the comparison with BLS (RMSE = 0.08m, Fig. 6b). Moreover, the fitting Bias are all very small (0.01m) except for comparisons with GLS (Fig. 6d, e, f). Although GLS showed an overall underestimation, it still keeps a low RMSE (0.04m − 0.05m) with other 3D sensing datasets.

4.2 Comparing canopy height measurement of different methods among different canopy height groups

The correlation coefficients of CHs derived from 3D sensing and FM decreased obviously when evaluated with respect to different subgroups of CH (r < 0.71). Similarly, the correlation coefficients of cross-comparisons of different 3D sensing also decreased, although the largest r was up to 0.93 (Table 3).

Table 3

Detailed statistics on comparing canopy height measurement methods. The top side of the table showed the evaluation results of 3D sensing datasets with field-measured (FM); the bottom side of the table showed the results of 3D sensing datasets cross-comparisons. RMSE and RMSE%, Bias and Bias%, and correlation coefficient (r) were given for distinct canopy height (CH) groups. The underlined values were the best result for each CH group among different comparisons.
CH group	RMSE, m (RMSE%)	Bias, m (Bias%)	r	RMSE, m (RMSE%)	Bias, m (Bias%)	r
	TLS vs. FM			BLS vs. FM
CH1	0.06 (10.92)	0.05 (10.24)	0.66	0.07 (13.65)	0.06 (12.60)	0.65
CH2	0.07 (10.48)	0.05 (6.79)	0.56	0.08 (10.96)	0.06 (7.85)	0.54
CH3	0.08 (9.57)	0.01 (0.95)	0.49	0.08 (8.75)	0.01 (1.48)	0.52
CH4	0.09 (7.98)	0.01 (0.86)	0.59	0.08 (7.58)	0.01 (0.99)	0.64
Mean	0.08 (9.74)	0.03 (4.71)	0.58	0.08 (10.23)	0.04 (5.73)	0.59
	GLS vs. FM			DAP vs. FM
CH1	0.06 (11.39)	-0.09(-17.96)	0.64	0.07 (12.91)	0.04 (8.15)	0.71
CH2	0.07 (9.82)	-0.11(-15.49)	0.56	0.09 (13.10)	0.05 (6.76)	0.51
CH3	-	-	-	0.11 (12.79)	0.00 (-0.17)	0.36
CH4	-	-	-	0.14 (12.25)	0.02 (-1.49)	0.50
Mean	0.06 (10.6)	-0.10 (-16.72)	0.60	0.10 (12.76)	0.02 (3.31)	0.52
	TLS vs. BLS			TLS vs. GLS
CH1	0.05 (8.89)	0.01 (2.15)	0.84	0.03 (5.32)	0.14 (-25.57)	0.92
CH2	0.04 (4.98)	0.01 (0.99)	0.91	0.04 (5.21)	0.16 (-20.86)	0.88
CH3	0.04 (4.30)	0.00 (0.52)	0.91	-	-	-
CH4	0.04 (3.71)	0.00 (0.13)	0.93	-	-	-
Mean	0.04 (5.47)	0.01 (0.95)	0.89	0.03 (5.27)	-0.15(-23.22)	0.90
	BLS vs. DAP			BLS vs. GLS
CH1	0.06 (10.80)	-0.02 (-3.95)	0.75	0.04 (7.47)	0.16 (-27.14)	0.82
CH2	0.07 (8.98)	-0.01 (-1.00)	0.77	0.04 (5.31)	0.17 (-21.64)	0.88
CH3	0.09 (9.80)	-0.01 (-1.62)	0.69	-	-	-
CH4	0.11 (9.46)	-0.03 (-2.45)	0.74	-	-	-
Mean	0.08 (9.76)	-0.02 (-2.26)	0.74	0.04 (6.39)	-0.16(-24.39)	0.85
	DAP vs. TLS			DAP vs. GLS
CH1	0.04 (7.58)	0.01 (1.93)	0.83	0.06 (13.67)	0.13 (31.82)	0.79
CH2	0.06 (7.83)	0.00 (0.03)	0.75	0.07 (12.38)	0.16 (26.33)	0.73
CH3	0.07 (8.36)	0.01 (0.01)	0.65	-	-	-
CH4	0.07 (6.66)	0.03 (0.03)	0.75	-	-	-
Mean	0.06 (7.61)	0.01 (1.37)	0.74	0.05 (7.95)	-0.15(-22.49)	0.76

Note: - represents the comparison was not available due to the limited ranging ability of the GLS system.

As for comparing 3D sensing with FM, GLS was the best according to the highest mean r (0.60), followed by BLS (mean r = 0.59), TLS (mean r = 0.58), and DAP (mean r = 0.52) (Table 3). From the prospect of subgroup comparisons, the best methods for estimating CH1, CH2, CH3, and CH4 were DAP (mean r = 0.71), TLS (mean r = 0.56), BLS (mean r = 0.52), and BLS (mean r = 0.64), respectively (Table 3). The fitting lines of TLS, BLS, and DAP were very close to the reference lines in CH3 and CH4 groups, while slight overestimation appeared in CH1 and CH2 groups (Fig. 7). Consistently, GLS showed underestimation in both CH1 and CH2 groups (Fig. 7c).

The cross-comparisons of different 3D methods showed much higher correlation values. Among them, TLS vs. GLS showed the highest correlation (mean r = 0.90), followed by TLS vs. BLS (mean r = 0.89), BLS vs. GLS (mean r = 0.85), DAP vs. GLS (mean r = 0.76), DAP vs. TLS (mean r = 0.74), and BLS vs. DAP (mean r = 0.74). From the perspective of subgroup comparisons, the most consistent method for estimating CH1 was TLS vs. GLS, and the most consistent methods for estimating CH2, CH3, and CH4 were always TLS vs. BLS (Table 3).

The fitted lines for TLS vs. BLS and BLS vs. DAP were both close to 1:1 for different CH groups (Fig. S3 a, b). DAP vs. TLS showed overestimation at low heights and underestimation at high heights for each height group (Fig. S3 c). Underestimations also almost existed in comparisons between GLS and other 3D sensing datasets at every CH group (Fig. S3 d-f). The fitted line of GLS vs. TLS was nearly parallel to the reference line, while underestimations to other 3D data become more obvious with height growth.

4.3 Comparing canopy height measurement of different methods among different LAI groups

The correlation coefficients of CHs derived from 3D sensing and FM only decreased slightly (mean r = 0.79 to 0.87) with respect to different LAI groups. Likewise, the correlation coefficients of cross-comparisons of different 3D sensing also decreased slightly (mean r = 0.84 to 0.96), with little change for TLS vs. BLS (Table 4).

Table 4

Detailed statistics on comparing canopy height measurement methods. The top side of the table showed the evaluation results of 3D sensing datasets with field-measured (FM); the bottom side of the table showed the results of 3D sensing datasets cross-comparisons. RMSE and RMSE%, Bias and Bias%, and correlation coefficient © were given for distinct leaf area index (LAI) groups. The underlined values were the best result for each LAI group among different comparisons.
LAI group	RMSE, m (RMSE%)	Bias, m (Bias%)	r	RMSE, m (RMSE%)	Bias, m (Bias%)	r
	TLS vs. FM			BLS vs. FM
LAI1	0.07 (9.11)	0.03 4.71)	0.93	0.06 (8.78)	0.04 (5.13)	0.93
LAI2	0.09 11.43)	0.04 5.32)	0.86	0.09 11.74)	0.05 (7.11)	0.85
LAI3	0.07 (9.79)	0.03 3.88)	0.83	0.08 10.64)	0.03 (4.68)	0.79
LAI4	0.07 (8.98)	0.04 5.46)	0.86	0.07 (9.10)	0.05 (6.16)	0.84
Mean	0.07 (9.83)	0.04 4.84)	0.87	0.07 10.06)	0.04 (5.77)	0.85
	GLS vs. FM			DAP vs. FM
LAI1	0.05 (7.66)	-0.09 (-14.98)	0.91	0.08 (11.51)	0.02 (2.43)	0.89
LAI2	0.08 12.05)	-0.10 (-14.99)	0.73	0.13 (16.53)	0.04 (5.61)	0.77
LAI3	0.06 (8.86)	-0.14 (-19.58)	0.78	0.07 (9.89)	0.03 (4.09)	0.77
LAI4	0.06 (7.85)	-0.13 (-18.74)	0.73	0.06 (8.55)	0.04 (5.78)	0.82
Mean	0.06 (9.1)	-0.12 (-17.07)	0.79	0.09 (11.62)	0.03 (4.48)	0.81
	TLS vs. BLS			TLS vs. GLS
LAI1	0.04 (5.05)	0.00 (0.40)	0.97	0.03 (4.46)	-0.15 (-21.93)	0.97
LAI2	0.04 (5.25)	0.01 (1.70)	0.97	0.04 (5.4)	-0.15 (-20.89)	0.94
LAI3	0.04 (5.68)	0.01 (0.77)	0.94	0.04 (6.02)	-0.16 (-22.76)	0.90
LAI4	0.04 (4.74)	0.01 (0.67)	0.95	0.05 ( 6.33)	-0.17 (-22.32)	0.82
Mean	0.04 (5.18)	0.01 (0.88)	0.96	0.04 (5.55)	-0.16 (-21.98)	0.91
	BLS vs. DAP			BLS vs. GLS
LAI1	0.07 (8.64)	0.02 (-2.56)	0.93	0.04 (5.48)	-0.15 (-22.54)	0.95
LAI2	0.09 11.08)	0.01 (-1.40)	0.89	0.04 (5.51)	-0.17 (-22.21)	0.94
LAI3	0.06 (8.52)	0.00 (-0.56)	0.82	0.05 (7.00)	-0.17 (-23.52)	0.86
CH4	0.07 (8.33)	0.00 (-0.36)	0.80	0.06 (7.38)	-0.18 (-23.24)	0.74
Mean	0.07 (9.14)	-0.01 (-1.22)	0.86	0.05 (6.34)	-0.17 (-22.88)	0.87
	DAP vs. TLS			DAP vs. GLS
LAI1	0.06 (8.21)	0.02 (-2.18)	0.94	0.04 (6.48)	-0.13 (-20.08)	0.93
LAI2	0.10 12.06)	0.00 (0.28)	0.87	0.07 (9.79)	-0.15 (-21.1)	0.80
LAI3	0.06 (8.14)	0.00 (0.20)	0.84	0.06 (7.96)	-0.17 (-23.75)	0.81
LAI4	0.06 (8.12)	0.00 (0.31)	0.82	0.05 (6.33)	-0.18 ( -23.65)	0.81
Mean	0.07 (9.32)	0.00 (0.36)	0.87	0.06 (7.64)	-0.16 (-22.14)	0.84

As for comparing 3D sensing with FM, TLS was the best according to the highest mean r (0.87), followed by BLS (mean r = 0.85), DAP (mean r = 0.81), and GLS (mean r = 0.79). From the presence of subgroup comparisons, the best method for estimating the height of the LAI1 group were BLS (mean r = 0.93) and TLS (mean r = 0.93), while the best method for LAI2, LAI3, and LAI4 was always TLS (mean r༞0.83) (Table 4). The fitting lines of TLS, BLS, and DAP were very close to the reference lines in all LAI groups, while GLS showed underestimation in all LAI groups (Fig. 8).

As for cross-comparison of different 3D methods, TLS vs. BLS showed the highest correlation (mean r = 0.96), followed by TLS vs. GLS (mean r = 0.91), BLS vs. GLS (mean r = 0.87), DAP vs. TLS (mean r = 0.87), BLS vs. DAP (mean r = 0.86), and DAP vs. GLS (mean r = 0.84). From the perspective of subgroup comparisons, the most consistent methods for estimating LAI1 were TLS vs. BLS and TLS vs. GLS (mean r = 0.97). Besides, the most consistent methods for estimating LAI3 and LAI4 were still TLS vs. BLS (mean r = 0.94 and 0.95) (Table 4).

The fitted line for TLS vs. BLS almost coincided with the reference line (Fig. S4 a). The fitted lines for BLS vs. DAP and DAP vs. TLS were also relatively close to the reference line, but they became worse when LAI increased (Fig. S4 b, c). Underestimations also existed in comparisons between GLS and other 3D sensing datasets at all LAI groups, and the correlations decreased when LAI increased (Fig. S4 d-f).

4.4 Comparing canopy height measurement of different methods among different GS groups

The correlation coefficients of CHs derived from 3D sensing and FM were less accurate (mean r = 0.65 to 0.83) with regard to different GS groups, especially for GLS vs. FM. By contrast, the correlation coefficients of cross-comparisons of different 3D sensing data decreased slightly (mean r = 0.80 to 0.94) (Table 5).

Table 5

Detailed statistics on comparing height measurement methods. The top side of the table showed the evaluation results of 3D sensing datasets with field-measured (FM); the bottom side of the table showed the results of 3D sensing datasets cross-comparisons. RMSE and RMSE%, Bias and Bias%, and correlation coefficient (r) were given for distinct growth stage (GS) groups. The underlined values were the best result for each GS group among different comparisons.
GS group	RMSE, m (RMSE%)	Bias, m (Bias%)	r	RMSE, m (RMSE%)	Bias, m (Bias%)	r
	TLS vs. FM			BLS vs. FM
J	0.04 (6.69)	0.02(3.37)	0.88	0.05 (9.58)	0.03 (5.32)	0.81
H	0.05 (6.64)	0.04(5.42)	0.92	0.05 (6.49)	0.04 (4.89)	0.91
F	0.10 (11.82)	0.03(4.10)	0.76	0.10 (12.03)	0.05 (6.41)	0.76
M	0.08 (9.85)	0.05(5.99)	0.76	0.07 (9.16)	0.05 (6.26)	0.75
Mean	0.07 (8.75)	0.04(4.72)	0.83	0.07 (9.32)	0.04 (5.72)	0.81
	GLS vs. FM			DAP vs. FM
J	0.05 (8.96)	-0.13(-22.39)	0.79	0.05 (8.94)	0.03 (5.25)	0.89
H	0.05 (7.00)	-0.10(-13.84)	0.72	0.05 (6.56)	0.05 (5.75)	0.92
F	0.08(10.96)	-0.08(-10.99)	0.45	0.14 (16.58)	0.03 (4.13)	0.64
M	0.06 (7.94)	-0.11(-14.74)	0.62	0.11 (14.18)	0.01 (0.85)	0.60
Mean	0.06 (8.71)	-0.10(-15.49)	0.65	0.09 (11.57)	0.03 (3.99)	0.76
	TLS vs. BLS			TLS vs. GLS
J	0.04 (7.61)	0.01 (1.89)	0.87	0.03 (5.15)	-0.15(-24.93)	0.93
H	0.03 (3.33)	0.00 (-0.50)	0.98	0.03 (3.88)	-0.15(-20.19)	0.91
F	0.04 (4.44)	0.02 (2.22)	0.97	0.03 (4.28)	-0.17(-20.79)	0.92
M	0.04 (5.26)	0.00 (0.25)	0.92	0.05 (6.07)	-0.16(-20.44)	0.76
Mean	0.04 (5.16)	0.01 (0.97)	0.94	0.04 (4.84)	-0.16(-21.59)	0.88
	BLS vs. DAP			BLS vs. GLS
J	0.06 (9.68)	0.00 (-0.07)	0.85	0.04(6.58)	-0.16(-26.33)	0.88
H	0.05 (5.81)	0.01 (0.82)	0.93	0.04 (4.78)	-0.15(-20.42)	0.86
F	0.08 (9.24)	-0.02 (-2.15)	0.89	0.03 (3.49)	-0.18(-22.29)	0.95
M	0.10(11.93)	-0.04 (-5.09)	0.70	0.05 ( 6.65)	-0.16(-20.92)	0.70
Mean	0.07 (9.17)	-0.01 (-1.62)	0.84	0.04 (5.51)	-0.16(-22.52)	0.86
	DAP vs. TLS			DAP vs. GLS
J	0.03 (5.1)	-0.01 (-1.78)	0.92	0.04 (7.05)	-0.16(-26.27)	0.86
H	0.04 (5.27)	0.00 (-0.31)	0.95	0.03 (4.54)	-0.15(-20.41)	0.87
F	0.07 (8.52)	0.00 (-0.02)	0.87	0.05 (5.87)	-0.17(-21.14)	0.84
M	0.09(10.98)	0.04 (5.1)	0.68	0.06 (7.76)	-0.13(-17.09)	0.61
Mean	0.06 (7.47)	0.01 (0.74)	0.86	0.05 (6.3)	-0.15(-21.23)	0.80

Note: J, H, F, and M represent jointing, heading, flowering, and maturity stages.

As for comparing 3D sensing with FM, TLS was the best according to the highest mean r (0.83), followed by BLS (mean r = 0.81), DAP (mean r = 0.76), and GLS (mean r = 0.65) (Table 5). From the perspective of subgroup comparisons, DAP was the best method for estimating CH at the jointing stage (mean r = 0.89). Moreover, TLS was also the best method for the heading, flowering, and maturity stages (Table 5). The fitting lines of TLS, BLS, and DAP were very close to the reference lines, especially at the heading stage (r = 0.72–0.92). However, GLS showed underestimation at all growth stages, which was more obvious at late stages (Fig. 9c).

As for cross-comparison of different 3D methods, TLS vs. BLS showed the highest correlation (mean r = 0.94), followed by TLS vs. GLS (mean r = 0.88), BLS vs. GLS (mean r = 0.86), DAP vs. TLS (mean r = 0.86), BLS vs. DAP (mean r = 0.84), and DAP vs. GLS (mean r = 0.80). From the perspective of subgroup comparisons, the most consistent method for estimating the jointing stage was TLS vs. GLS (mean r = 0.93), while the best methods for heading, flowering, and maturity stages were TLS vs. BLS (mean r = 0.92–0.98) (Table 5).

The fitted lines of BLS vs. TLS and DAP vs. BLS were closer to the reference line than TLS vs. DAP (Fig. S5 a, c, e). Underestimations also existed in comparisons between GLS and other 3D sensing datasets at every GS group, especially at the maturity stage (Fig. S5 d-f).

4.5 Comparing the broad sense heritability of canopy height measurement from different methods

This study found the H² of CH derived from 3D sensing datasets was overall higher than FM no matter analyzed with CH, LAI, or GS groups (Table 6). At different GS, TLS showed the highest H² (mean H² = 0.89), followed by BLS (mean H² = 0.85), GLS (mean H² = 0.85), DAP (mean H² = 0.79), and FM (mean H² = 0.77). Notably, H² of LiDAR-derived CH was larger than that derived from DAP. The overall heritability in the later growth period decreased, especially in the maturity stage.

Table 6

The values of Broad-sense heritability (H²) from different 3D sensing datasets with regard to different canopy height (CH), leaf area index (LAI), and growth stage (GS) groups.
CH group	FM	TLS	BLS	GLS	DAP
CH1	0.58	0.81	0.77	0.73	0.74
CH2	0.61	0.65	0.64	0.59	0.57
CH3	-	-	-	-	-
CH4	-	-	-	-	-
Mean	0.60	0.73	0.70	0.66	0.66
LAI group
LAI1	0.83	0.90	0.85	0.86	0.84
LAI2	-	-	-	-	-
LAI3	-	-	-	-	-
LAI3	-	-	-	-	-
Mean	0.83	0.90	0.85	0.86	0.84
GS group
Jointing	0.79	0.90	0.85	0.86	0.83
Heading	0.83	0.90	0.85	0.86	0.84
Flowering	0.77	0.94	0.86	0.79	0.85
Maturity	0.70	0.82	0.85	0.73	0.62
Mean	0.77	0.89	0.85	0.81	0.79

Note: The calculation of H ² of a variety was based on the variety’s four plot CHs, including two N treatments and two replicates. Because only CHs under 0.82m of FM were used for comparison, the H² value of varieties can be calculated only when all four plots’ CH of a variety were below 0.82m. – represents no varieties within the group that meet the above conditions.

5.1 Height quantities of 3D point cloud affect the best estimates of canopy height

Height quantities have been widely used for depicting CH due to their insensitivity to noisy points (Hu et al., 2018). However, it has been found that different height quantiles may be suitable for different 3D data with regard to different crop types (Liu et al., 2021) and sensor types (Madec et al., 2017).

In this study, we explored the effects of height quantities on the accuracy of height estimation from four kinds of 3D sensing techniques by collecting 1920 wheat plots of various varieties and nitrogen treatments at four growth stages. Our results found that H99 was the best CH quantile of TLS, GLS, and DAP, while H96 was the best for BLS data (Fig. 4). These results are reasonable considering previous studies found the best height quantiles mainly located between H90 and H99, especially near H99, such as the best height quantile for maize was H99 (R² = 0.9) (Niu et al., 2019) and H99.9 (Lu et al., 2021), for wheat was H99.5 (R² = 0.90) (Madec et al., 2017), and for soybean was H99.9 (R² > 0.85) (Liu et al., 2021).

Although the best height quantiles are similar, the influences of height quantile selection on height estimation are different. DAP was easy to lose small targets such as the leaf tips of the canopy (Niu et al., 2019). Meanwhile, DAP was difficult to capture the internal structure of the canopy (Grenzdörffer, 2014), which leads to sparse point density (Fig. 10) and may illustrate why DAP-predicted CH accuracy was more sensitive to height quantiles (Fig. 4) and had a relative lager RMSE with other sensing datasets (RMSE > 0.05 m, Fig. 6b, c, f). By contrast, TLS, BLS, and GLS can generate high-density point clouds, enabling the characterization of inner canopy structure (Fig. 10). This may illustrate why GLS are less sensitive to the selection of height quantities, so are TLS and BLS (Fig. 4). Additionally, the GLS system used in this study may lose points near the ground due to the filtering method (Fig. 10b), which illustrated the overall underestimation and relative high bias of GLS-predicted data (Fig. 5c, Fig. 6d-f, Fig. 7c, Fig. 8c). However, it had a slight influence on the overall trend of CH assessment and RMSE (r = 0.82, Fig. 5c). Notably, despite the high point resolution of GLS, its ranging extent is much closer, making it easier to be saturated when predicting higher canopies, which can be seen if all the plots are used for height estimation in this study (Fig. S6). This suggests that the choice of laser ranging extent is as important as the sensor resolution for high-precision crop phenotyping.

In conclusion, selecting the optimal height quantiles is critical in the evaluation of CH. Despite subtle differences, these best height metrics were very close in performance. Considering the more diverse datasets used in this study than in previous studies (Madec et al., 2017; Lu et al., 2021; Liu et al., 2021), the systematic evaluation of 3D sensing methods were unprecedented, which lays reliable foundations for the further cross-comparisons.

5.2 CH estimation under various height groups, LAI groups, and GS groups

The CH estimate accuracies will obviously decrease when evaluated at CH subgroups (Fig. 7). This has been rarely reported in agriculture, but some similar findings have been drawn in forest CH estimation (Kawamura et al., 2020; Wang et al., 2019). The subgroup of lower CH plots (e.g., CH1) showed higher correlations (Table. 3), which are consistent with previous studies that indicated the uncertainty of CH assessment by 3D sensing increased with height (Sun et al., 2017). This may attribute to the increasing canopy complexity (e.g., crop canopy cover and plant density) with height (Busemeyer et al., 2013; Han et al., 2019). Meanwhile, canopy senescence and logging may also influence height estimation accuracy at high-height groups.

This study found the TLS, BLS, and DAP showed overestimation in low CH groups (i.e., CH1 and CH2 groups) but are closer to field measurement in CH3 and CH4 groups (Fig. 7; Fig. 11). The possible reason is the canopy surface is not closed and looks uneven at the early stage. In this case, field measurement was hard to capture the highest CHs (observation) while the sensor measured height is the globally ranked height quantities (real max. height) of a plot. Although GLS had systematic underestimation due to its limited ranging extent, it had a better fitting effect with TLS and BLS (Fig. S3), demonstrating the high reliability of ranging precision of 3D sensing technologies under different canopy structures. It is also the high precision of the GLS system (Table 1) that may illustrate why GLS keeps a low RMSE (0.04m − 0.05m) with other 3D sensing datasets (Fig. 6d-f).

In addition, DAP-estimated height showed lower correlations with other 3D sensing datasets (Fig. S3). This may be caused by the relatively lower data quality of the DAP point cloud. DAP point cloud was reconstructed from images, which are sensitive to environmental illumination, image quality, and reconstruction algorithms (Han et al., 2018; Goodbody et al., 2019; Ali-Sisto et al., 2020). Although some studies have demonstrated that the DAP has comparable accuracy with LiDAR in monitoring canopy height (Gao et al., 2022). Our study further found that DAP showed similar better results with LiDAR in field plots with lower CH (e.g., CH1), but the accuracy would decrease at higher CH groups (Fig. 7). The decreasing accuracy may be caused by the large variations of estimated height at large CH groups where canopy structures are denser and complicated (Fig. 11).

By contrast, the CH estimate accuracies did not show an obvious decrease when evaluated at LAI or GS groups (Fig. 8 and Fig. 9). The possible reasons are the height range of data within each LAI or GS subgroup was relatively large. However, the accuracy at high LAI or late GS was also relatively lower, which may attribute to the more complex canopy structure (Holman et al., 2016; Malambo et al., 2018).

5.3 Outlier analysis of different datasets

Error source analysis revealed that 8 plots existed FM error according to our definitions in section 3.4 (Fig. 12a). In these plots, heights estimated from all 3D sensing methods were 20% greater than FM, and the heights between different 3D sensing methods were closer. This indicated that FM may be inaccurate. By contrast, there are more potential suspicious CH results estimated from GLS (451), DAP (253), BLS (224), and TLS (164) (Fig. 12). Because the conditions for determining the error sources of FM are too strict that require the FM value to be suspicious compared with all sensors’ estimations (Eq. 7), which illustrates why the number of FM anomalies is small and other sensors’ anomalies are much more.

In fact, there should be more errors coming from FM. For example, Fig. 12b shows the error source case of TLS, but it can be easily found that most TLS measurements were very consistent with BLS and DAP. This may imply FM and GLS are both suspicious, instead of TLS. Similar more suspicious cases of FM can be found in Fig. 12c-e. Although the overall underestimation of GLS data brought challenges for the above outlier analysis, the general trends still exist. As for the outlier estimations of GLS, most relative residuals were below − 20% (Fig. 12d), which was mainly caused by the lack of ground point (Fig. 10b), indicating the importance of ground filtering in CH estimation.

5.4 Field-measured canopy height may not be as accurate as believed

Our results showed that the height correlations between different 3D sensing (r = 0.87–0.97) are much better than the accuracy between 3D sensing and FM (r = 0.82–0.89). The reasons may be two aspects. On the one hand, LiDAR and DAP are both accurate surveying and mapping technologies, they have good repeatability and consistency despite a wide variety of sensors and platforms. The TLS, BLS, and GLS systems with centimeter and millimeter resolutions have been proven accurate for estimating not only height but also other 3D traits (Walter et al., 2019; Zhang et al., 2021b). On the other hand, FM may be suspicious because it is based on subjective samples and is easily influenced by the terrain and other factors (Aasen et al., 2015). Some studies have also indicated that LiDAR may be more accurate than manual inspection (Maesano et al., 2020).

Heritability is another prospect to evaluate the reliability of phenotyping methods and their potential for the breeding program (Schmidt et al., 2019). On the one hand, the differences in the heritability of different data reflect their ability to characterize the subtle differences of CH among different varieties, as mentioned by Volpato et al. (2021). Our results proved that H² of LiDAR-derived CH was larger than that derived from DAP and FM (Table 6), which may be determined by the higher accuracy of LiDAR systems, indicating the accuracy of the CH monitoring method will influence the values of H². On the other hand, the overall heritability in the later growth period decreased, which may attribute to the prominent environmental impact of nitrogen treatment in the later growth period.

5.5 Contributions and implications

This study systematically evaluated the accuracy of CH estimation from advanced 3D sensing systems (TLS, BLS, GLS, and DAP) and FM using wheat plots of different varieties, fertilization levels, and growth stages. To our knowledge, this is the first effort that uses multiple 3D sensing technologies to evaluate their reliability for estimating CH with regard to different CH, LAI, and GS groups. Moreover, we analyzed the heritability from 3D sensing datasets and FM, proving the potential advantages of 3D sensing technologies in crop breeding.

However, there are still some interesting and important directions that need to be explored in the future. First, it is meaningful to deeply analyze the effect of operating modes of different 3D sensing technologies on CH monitoring. As for TLS, the scanning location settings (e.g., positions and total numbers) is important for acquiring a high-quality (higher density and less occlusion) point cloud (Guo et al., 2019; Walter et al., 2019). Although some pioneer studies have been conducted in forestry (Yrttimaa et al., 2019), it is still needed to have a scientific workflow of TLS in agriculture to ensure not only high accuracy but also improve efficiency. BLS is an economically friendly and easy-to-use platform, designing the routine is critical for ensuring data quality and it has been discussed by Su et al. (2021). GLS is a kind of emerging phenotyping platform, which is mainly designed for crop phenotyping and has less been explored. This study highlights the necessity to integrate suitable sensors (e.g., longer-ranging ability) for different crop types, provide access to raw data, and enable more intelligent custom algorithms (e.g., filtering algorithm) for accurate phenotype extraction (Jin et al., 2020b). DAP is a low-cost system that has been widely used in phenotyping. However, the point cloud quality generated from DAP is affected by parameters such as sensor quality, camera shooting angle, routine overlap, and flight speed. This study determined the optimal flight height by a preliminary compassion experiment (Fig. S1). More parameter comparison studies are worth exploring and can refer to Hu et al. (2020). Additionally, considering the UAV-LiDAR systems are more expensive than DAP and do not have obvious advantages in data quality (Zhang et al., 2021b), this study did not compare the UAV-LiDAR systems. However, we believe UAV-LiDAR systems are getting cheaper and the data quality is a good complement to DAP due to its higher penetration ability and robustness to light environments.

Secondly, the tradeoff between precision and efficiency is worth studying. Generally, data precision was depicted by point density and resolution. High point density usually has a high resolution (Fig. 13a). The possible reason why TLS has a higher point density but a lower resolution is the multi-scan registration (Liang et al., 2016). More importantly, this study highlight that higher precision always needs a longer collection time, but does not mean more processing time (e.g., GLS) (Fig. 13). Among them, TLS has the longest data acquisition and processing time, because the reference targets and scanner need to be laboriously laid out during the scanning, and multi-scan data registration is time-consuming during reprocessing (Disney, 2019). BLS has the shortest time (collection plus preprocessing), implicating this type of mobile mapping technology is worth promoting in the future, especially as cost decreases and accuracy increases. GLS not only has the highest point resolution but also has the shortest preprocessing time and affordable collection time, which benefits from the automatic data collection system and processing software (Li et al., 2022c). However, this kind of phenotyping platform is still too expensive (Table 2). DAP has high collection efficiency, but the data quality is relatively low. Besides, the processing time of DAP is long not only caused by 3D reconstruction but also attribute to the manual de-noising process due to the low signal-to-noise ratio of the DAP point cloud. These preliminary explorations are of great significance for further in-depth and systematic analysis of cost and efficiency and the formulation of appropriate phenotypic working plans.

Finally, there is no standard for grouping CH and LAI. This study mainly divided 1920 plots into four different groups based on the value extent (maximum minus minimum) and frequency distribution. Although there are small differences in the spacing of the groupings and the number of groups is not exactly equal, the total sample sizes (i.e., 1920 plots) are unprecedented. The influence of CH, LAI, and GS on height measurement accuracy and heritability has been analyzed, but more quantitative evaluations are worth exploring, such as the specific CH and LAI thresholds for selecting the optimal measuring methods. Moreover, this study mainly studied the important CH trait in wheat, while more biological meaningful and heritable traits in more crop types need further evaluation (Li et al., 2022b; Zhu et al., 2021).

The study demonstrated novel insights on the accuracy and heritability of CH from 3D sensing and field measurement. Cross-comparisons among different sensor datasets showed higher correlations (r = 0.87 to 0.97) than comparisons with FM (r = 0.82 to 0.89). The correlation coefficients of CHs derived from 3D sensing and FM decreased obviously when evaluated with respect to different subgroups (CH, LAI, and GS), especially different CH subgroups. TLS and BLS were more reliable in monitoring CH under different subgroups according to their cross-comparisons and comparisons with FM. The outlier analysis found cases where FM may be error-prone. Moreover, 3D sensing methods showed even higher heritability than FM. Further studies about the best configurations of sensors and working plans are needed, the tradeoff between data quality and efficiency is worth exploring, and more traits deserve future efforts. These novel findings may give insights into the selection of advanced 3D sensing platforms for crop monitoring and may shed new light on the high-quality development of crop sciences (e.g. providing higher heritable traits for breeding).

Acknowledgments

Not applicable.

Author contributions

Shichao Jin designed this study; Jingrong Zang, Qing Li, and Yue Mu collected the data; Jingrong Zang, Songyin Zhang, and Shaochen Li processed the data; Jingrong Zang and Shichao Jin prepared the figures and tables; Jingrong Zang and Shichao Jin wrote the manuscript; All authors helped to revise the manuscript. All authors read and approved the final manuscript.

Data availability

The data that supports the findings of this study are available on request due to its large volume.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Ethical approval and consent to participate

Not applicable.

Funding

This study was supported by the National Natural Science Foundation of China (32201656), the Jiangsu Agricultural Science and Technology Independent Innovation Fund Project (CX (22) 1006), the JBGS Project of Seed Industry Revitalization in Jiangsu Province (JBGS [2021] 007), the High-Level Personnel Project of Jiangsu Province (JSSCBS20210271), and the Strategic Priority Research Program of the Chinese Academy of Sciences (XDA24020202).

Aasen, H., Burkart, A., Bolten, A. & Bareth, G. (2015). Generating 3d hyperspectral information with lightweight uav snapshot cameras for vegetation monitoring: From camera calibration to quality assurance. ISPRS Journal of Photogrammetry and Remote Sensing 108: 245-259.
Ali-Sisto, D., Gopalakrishnan, R., Kukkonen, M., Savolainen, P. & Packalen, P. (2020). A method for vertical adjustment of digital aerial photogrammetry data by using a high-quality digital terrain model. International Journal of Applied Earth Observation and Geoinformation 84: 101954-101963.
Busemeyer, L., Mentrup, D., Moller, K., Wunder, E., Alheit, K., Hahn, V., Maurer, H. P., Reif, J. C., Wurschum, T., Muller, J., Rahe, F. & Ruckelshausen, A. (2013). Breedvision--a multi-sensor platform for non-destructive field-based phenotyping in plant breeding. Sensors 13(3): 2830-2847.
Chandana, B. S., Mahto, R. K., Singh, R. K., Ford, R., Vaghefi, N., Gupta, S. K., Yadav, H. K., Manohar, M. & Kumar, R. (2022). Epigenomics as potential tools for enhancing magnitude of breeding approaches for developing climate resilient chickpea. Front Genet 13: 900253-900278.
Chen, J. & Black, T. A. (1992). Defining leaf area index for non‐flat leaves. Plant, Cell & Environment 15: 421-429.
Disney, M. (2019). Terrestrial lidar: A three-dimensional revolution in how we look at trees. New Phytol 222(4): 1736-1741.
Eitel, J. U. H., Höfle, B., Vierling, L. A., Abellán, A., Asner, G. P., Deems, J. S., Glennie, C. L., Joerg, P. C., LeWinter, A. L., Magney, T. S., Mandlburger, G., Morton, D. C., Müller, J. & Vierling, K. T. (2016). Beyond 3-d: The new spectrum of lidar applications for earth and ecological sciences. Remote Sensing of Environment 186: 372-392.
El-Naggar, A. G., Jolly, B., Hedley, C. B., Horne, D., Roudier, P. & Clothier, B. E. (2021). The use of terrestrial lidar to monitor crop growth and account for within-field variability of crop coefficients and water use. Computers and Electronics in Agriculture 190(6): 106416-106432.
Gao, M., Yang, F., Wei, H. & Liu, X. (2022). Individual maize location and height estimation in field from uav-borne lidar and rgb images. Remote Sensing 14(10): 2292-2311.
Gong, Y., Yang, K., Lin, Z., Fang, S., Wu, X., Zhu, R. & Peng, Y. (2021). Remote estimation of leaf area index (lai) with unmanned aerial vehicle (uav) imaging for different rice cultivars throughout the entire growing season. Plant Methods 17(1): 88-104.
Goodbody, T. R. H., Coops, N. C. & White, J. C. (2019). Digital aerial photogrammetry for updating area-based forest inventories: A review of opportunities, challenges, and future directions. Current Forestry Reports 5(2): 55-75.
Grenzdörffer, G. J. (2014). Crop height determination with uas point clouds. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences XL-1: 135-140.
Guo, T., Fang, Y., Cheng, T., Tian, Y., Zhu, Y., Chen, Q., Qiu, X. & Yao, X. (2019). Detection of wheat height using optimized multi-scan mode of lidar during the entire growth stages. Computers and Electronics in Agriculture 165(6): 104959-104968.
Han, L., Yang, G., Dai, H., Xu, B., Yang, H., Feng, H., Li, Z. & Yang, X. (2019). Modeling maize above-ground biomass based on machine learning approaches using uav remote-sensing data. Plant Methods 15(1): 10-29.
Han, X., Thomasson, J. A., Bagnall, G. C., Pugh, N. A., Horne, D. W., Rooney, W. L., Jung, J., Chang, A., Malambo, L., Popescu, S. C., Gates, I. T. & Cope, D. A. (2018). Measurement and calibration of plant-height from fixed-wing uav images. Sensors (Basel) 18(12): 4092-4113.
Hartley, R. J. L., Leonardo, E. M., Massam, P., Watt, M. S., Estarija, H. J., Wright, L., Melia, N. & Pearse, G. D. (2020). An assessment of high-density uav point clouds for the measurement of young forestry trials. Remote Sensing 12(24): 4039-4059.
Holman, F., Riche, A., Michalski, A., Castle, M., Wooster, M. & Hawkesford, M. (2016). High throughput field phenotyping of wheat plant height and growth rate in field plot trials using uav based remote sensing. Remote Sensing 8(12): 1031-1055.
Hu, P., Chapman, S. C., Wang, X., Potgieter, A., Duan, T., Jordan, D., Guo, Y. & Zheng, B. (2018). Estimation of plant height using a high throughput phenotyping platform based on unmanned aerial vehicle and self-calibration: Example for sorghum breeding. European Journal of Agronomy 95: 24-32.
Hu, T., Sun, X., Su, Y., Guan, H., Sun, Q., Kelly, M. & Guo, Q. (2020). Development and performance evaluation of a very low-cost uav-lidar system for forestry applications. Remote Sensing 13(1): 77-98.
Hyyppä, E., Yu, X., Kaartinen, H., Hakala, T., Kukko, A., Vastaranta, M. & Hyyppä, J. (2020). Comparison of backpack, handheld, under-canopy uav, and above-canopy uav laser scanning for field reference data collection in boreal forests. Remote Sensing 12(20): 3327-3358.
James, M. R. & Robson, S. (2014). Mitigating systematic error in topographic models derived from uav and ground-based image networks. Earth Surface Processes and Landforms 39(10): 1413-1420.
Jelle ten Harkel, J., Bartholomeus, H. & Kooistra, L. (2019). Biomass and crop height estimation of different crops using uav-based lidar. Remote Sensing 12(1): 17-35.
Jin, S., Su, Y., Gao, S., Wu, F., Hu, T., Liu, J., Li, W., Wang, D., Chen, S., Jiang, Y., Pang, S. & Guo, Q. (2018). Deep learning: Individual maize segmentation from terrestrial lidar data using faster r-cnn and regional growth algorithms. Frontiers in Plant Science 9: 866-875.
Jin, S., Su, Y., Song, S., Xu, K., Hu, T., Yang, Q., Wu, F., Xu, G., Ma, Q., Guan, H., Pang, S., Li, Y. & Guo, Q. (2020a). Non-destructive estimation of field maize biomass using terrestrial lidar: An evaluation from plot level to individual leaf level. Plant Methods 16(3): 69-89.
Jin, S., Su, Y., Wu, F., Pang, S., Gao, S., Hu, T., Liu, J. & Guo, Q. (2019). Stem–leaf segmentation and phenotypic trait extraction of individual maize using terrestrial lidar data. IEEE Transactions on Geoscience and Remote Sensing 57(3): 1336-1346.
Jin, S., Su, Y., Zhang, Y., Song, S., Li, Q., Liu, Z., Ma, Q., Ge, Y., Liu, L., Ding, Y., Baret, F. & Guo, Q. (2021a). Exploring seasonal and circadian rhythms in structural traits of field maize from lidar time series. Plant Phenomics 2021(4): 9895241-9895256.
Jin, S., Su, Y., Zhao, X., Hu, T. & Guo, Q. (2020b). A point-based fully convolutional neural network for airborne lidar ground point filtering in forested environments. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 13: 3958-3974.
Jin, S., Sun, X., Wu, F., Su, Y., Li, Y., Song, S., Xu, K., Ma, Q., Baret, F., Jiang, D., Ding, Y. & Guo, Q. (2021b). Lidar sheds new light on plant phenomics for plant breeding and management: Recent advances and future prospects. ISPRS Journal of Photogrammetry and Remote Sensing 171(5): 202-223.
Kawamura, K., Asai, H., Yasuda, T., Khanthavong, P., Soisouvanh, P. & Phongchanmixay, S. (2020). Field phenotyping of plant height in an upland rice field in laos using low-cost small unmanned aerial vehicles (uavs). Plant Production Science 23(4): 452-465.
Kronenberg, L., Yu, K., Walter, A. & Hund, A. (2017). Monitoring the dynamics of wheat stem elongation: Genotypes differ at critical stages. Euphytica 213(7): 157-170.
Li, A., Hao, C., Wang, Z., Geng, S., Jia, M., Wang, F., Han, X., Kong, X., Yin, L., Tao, S., Deng, Z., Liao, R., Sun, G., Wang, K., Ye, X., Jiao, C., Lu, H., Zhou, Y., Liu, D., Fu, X., Zhang, X. & Mao, L. (2022a). Wheat breeding history reveals synergistic selection of pleiotropic genomic sites for plant architecture and grain yield. Mol Plant 15(3): 504-519.
Li, D., Shi, G., Li, J., Chen, Y., Zhang, S., Xiang, S. & Jin, S. (2022b). Plantnet: A dual-function point cloud segmentation network for multiple plant species. ISPRS Journal of Photogrammetry and Remote Sensing 184: 243-263.
Li, Q., Jin, S., Zang, J., Wang, X., Sun, Z., Li, Z., Xu, S., Ma, Q., Su, Y., Guo, Q. & Jiang, D. (2022c). Deciphering the contributions of spectral and structural data to wheat yield estimation from proximal sensing. The Crop Journal 11: 2214-2225.
Liang, X., Kankare, V., Hyyppä, J., Wang, Y., Kukko, A., Haggrén, H., Yu, X., Kaartinen, H., Jaakkola, A., Guan, F., Holopainen, M. & Vastaranta, M. (2016). Terrestrial laser scanning in forest inventories. ISPRS Journal of Photogrammetry and Remote Sensing 115: 63-77.
Liu, F., Hu, P., Zheng, B., Duan, T., Zhu, B. & Guo, Y. (2021). A field-based high-throughput method for acquiring canopy architecture using unmanned aerial vehicle images. Agricultural and Forest Meteorology 296: 108231-108242.
Lu, J., Cheng, D., Geng, C., Zhang, Z., Xiang, Y. & Hu, T. (2021). Combining plant height, canopy coverage and vegetation index from uav-based rgb images to estimate leaf nitrogen concentration of summer maize. Biosystems Engineering 202: 42-54.
Luo, S., Liu, W., Zhang, Y., Wang, C., Xi, X., Nie, S., Ma, D., Lin, Y. & Zhou, G. (2021). Maize and soybean heights estimation from unmanned aerial vehicle (uav) lidar data. Computers and Electronics in Agriculture 182(9): 106005-106014.
Luo, S., Wang, C., Pan, F., Xi, X., Li, G., Nie, S. & Xia, S. (2015). Estimation of wetland vegetation height and leaf area index using airborne laser scanning data. Ecological Indicators 48: 550-559.
Ma, H. & Liang, S. (2022). Development of the glass 250-m leaf area index product (version 6) from modis data using the bidirectional lstm deep learning model. Remote Sensing of Environment 273: 112985-113003.
Madec, S., Baret, F., de Solan, B., Thomas, S., Dutartre, D., Jezequel, S., Hemmerle, M., Colombeau, G. & Comar, A. (2017). High-throughput phenotyping of plant height: Comparing unmanned aerial vehicles and ground lidar estimates. Front Plant Science 8(1): 2002-2016.
Maesano, M., Khoury, S., Nakhle, F., Firrincieli, A., Gay, A., Tauro, F. & Harfouche, A. (2020). Uav-based lidar for high-throughput determination of plant height and above-ground biomass of the bioenergy grass arundo donax. Remote Sensing 12(20): 3464-3484.
Malambo, L., Popescu, S. C., Murray, S. C., Putman, E., Pugh, N. A., Horne, D. W., Richardson, G., Sheridan, R., Rooney, W. L., Avant, R., Vidrine, M., McCutchen, B., Baltensperger, D. & Bishop, M. (2018). Multitemporal field-based plant height estimation using 3d point clouds generated from small unmanned aerial systems high-resolution imagery. International Journal of Applied Earth Observation and Geoinformation 64: 31-42.
Niu, Y., Zhang, L., Zhang, H., Han, W. & Peng, X. (2019). Estimating above-ground biomass of maize using features derived from uav-based rgb imagery. Remote Sensing 11(11): 1261-1282.
Ogunbadewa, E. Y. (2012). Tracking seasonal changes in vegetation phenology with a sunscan canopy analyzer in northwestern england. Forest Science and Technology 8(3): 161-172.
Oumata, S., Monneveux, P., Zaharieva, M., Mekliche-Hanifi, L. & David, J. (2022). Variation of morphological traits among wheat (triticum aestivum l.) landraces from two regions of the algerian sahara. Potential interest for wheat breeding. Genetic Resources and Crop Evolution 1: 429-445.
Potter, E., Wood, J. & Nicholl, C. (1996).Sunscan canopy analysis system: Users manual. In Delta-T Devices, Cambridge, UK.
Ravi, R., Lin, Y.-J., Shamseldin, T., Elbahnasawy, M., Masjedi, A., Crawford, M. & Habib, A. (2018). Wheel-based lidar data for plant height and canopy cover evaluation to aid biomass prediction. IGARSS 2018 - 2018 IEEE International Geoscience and Remote Sensing Symposium 3(3): 3242-3245.
Rosnell, T. & Honkavaara, E. (2012). Point cloud generation from aerial image data acquired by a quadrocopter type micro unmanned aerial vehicle and a digital still camera. Sensors 12(1): 453-480.
Schmidt, P., Hartung, J., Bennewitz, J. & Piepho, H. P. (2019). Heritability in plant breeding on a genotype-difference basis. Genetics 212(4): 991-1008.
Singh, V., Singh, D., Singh, N. & Kumar, S. (2005). Genetic analysis of wheat varieties for yield and its components. Agric 25(2): 145 - 146.
Sofonia, J., Shendryk, Y., Phinn, S., Roelfsema, C., Kendoul, F. & Skocaj, D. (2019). Monitoring sugarcane growth response to varying nitrogen application rates: A comparison of uav slam lidar and photogrammetry. International Journal of Applied Earth Observation and Geoinformation 82: 101878-101893.
Sone, C., Saito, K. & Futakuchi, K. (2009). Comparison of three methods for estimating leaf area index of upland rice cultivars. Crop Science 49(4): 1438-1443.
Song, P., Wang, J., Guo, X., Yang, W. & Zhao, C. (2021). High-throughput phenotyping: Breaking through the bottleneck in future crop breeding. The Crop Journal 9(3): 633-645.
Su, W., Zhang, M., Bian, D., Liu, Z., Huang, J., Wang, W., Wu, J. & Guo, H. (2019). Phenotyping of corn plants using unmanned aerial vehicle (uav) images. Remote Sensing 11(17): 2021-2140.
Su, Y., Guo, Q., Jin, S., Guan, H., Sun, X., Ma, Q., Hu, T., Wang, R. & Li, Y. (2021). The development and evaluation of a backpack lidar system for accurate and efficient forest inventory. IEEE Geoscience and Remote Sensing Letters 18(9): 1660-1664.
Sun, S., Li, C. & Paterson, A. (2017). In-field high-throughput phenotyping of cotton plant height using lidar. Remote Sensing 9(4): 377-398.
Sun, S., Li, C., Paterson, A. H., Jiang, Y., Xu, R., Robertson, J. S., Snider, J. L. & Chee, P. W. (2018). In-field high throughput phenotyping and cotton plant growth analysis using lidar. Front Plant Sci 9(1): 16-33.
Sun, Z., Li, Q., Jin, S., Song, Y., Xu, S., Wang, X., Cai, J., Zhou, Q., Ge, Y., Zhang, R., Zang, J. & Jiang, D. (2022). Simultaneous prediction of wheat yield and grain protein content using multitask deep learning from time-series proximal sensing. Plant Phenomics 2022(3): 1-13.
Tao, H., Xu, S., Tian, Y., Li, Z., Ge, Y., Zhang, J., Wang, Y., Zhou, G., Deng, X., Zhang, Z., Ding, Y., Jiang, D., Guo, Q. & Jin, S. (2022). Proximal and remote sensing in plant phenomics: 20 years of progress, challenges, and perspectives. Plant Communications 3: 100344-100383.
Tilly, N., Hoffmeister, D., Cao, Q., Huang, S., Lenz-Wiedmann, V., Miao, Y. & Bareth, G. (2014a). Multitemporal crop surface models: Accurate plant height measurement and biomass estimation with terrestrial laser scanning in paddy rice. Journal of Applied Remote Sensing 8: 083671-0836693.
Tilly, N., Hoffmeister, D., Schiedung, H., Hütt, C., Brands, J. & Bareth, G. (2014b). Terrestrial laser scanning for plant height measurement and biomass estimation of maize. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences XL-7: 181-187.
Visscher, P. M., Hill, W. G. & Wray, N. R. (2008). Heritability in the genomics era--concepts and misconceptions. Nat Rev Genet 9(4): 255-266.
Volpato, L., Pinto, F., Gonzalez-Perez, L., Thompson, I. G., Borem, A., Reynolds, M., Gerard, B., Molero, G. & Rodrigues, F. A., Jr. (2021). High throughput field phenotyping for plant height using uav-based rgb imagery in wheat breeding lines: Feasibility and validation. Front Plant Sci 12: 591587.
Walter, J. D. C., Edwards, J., McDonald, G. & Kuchel, H. (2019). Estimating biomass and canopy height with lidar for field crop breeding. Front Plant Sci 10: 1145-1161.
Wang, X., Singh, D., Marla, S., Morris, G. & Poland, J. (2018). Field-based high-throughput phenotyping of plant height in sorghum using different sensing technologies. Plant Methods 14: 53-69.
Wang, Y., Lehtomäki, M., Liang, X., Pyörälä, J., Kukko, A., Jaakkola, A., Liu, J., Feng, Z., Chen, R. & Hyyppä, J. (2019). Is field-measured tree height as reliable as believed – a comparison study of tree height estimates from field measurement, airborne laser scanning and terrestrial laser scanning in a boreal forest. ISPRS Journal of Photogrammetry and Remote Sensing 147: 132-145.
Wang, Y., Yang, X.-D., Ali, A., Lv, G.-H., Long, Y.-X., Wang, Y.-Y., Ma, Y.-G. & Xu, C.-C. (2020). Flowering phenology shifts in response to functional traits, growth form, and phylogeny of woody species in a desert area. Frontiers in Plant Science 11: 536-547.
Xiao, Q., Bai, X., Zhang, C. & He, Y. (2022). Advanced high-throughput plant phenotyping techniques for genome-wide association studies: A review. J Adv Res 35: 215-230.
Yrttimaa, T., Saarinen, N., Kankare, V., Liang, X., Hyyppä, J., Holopainen, M. & Vastaranta, M. (2019). Investigating the feasibility of multi-scan terrestrial laser scanning to characterize tree communities in southern boreal forests. Remote Sensing 11(12): 1423-1445.
Zhang, C., Craine, W. A., McGee, R. J., Vandemark, G. J., Davis, J. B., Brown, J., Hulbert, S. H. & Sankaran, S. (2021a). High‐throughput phenotyping of canopy height in cool‐season crops using sensing techniques. Agronomy Journal 113(4): 3269-3280.
Zhang, F., Hassanzadeh, A., Kikkert, J., Pethybridge, S. J. & van Aardt, J. (2021b). Comparison of uas-based structure-from-motion and lidar for structural characterization of short broadacre crops. Remote Sensing 13(19): 3975-3996.
Zhou, L., Gu, X., Cheng, S., Yang, G., Shu, M. & Sun, Q. (2020). Analysis of plant height changes of lodged maize using uav-lidar data. Agriculture 10(5): 146-160.
Zhu, Y., Sun, G., Ding, G., Zhou, J., Wen, M., Jin, S., Zhao, Q., Colmer, J., Ding, Y., Ober, E. S. & Zhou, J. (2021). Large-scale field phenotyping using backpack lidar and cropquant-3d to measure structural variation in wheat. Plant Physiol 187(2): 716-738.

No competing interests reported.

Supplementaryfiles.docx

Download PDF

Journal Publication

published 02 Apr, 2023

Read the published version in Plant Methods →

Editorial decision: Major revision
16 Feb, 2023
Reviews received at journal
05 Feb, 2023
Reviewers agreed at journal
15 Jan, 2023
Reviewers invited by journal
15 Jan, 2023
Editor assigned by journal
05 Jan, 2023
Submission checks completed at journal
05 Jan, 2023
First submitted to journal
31 Dec, 2022

You are reading this latest preprint version

Field-measured canopy height may not be as accurate and heritable as believed – Evidence from advanced 3D sensing

Status:

Journal Publication

Version 1

Abstract

Figures

Highlights

1. Introduction

2. Materials And Data Collection

2.1 Study area and experimental design

2.2 Data collection

2.2.1 TLS data

2.2.2 BLS data

2.2.3 GLS data

2.2.4 DAP data

2.2.5 Field measurements

3. Methods

3.1 Data preprocessing

3.2 Canopy height extraction

3.3 Cross-comparisons of canopy height estimates from field measurement and 3D sensing

3.4 Error source analysis

4. Results

4.1 Canopy height from different 3D sensing datasets

4.2 Comparing canopy height measurement of different methods among different canopy height groups

4.3 Comparing canopy height measurement of different methods among different LAI groups

4.4 Comparing canopy height measurement of different methods among different GS groups

4.5 Comparing the broad sense heritability of canopy height measurement from different methods

5. Discussions

5.1 Height quantities of 3D point cloud affect the best estimates of canopy height

5.2 CH estimation under various height groups, LAI groups, and GS groups

5.3 Outlier analysis of different datasets

5.4 Field-measured canopy height may not be as accurate as believed

5.5 Contributions and implications

6. Conclusion

Declarations

References

Additional Declarations

Supplementary Files

Status:

Journal Publication

Version 1