Continuous crop monitoring is essential for making informed crop management and yield optimization decisions. Leaf Area Index (LAI) is a crucial biophysical variable used for crop monitoring, and farmers and researchers can track crop growth and health and detect stress to make informed decisions about crop management practices by continuously monitoring LAI. The Global Climate Observing System listed LAI as an Essential Climate Variable and a key variable for models studying vegetation-atmosphere interactions (Baret et al. 2013, GCOS 2024). Information-based management can help to improve crop yield and quality, lower management costs, and increase the sustainability of agricultural production. Remote sensing data comes in handy in estimating LAI as these data are available at repetitive time scales covering large areas on the ground (Filgueiras et al., 2019). Estimating biophysical variables is a crucial aspect of remote sensing applications, as it allows for non-destructive estimation of these variables across extensive areas (Mourad et al., 2020). LAI is a dimensionless canopy structure parameter and is described as an area of one side leaf per unit ground area and is a key biophysical variable in agricultural and environmental studies (Jonckheere et al., 2004). It provides information on the amount of light intercepted by the crop canopy (Ganguly et al., 2012; Cui et al., 2018). LAI can be measured most accurately using a direct method, which is destructive sampling of leaves and using field instruments. However, these measurement methods have their limitations as they are labour-intensive, non-economical and time-consuming (Raj et al., 2021). Many studies have estimated LAI using satellite remote sensing data with reasonable accuracies (Tripathi et al., 2013; Djamai et al., 2018; Xie et al., 2019; Mourad et al., 2020; Filipponi, 2021; Kang et al., 2021; Sun et al., 2021). Estimated LAI is an important biophysical variable for yield estimation and can be assimilated in the crop simulation model using data assimilation methods (Dente et al., 2008; Huang et al., 2019).
Estimation of LAI from optical remote sensing data can be categorized into two methods: (1) empirical relationships between satellite-derived LAI (vegetation index) and ground-measured LAI and (2) physical model-based inversion. Empirical methods are simple and computationally easier as compared to the model-based approaches and provide an acceptable level of accuracy in LAI estimation (Atzberger, 2004; Cui et al., 2018; Pasqualotto et al., 2019). One drawback of empirical methods is their limited applicability to local scales, as the established relationships are specific to particular locations. Additionally, this approach necessitates multiple calibrations with ground-based observational data (Sun et al., 2022). The physical model-based approach uses the canopy radiative transfer model (RTM), and the most preferred RTM method is PROSAIL (Jacquemoud et al., 2009). Based on interactions between radiation, canopy components, and soil surface, RTM inversion is carried out utilising reflectance and auxiliary variables. The RTM inversion is achieved using a Look-up Table (LUT) and machine learning approaches. This method has shown strong potential for biophysical variables estimation (Verrelst et al., 2015; Xie et al., 2019). The physical model-based inversion method is employed to generate global-scale Leaf Area Index products from Moderate Resolution Imaging Spectroradiometer (MODIS) data, albeit at a coarse spatial and temporal resolution.
MODIS satellites, launched in the years 1999 (terra) and 2002 (aqua), provide NDVI at a temporal resolution of 16 days and spatial resolution of 250m. The MODIS data are widely used in vegetation dynamics and monitoring studies at global and at regional levels scale such as the mountain grassland leaf area index is (estimated using the inversion radiative transfer model with decent RMSE 1.62 (m2/m2) (e.g., Beck et al., 2006; Ren et al., 2008; Mkhabela et al., 2011; Pasolli et al., 2015; Dubey et al., 2018; Prasad et al., 2021). However, these products are of limited use in areas where agricultural land holding are very small. The earth observation satellites such as Landsat and Sentinel which have a spatial resolution in the range of 10–60 m can be used in area of small land holdings. The Landsat program by the United States Geological Survey (USGS) was launched in 1972 and provides time series satellite images at spatial resolution of 30 m and a revisit period of 16 days. By employing statistical and radiative transfer model (RTM) inversion techniques, Landsat-8 imagery is able to provide precise estimates of specific leaf area (SLA) at both regional and global scales (Ganguly et al., 2012, Ali et al., 2017) The LAI model inversion approach using multitemporal optical data from Landsat effectively derives leaf area LAI for various crop types, showing consistent seasonal variations with crop phenological stages (Gonzalez-Sanpedro et al., 2008). An improvement in the spatial and temporal resolution is provided by the European Space Agency (ESA) Sentinel satellites, launched in 2015. A constellation of two satellites Sentinel 2A and Sentinel 2B provides imagery at 10m to 60 m spatial resolution at 5-day revisit time between the two satellites. These satellite sensors data have been used in different studies for vegetation analysis and estimation of crop biophysical parameters such as leaf area index, canopy chlorophyll content (Zheng et al., 2015; Onojeghuo et al., 2018; Kowalski et al., 2020; Nihar et al., 2022). The presence of Red edge bands makes Sentinel 2 data of particular interest for LAI retrieval from winter wheat (Xie et al., 2019). Sentinel-2 Band-8A-Narrow Near InfraRed is more accurate for Leaf Area Index estimation in cotton, tomato, and wheat, while Band-9 (Water vapor) shows a high correlation with LAI, facilitating more accurate agricultural monitoring than traditional Vegetation Indices.
The traditionally used inversion techniques involve a minimization of a cost function which requires excessive computation time to achieve the high retrieval accuracy (Kimes et. al, 2009, Wang et al., 2017). Machine learning algorithms for LAI retrieval can improve prediction accuracies and spatial consistencies (Houborg et al. 2018).Given the potential application of satellite-estimated LAI for crop management, this study aims at validating satellite-estimated LAI for spring wheat crops. This study proposes a framework for retrieving LAI from Landsat-8 and Sentinel-2 satellite data using three machine learning methods i.e., Random Forest (RF), Support Vector Machine (SVM) and XGBoost. In addition, the field-observed LAI is compared with the satellite-derived ones.