Wheat is one of the most important food crops in temperate countries, widely used as livestock feed and staple food (Tadesse et al., 2019). With the development of crop science and breeding technology, an increasing number of wheat varieties have flooded the market. At the same time, the purity of wheat kernels has also attracted great attention from breeders, planters, and consumer (Koistinen and Hanhineva, 2017). However, wheat kernels of different varieties have an almost consistent visual appearance, which makes it difficult to distinguish with the naked eye. Traditional methods used to identify wheat varieties, such as high-performance liquid chromatography (HPLC) and gas chromatography-mass spectrometer (GC-MS) (Qiu et al., 2018), usually consume a huge amount of time and manpower, meanwhile causing damage to the tested samples. A series of recent advances in machine learning, spectroscopy, and computer graphics have led to a variety of rapid and non-destructive methods for the identification of crop seed varieties.
Hyperspectral imaging is an emerging technology that can fuse visible/near-infrared (VNIR) spectra into an image. The hyperspectral image is defined as a data cube with a shape of w×h×d where the w×h is the width and height of the image that contains the information in spatial dimensions and d is the bands of the image that contains the information in spectral dimensions. It can be regarded as a spectrum with an image on each band or an image with a spectrum on each pixel. Due to the above characteristics of the hyperspectral image, many rapid and nondestructive methods for the identification of crop seed varieties based on machine learning have been proposed (Ge et al., 2019; Xu et al., 2021). Huang et al. (2016) reported a 92.19% prediction accuracy in classifying 17 varieties of maize seed using the morphological, texture, and spectrum features of the hyperspectral image combined with the feature transformation/reduction by developing a least squares support vector machines model. Yang et al. (2015) extracted five morphological features and eight texture features to develop two classification models of four waxy corn seed varieties. The classification accuracies were 77.4%-98.2%. Wang et al. (2015) used the chalkiness degree and shape feature of rice and obtained 88.09%-94.45% accuracy in classifying three rice varieties with a backpropagation neural network classifier. However, all of these studies are at the core of the traditional machine learning algorithms built on a series of features extracted from the hyperspectral images. The process of extracting features is complicated, and the results of the research will depend on the method of extracting features.
Deep learning (DL) is one of the most recent emerging research focuses of data analysis and has been successfully applied to hyperspectral image processing (Li and He, 2020; Vaddi and Manoharan, 2020). DL adopts a neuron structure similar to the biological brain to learn the deep features of the data automatically, which is different from traditional machine learning. This neuron structure can extract various features of the original data layer by layer, and gradually convert them into some high-level abstract features for mining the deep information of the data (Zhao and Du, 2016). The convolutional neural network (CNN) is one of the important networks in deep learning, which has been applied to the identification of crop seed varieties. Yu et al. (2021) introduced CNN to identify hybrid okra seeds and indicated the effectiveness of CNN with a 93.79% prediction accuracy in classifying 17 varieties of hybrid okra seeds where CNN had the highest accuracy by comparison with extreme learning machine (ELM) and stacked sparse auto-encoder (SSAE). Singh et al. (2021) employed the near-infrared hyperspectral imaging coupled with the CNN to identify the barley seeds variety and demonstrated that the CNN model with a 98% prediction accuracy in classifying 35 Indian barley varieties has higher accuracy than the traditional machine learning. Chatnuntawech et al. (2018) showed better performance of CNN with 11.9% absolute improvement in the mean classification accuracy compared with the support vector machine (SVM) for identifying rice seed varieties. These satisfactory results show that this CNN classification model, which can automatically extract features, is expert in processing hyperspectral images and capable of classifying crop seeds. Unlike the traditional method, the high-dimensional hyperspectral data cube can be processed by CNN directly without feature extraction.
Although the high-dimensional hyperspectral data cube is beneficial to the application of remote sensing and crop seed recognition, there are still some problems. The hyperspectral images are usually highly redundant in the spectral dimensions (Zhang et al., 2012), which will lead to an increase in computational complexity, storage space, and communication bandwidth (Sui et al., 2014). This redundancy will not only lengthen the training and inference time of CNN which would have taken plenty of time, but also cause the CNN classification model to fail to focus on the most valuable information of hyperspectral images the first time. Thus, prior knowledge is needed for constraining the hyperspectral images in the spectral dimensions. In this study, the feature wavelength interval selection is introduced as prior knowledge of spectroscopy to constrain the spectral dimension of the hyperspectral images. The interval continuum removal (iCR) (Itoh and Iwasaki, 2013), backward interval partial least squares (BiPLS) (Huang et al., 2012), and interval random frog (iRF) (Yun et al., 2013) are used to extract the feature wavelength interval. The different wavelength intervals represent the characteristics of different components in wheat kernels. When constructing the CNN model, a grouped convolution (GC) in the first layer can independently extract the features of different information intervals, it is better to effectively characterize the hyperspectral images. Then the features are merged in the next convolutional layer and full connect layer. In order to verify the general applicability of the proposed method, experiments are conducted using eight varieties of wheat seed with 1000 samples from each variety, and the modeling performance between the proposed CNN and the traditional CNN was compared. The specific objectives of this study are to: (1) propose a hybrid approach combined wavelength intervals and grouped convolution for effectively extracting hyperspectral features; (2) develop the appropriate lightweight CNN of hyperspectral images for wheat kernels classification;
The remainder of this paper is organized as follows. Section 2 presents the experimental data collection, the hyperspectral imaging system, and image preprocessing. Section 3 describes the wavelength interval selection method briefly, then introduces the proposed CNN models. Section 4 discusses the results and performance analysis. Finally, Section 5 summarizes the conclusions.