4.1 Image background segmentation
Before feature extraction (FE) of the 200 images, it is necessary to segment the image backgrounds and retain only the main content, because the photographed images contain many background factors such as walls and lights. The background of each rose image is segmented using the method shown in Fig. 8.
The background of each image is segmented with the removebg algorithm and the GrabCut algorithm. A rose image after background segmentation is shown in Fig. 9.
The 3 plots in Fig. 10 are the pixel histograms of the original image, the background-removed image and the green-leaf-removed image, respectively. The pixel values of the original image are evenly distributed between 0 and 255, so during feature extraction the background pixel values cannot be distinguished from those of the main subject. In the pixel histograms of the rose image with the background and green leaves removed, most pixels are black and clearly separated from the remaining pixel values. Therefore, in FE, the image after background segmentation better expresses the main content of the image.
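The histogram comparison above can be reproduced with a few lines of numpy; the tiny 2x3 "image" below is an illustrative stand-in, not one of the paper's rose images:

```python
import numpy as np

def pixel_histogram(gray_image):
    """Count how many pixels take each value 0-255.

    gray_image: 2-D numpy array of uint8 pixel values.
    Returns a length-256 array of counts.
    """
    counts, _ = np.histogram(gray_image, bins=256, range=(0, 256))
    return counts

# Toy 2x3 "image": after background segmentation most pixels are black (0),
# which is why the black bin dominates the histograms in Fig. 10.
segmented = np.array([[0, 0, 200],
                      [0, 180, 0]], dtype=np.uint8)
hist = pixel_histogram(segmented)
print(hist[0])   # → 4, the number of black background pixels
```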
4.2 Image feature extraction
After background segmentation removes the irrelevant factors from the rose images, this paper extracts features from 3 aspects: color features, geometric features and texture features.
4.2.1 Color feature extraction
In image processing, the RGB model is the most common and important color space model. This paper extracts the color features of yellow roses based on the RGB model of the color image. A color image is composed of 3 color channels: red, green and blue; any color is formed by superimposing and mixing the 3 color components, and if the proportions of the 3 components differ, the resulting color differs. Each of the red, green and blue channels ranges from 0 to 255, so each channel has 256 gray levels, and the R, G and B components are highly correlated. The proportions of the 3 components cannot be judged directly by the human eye; for reference, (255, 0, 0) is red, (0, 255, 0) is green, (0, 0, 255) is blue, (255, 255, 255) is white, and (0, 0, 0) is black.
In the RGB color model, the red and green components account for a large proportion of yellow and carry most of its color information, so differences between yellow roses are most significant in these channels. Therefore, the mean and variance of the red channel and of the green channel of each rose image are selected as color features. To compute these statistics accurately, Python's PIL library is used to separate the red channel image and the green channel image from the color image; the pixel values are extracted into a list, the list is traversed, black background pixels (0, 0, 0) are assigned NaN, and finally the mean and variance statistics are calculated.
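A minimal sketch of this computation, using numpy's NaN-aware statistics in place of PIL's per-pixel lists (the toy 2x2 image is illustrative only, not the paper's data):

```python
import numpy as np

def channel_stats(rgb):
    """Mean and variance statistics of the red and green channels,
    ignoring black (0, 0, 0) background pixels, as in Section 4.2.1.

    rgb: H x W x 3 array of pixel values in [0, 255].
    Returns (red_mean, red_std, green_mean, green_std).
    """
    rgb = rgb.astype(float)
    background = np.all(rgb == 0, axis=-1)   # mask of black segmented pixels
    red, green = rgb[..., 0].copy(), rgb[..., 1].copy()
    red[background] = np.nan                 # background treated as NaN
    green[background] = np.nan
    return (np.nanmean(red), np.nanstd(red),
            np.nanmean(green), np.nanstd(green))

# Toy 2x2 image: two background pixels, two yellowish foreground pixels.
img = np.array([[[0, 0, 0], [200, 180, 20]],
                [[240, 220, 30], [0, 0, 0]]])
red_mean, red_std, green_mean, green_std = channel_stats(img)
print(red_mean)   # → 220.0, the mean of the two foreground red values
```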
The box plots in Fig. 15 show the basic information of each statistic for the 5 grades of yellow roses, with the red line marking the sample mean of each grade; red_mean and red_std are the mean and variance of the red channel, and green_mean and green_std are the mean and variance of the green channel. Clearly, when yellow roses are classified by color features, there is no significant difference in the means among the first 4 grades, so a neural network cannot accurately distinguish grades 1 to 4; however, there are obvious differences between the first 4 grades and the fifth grade, so the extracted color feature data still yield a high recognition rate when sent to the neural network.
4.2.2 Geometric feature extraction
Observing the flower and stem images of the yellow roses, the area of the main part of the image (flowers, stems and green leaves), the length of the stem and the detected edges directly reflect the geometric characteristics of the image. Therefore, this paper examines the geometric features from 3 aspects: area statistics, length statistics and edge-detection statistics.
In terms of area, for convenience of calculation all images are assumed to have unit area, and the proportion of the flower-and-leaf area and the proportion of the stem-and-leaf area are extracted from the background-segmented images that retain leaves. The specific algorithm traverses all images with the PIL library, stores the RGB pixel values of each image as a list and computes the list length; a for loop then visits channels R, G and B in turn, accumulates the number of elements whose 3 channel values are not all zero, and finally computes the proportion of this count in the list length, which gives the desired area statistic. In terms of length, the stem length was manually measured when the image data were collected and stored in Excel to generate the stem-length feature data. For edge detection, the PIL library is again used, applying the parameter ImageFilter.FIND_EDGES to each image.
The edge perimeter of each rose is then calculated by roughly the same procedure as the area statistic, after which all the geometric feature data are obtained.
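The area statistic described above can be sketched as a plain loop over the pixel list (the pixel tuples below are illustrative); applying the same count to the output of ImageFilter.FIND_EDGES yields the edge-perimeter statistic:

```python
def area_proportion(pixels):
    """Fraction of pixels belonging to the subject (not pure black).

    pixels: list of (R, G, B) tuples, as produced by PIL's
    Image.getdata() on a background-segmented image.
    """
    foreground = sum(1 for (r, g, b) in pixels if (r, g, b) != (0, 0, 0))
    return foreground / len(pixels)

# Toy 2x2 image flattened to a pixel list: 3 foreground, 1 background pixel.
pixels = [(200, 180, 20), (0, 0, 0), (240, 220, 30), (150, 160, 10)]
print(area_proportion(pixels))   # → 0.75
```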
Figure 17 shows the box plots of the geometric features of the yellow rose data, where rose_area is the proportion of the flower-and-leaf area, rose_pole_area is the proportion of the stem-and-leaf area, rose_pole_length is the stem length, and rose_edge is the edge-detection statistic of flowers and leaves. When yellow roses are classified by geometric features, the red lines for flower area, stem length and edge detection show a downward trend from the first to the fourth grade, indicating significant differences among the roses of these grades. Therefore, when the extracted geometric feature data are sent to the neural network, a high recognition rate is obtained and roses of different grades can be classified more accurately.
4.2.3 Texture feature extraction
In this paper, the texture features of yellow roses are extracted based on the gray-level co-occurrence matrix (GLCM). The GLCM measures the joint distribution of pixel values in a gray-scale image: if the pixel values change only within a small threshold, i.e. the variance of the gray values is small, the texture of the image changes slowly; otherwise, the texture changes greatly. In this paper, the GLCM is computed with the function greycomatrix() from the skimage.feature library in Python, and the statistics of energy, contrast, dissimilarity and homogeneity are computed with the function greycoprops() to extract the texture features of the rose flowers.
(1) \({\text{Energy}}=\sqrt{\sum\limits_{i}\sum\limits_{j} p(i,j)^{2}}\)

(2) \({\text{Contrast}}=\sum\limits_{i}\sum\limits_{j} (i-j)^{2}\,p(i,j)\)

(3) \({\text{Dissimilarity}}=\sum\limits_{i}\sum\limits_{j} |i-j|\,p(i,j)\)

(4) \({\text{Homogeneity}}=\sum\limits_{i}\sum\limits_{j} \frac{p(i,j)}{1+(i-j)^{2}}\)
Here p(i, j) denotes the element in row i and column j of the normalized GLCM. Energy measures the stability of the image texture. Contrast and dissimilarity both measure the variation of image brightness: the former weights the matrix elements quadratically in (i - j), while the latter weights them linearly. Homogeneity weights the matrix elements by the inverse of 1 + (i - j)^2, measuring whether the texture changes evenly. Following these definitions, this paper first uses the parameter cv2.IMREAD_GRAYSCALE in the OpenCV library to batch-convert the rose images to gray-scale images; then each gray image is passed to greycomatrix() with 256 gray levels, distance 1 and a horizontal (rightward) scan, and the symmetric GLCM is output; finally, the GLCM is passed to greycoprops(), and the 4 feature scalars energy, contrast, dissimilarity and homogeneity are output as the texture feature data.
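To make formulas (1)-(4) concrete, a hand-rolled miniature of the symmetric, distance-1, horizontal GLCM is sketched below; skimage's greycomatrix()/greycoprops() perform this computation at 256 gray levels, while the 4-level toy image here keeps the matrix small:

```python
import numpy as np

def glcm_features(gray, levels=4):
    """Symmetric GLCM (distance 1, horizontal direction) and the four
    texture statistics of Eqs. (1)-(4). `levels` is kept small for the
    toy example; the paper uses 256 gray levels.
    """
    glcm = np.zeros((levels, levels))
    for row in gray:
        for a, b in zip(row[:-1], row[1:]):   # horizontal neighbour pairs
            glcm[a, b] += 1
            glcm[b, a] += 1                   # make the matrix symmetric
    p = glcm / glcm.sum()                     # normalise to probabilities
    i, j = np.indices(p.shape)
    return {
        "energy": np.sqrt(np.sum(p ** 2)),                 # Eq. (1)
        "contrast": np.sum((i - j) ** 2 * p),              # Eq. (2)
        "dissimilarity": np.sum(np.abs(i - j) * p),        # Eq. (3)
        "homogeneity": np.sum(p / (1 + (i - j) ** 2)),     # Eq. (4)
    }

# Tiny 2x3 gray image with 4 gray levels.
gray = np.array([[0, 0, 1],
                 [2, 2, 3]])
feats = glcm_features(gray)
print(feats["contrast"])   # → 0.5
```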
Figure 18 shows the box plots of the texture features of the yellow roses. When yellow roses are classified by texture features, the red lines in the energy and homogeneity plots rise from the first to the fourth grade and fall from the fourth to the fifth grade; in the contrast plot, the red line shows a uniform downward trend overall; in the dissimilarity plot, the red line falls from the first to the fourth grade and rises from the fourth to the fifth grade. Clearly, there are significant differences among the yellow roses from the first to the fifth grade, so the extracted texture feature data yield a high recognition rate when sent to the neural network.
4.3 Evaluation of characteristic variables based on factor analysis
In Section 4.2, 12 image features are extracted, and each feature extracted is assumed to correspond to a variable. The specific variable names are shown in Table 4.1.
Table 4.1
Feature information extracted from yellow rose
| Symbol | Variable | Symbol | Variable |
| red_mean | Mean value of red channel | rose_pole_length | Stem length |
| red_std | Variance of red channel | rose_edge | Edge detection of flowers and leaves |
| green_mean | Mean value of green channel | energy | Energy |
| green_std | Variance of green channel | contrast | Contrast |
| rose_area | Proportion of flower and leaf area | dissimilarity | Dissimilarity |
| rose_pole_area | Proportion of flower stem and leaf area | homogeneity | Homogeneity |
After the feature extraction, feature data containing the 12 variables are obtained for each yellow rose. In this paper, the 12 variables of the 1600 samples in the training set are evaluated by factor analysis. Before the factor analysis, the KMO test and Bartlett's sphericity test are conducted on the 12 variables; the test results are shown in Table 4.2.
Table 4.2
KMO and Bartlett sphericity test
| KMO | | 0.923 |
| Bartlett sphericity test | \({\chi ^2}\) statistic | 5601.237 |
| | Significance | 0.000 |
It can be seen from Table 4.2 that the KMO value is 0.923 and the p value of Bartlett's sphericity test is 0.000, indicating a strong correlation among the 12 variables, which are therefore suitable for factor analysis.
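The paper runs these tests in SPSS; the standard formulas can be sketched in numpy as follows (the synthetic correlated data are illustrative only, so the resulting statistics will not match Table 4.2):

```python
import numpy as np

def bartlett_sphericity(data):
    """Bartlett's test of sphericity: chi-square statistic and degrees
    of freedom for the hypothesis that the correlation matrix is the
    identity (i.e. the variables are uncorrelated)."""
    n, p = data.shape
    R = np.corrcoef(data, rowvar=False)
    chi2 = -(n - 1 - (2 * p + 5) / 6) * np.log(np.linalg.det(R))
    dof = p * (p - 1) / 2
    return chi2, dof

def kmo(data):
    """Kaiser-Meyer-Olkin measure of sampling adequacy."""
    R = np.corrcoef(data, rowvar=False)
    inv = np.linalg.inv(R)
    # Partial correlations obtained from the inverse correlation matrix.
    partial = -inv / np.sqrt(np.outer(np.diag(inv), np.diag(inv)))
    off = ~np.eye(R.shape[0], dtype=bool)   # off-diagonal mask
    r2 = np.sum(R[off] ** 2)
    return r2 / (r2 + np.sum(partial[off] ** 2))

# Illustrative data: 4 variables driven by one common factor.
rng = np.random.default_rng(0)
common = rng.normal(size=(200, 1))
data = common + 0.3 * rng.normal(size=(200, 4))
chi2, dof = bartlett_sphericity(data)
kmo_value = kmo(data)
```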
Factor analysis of the 12 variables of the yellow rose data with SPSS yields the total variance explanation table shown in Table 4.3. Generally, when the cumulative variance contribution rate of the factor analysis reaches 75.00%, the corresponding factors are considered to explain the original information well. The cumulative variance contribution rate of the first 3 factors reaches 78.602%, i.e. the first 3 factors explain 78.602% of the original information, showing that they can represent the information of the 12 variables well.
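The eigenvalues and cumulative variance contribution rates of Table 4.3 come from the correlation matrix of the standardized data; a numpy sketch follows (with illustrative random data, so the numbers will not match the table):

```python
import numpy as np

def variance_explained(data):
    """Eigenvalues of the correlation matrix (initial eigenvalues of the
    factor analysis) and the cumulative variance contribution rate in %."""
    R = np.corrcoef(data, rowvar=False)
    eigvals = np.sort(np.linalg.eigvalsh(R))[::-1]   # descending order
    contrib = 100 * eigvals / eigvals.sum()          # variance (%) per factor
    return eigvals, np.cumsum(contrib)

# Illustrative data set with 12 variables, standing in for the real features.
rng = np.random.default_rng(1)
data = rng.normal(size=(200, 12))
eigvals, cumulative = variance_explained(data)
# Retention rule used in the paper: keep factors until cumulative >= 75%.
n_factors = int(np.searchsorted(cumulative, 75.0) + 1)
```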
Table 4.3
Interpretation of total variance
| Factor | Initial eigenvalue | | | Extract the sum of squares of loads | | |
| | Total | Variance (%) | Cumulative (%) | Total | Variance (%) | Cumulative (%) |
| 1 | 10.588 | 43.014 | 43.014 | 10.588 | 43.014 | 43.014 |
| 2 | 5.674 | 23.051 | 66.065 | 5.674 | 23.051 | 66.065 |
| 3 | 3.086 | 12.537 | 78.602 | 3.086 | 12.537 | 78.602 |
| 4 | 1.063 | 4.319 | 82.921 | — | — | — |
| 5 | 0.937 | 3.807 | 86.728 | — | — | — |
| 6 | 0.728 | 2.958 | 89.685 | — | — | — |
| 7 | 0.614 | 2.494 | 92.180 | — | — | — |
| 8 | 0.560 | 2.275 | 94.455 | — | — | — |
| 9 | 0.363 | 1.475 | 95.929 | — | — | — |
| 10 | 0.355 | 1.442 | 97.372 | — | — | — |
| 11 | 0.335 | 1.361 | 98.732 | — | — | — |
| 12 | 0.312 | 1.268 | 100.000 | — | — | — |

Note: — means no data.
The coefficient matrix and score coefficient matrix of the first 3 factors and 12 variables are shown in Table 4.4.
Table 4.4
Coefficient matrix and score coefficient matrix of the first 3 factors
| Variable | Factor1 | | Factor2 | | Factor3 | |
| | Coefficient | Score coefficient | Coefficient | Score coefficient | Coefficient | Score coefficient |
| red_mean | 0.536 | 0.078 | 0.317 | 0.034 | 0.593 | 0.079 |
| red_std | 0.307 | 0.045 | 0.227 | 0.026 | 0.675 | 0.086 |
| green_mean | 0.580 | 0.085 | 0.402 | 0.039 | 0.710 | 0.092 |
| green_std | 0.428 | 0.063 | 0.201 | 0.024 | -0.607 | -0.078 |
| rose_area | 0.920 | 0.135 | 0.491 | 0.069 | 0.117 | 0.020 |
| rose_pole_area | 0.813 | 0.119 | 0.249 | 0.030 | 0.104 | 0.018 |
| rose_pole_length | 0.761 | 0.111 | -0.227 | -0.027 | 0.215 | 0.025 |
| rose_edge | 0.920 | 0.135 | 0.394 | 0.041 | 0.100 | 0.015 |
| energy | -0.919 | -0.134 | -0.095 | -0.010 | 0.294 | 0.035 |
| contrast | 0.746 | 0.109 | 0.139 | 0.022 | 0.381 | 0.037 |
| dissimilarity | 0.875 | 0.128 | 0.128 | 0.021 | -0.229 | -0.028 |
| homogeneity | -0.913 | -0.134 | -0.073 | -0.008 | 0.383 | 0.039 |
If the 3 selected factors are denoted F1, F2 and F3, each factor is a linear combination of the 12 standardized variables, with weights given by the score coefficients in Table 4.4.
It can be seen from Table 4.4 that the information of factor 1 mainly comes from the variables energy, contrast, dissimilarity and homogeneity, so it is called the image texture feature factor; the information of factor 2 mainly comes from rose_area, rose_pole_area, rose_pole_length and rose_edge, so it is called the image geometric feature factor; the information of factor 3 mainly comes from red_mean, red_std, green_mean and green_std, so it is called the image color feature factor. Clearly, after evaluating the 12 variables by factor analysis, the absolute values of the component coefficients and score coefficients of the texture and geometric features are greater than those of the color features, i.e. the contribution rates of the texture and geometric features are greater than that of the color features. This result is basically consistent with the analysis in Section 4.2.
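Since each factor score is a linear combination of the 12 standardized variables, the mapping from feature data to the 3 factors can be sketched with the score coefficients of Table 4.4 (the random input below is illustrative, not the paper's feature data):

```python
import numpy as np

# Score coefficients of the 3 factors (rows: the 12 variables in the
# order of Table 4.4; columns: factor1, factor2, factor3).
W = np.array([
    [ 0.078,  0.034,  0.079],   # red_mean
    [ 0.045,  0.026,  0.086],   # red_std
    [ 0.085,  0.039,  0.092],   # green_mean
    [ 0.063,  0.024, -0.078],   # green_std
    [ 0.135,  0.069,  0.020],   # rose_area
    [ 0.119,  0.030,  0.018],   # rose_pole_area
    [ 0.111, -0.027,  0.025],   # rose_pole_length
    [ 0.135,  0.041,  0.015],   # rose_edge
    [-0.134, -0.010,  0.035],   # energy
    [ 0.109,  0.022,  0.037],   # contrast
    [ 0.128,  0.021, -0.028],   # dissimilarity
    [-0.134, -0.008,  0.039],   # homogeneity
])

def factor_scores(X):
    """Map feature data (n_samples x 12) to the 3 factor scores F = Z W,
    where Z is the column-standardized feature matrix."""
    Z = (X - X.mean(axis=0)) / X.std(axis=0)
    return Z @ W

# Illustrative input standing in for the extracted feature data.
rng = np.random.default_rng(2)
F = factor_scores(rng.normal(size=(50, 12)))
```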
4.4 Rose classification algorithm based on FE and ANN
In order to solve the over-fitting problem in Chap. 3 and improve the recognition rate, this paper sends the 3-factor data of the 1200 pictures in the training set, obtained in Section 4.3, into an ANN for recognition and classification. The flow chart of the rose classification algorithm based on FE and ANN, designed with TensorFlow 2.0, is shown in Fig. 19.
The ANN based on TensorFlow 2.0 is a 3-layer network: the first layer has 3 neurons, the second layer 10 and the third layer 5. The relu activation function is used, and the softmax classifier produces the output. In the compile() configuration of the training method, the adam optimizer, the sparse_categorical_crossentropy loss function and the sparse_categorical_accuracy evaluation metric are selected. In the built ANN model, 20 feature samples are fed in per batch, and the model is trained for 20, 40, 60, 80, 100 and 120 iterations respectively. The parameter statistics are shown in Table 4.5.
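The paper builds this network with tf.keras; the forward pass of the same 3-10-5 architecture can be sketched in plain numpy (reading the 3-neuron first layer as the input of the 3 factor scores; the weights below are random placeholders, not the trained parameters):

```python
import numpy as np

rng = np.random.default_rng(0)

# Weights of a 3 -> 10 -> 5 fully connected network. In the paper these
# are learned with the adam optimizer in TensorFlow 2.0; here they are
# random, purely to illustrate the data flow.
W1, b1 = rng.normal(size=(3, 10)), np.zeros(10)
W2, b2 = rng.normal(size=(10, 5)), np.zeros(5)

def relu(x):
    return np.maximum(x, 0)

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))   # numerically stable
    return e / e.sum(axis=-1, keepdims=True)

def forward(factors):
    """factors: batch of 3 factor scores -> probabilities over 5 grades."""
    h = relu(factors @ W1 + b1)          # hidden layer, relu activation
    return softmax(h @ W2 + b2)          # softmax classifier output

# One batch of 20 samples, matching the paper's batch size.
probs = forward(rng.normal(size=(20, 3)))
```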
Table 4.5
Parameter statistics of rose classification algorithm based on FE and ANN
| Layer | Output shape | Params |
| The first layer | multiple | 520 |
| The second layer | multiple | 820 |
| The third layer | multiple | 105 |
| Total params | | 1445 |
The recognition rates on the training and test sets are shown in Fig. 20. It can be seen that for 20, 40, 60, 80, 100 and 120 iterations, the recognition rate of the rose classification algorithm based on FE and ANN increases with the number of iterations, and the recognition rate on both the training and test sets exceeds 90%; the test-set recognition rate is very close to that of the training set.
To further improve the classification recognition rate, experiments show that choosing an appropriate number of iterations is particularly important for the rose classification algorithm based on FE and ANN. With 160 iterations, the highest classification recognition rates on the training and test sets are obtained (as shown in Fig. 21). The first plot in Fig. 21 shows the loss on the training and test sets under the algorithm, and the second shows the recognition rates. For both the loss and the recognition rate, the results of the training and test sets are very close. The recognition rate on the training set is mostly above 95%, with a maximum of 97.81%, and on the test set mostly above 94%, with a maximum of 96.61%, a significant improvement over the classification algorithms based on ANN and CNN.
4.5 Comparison of 3 algorithms
In this paper, the rose classification algorithm based on ANN is named algorithm 1, the one based on CNN is named algorithm 2, and the one based on FE and ANN is named algorithm 3. Training and recognition are carried out 100 times under each algorithm; runs in which the recognition rate has stabilized are selected, and the average recognition rate of each training run and the final average recognition rate over the 100 runs are obtained by taking means.
Under algorithms 1, 2 and 3, the rose images are trained and recognized 100 times each, and the average recognition rates of the 3 algorithms for each run are obtained, as shown in Table 4.6 (only the recognition rates of the first 15 training runs are shown).
Table 4.6
Recognition rate of 3 algorithms
| Training times | Average recognition rate of training set | | | Average recognition rate of test set | | |
| | Algorithm 1 | Algorithm 2 | Algorithm 3 | Algorithm 1 | Algorithm 2 | Algorithm 3 |
| 1 | 0.8553 | 0.9471 | 0.9883 | 0.6556 | 0.8594 | 0.9479 |
| 2 | 0.8616 | 0.9402 | 0.9719 | 0.6973 | 0.8473 | 0.9470 |
| 3 | 0.8607 | 0.9158 | 0.9730 | 0.7068 | 0.8183 | 0.9487 |
| 4 | 0.8543 | 0.9473 | 0.9673 | 0.6678 | 0.8592 | 0.9475 |
| 5 | 0.8636 | 0.9198 | 0.9792 | 0.7057 | 0.8092 | 0.9614 |
| 6 | 0.8580 | 0.9507 | 0.9686 | 0.6490 | 0.8815 | 0.9492 |
| 7 | 0.8585 | 0.9478 | 0.9843 | 0.6698 | 0.8379 | 0.9500 |
| 8 | 0.8468 | 0.9168 | 0.9819 | 0.7010 | 0.8110 | 0.9500 |
| 9 | 0.8571 | 0.9137 | 0.9712 | 0.6765 | 0.7830 | 0.9449 |
| 10 | 0.7789 | 0.9098 | 0.9626 | 0.6185 | 0.7837 | 0.9382 |
| 11 | 0.8561 | 0.9406 | 0.9799 | 0.6826 | 0.8110 | 0.9496 |
| 12 | 0.8609 | 0.9573 | 0.9807 | 0.6962 | 0.8620 | 0.9648 |
| 13 | 0.8521 | 0.9338 | 0.9692 | 0.7076 | 0.8943 | 0.9504 |
| 14 | 0.8596 | 0.9363 | 0.9807 | 0.7035 | 0.8118 | 0.9394 |
| 15 | 0.8564 | 0.9321 | 0.9818 | 0.6682 | 0.8402 | 0.9424 |
| … | … | … | … | … | … | … |
The average recognition rate curves of the 100 runs on the training and test sets under algorithms 1, 2 and 3 are drawn in Fig. 22. As can be seen from Fig. 22, the per-run average training-set recognition rate of algorithm 1 is mostly 80%-90% and its test-set rate 60%-80%; for algorithm 2, the training-set rate is mostly 90%-96% and the test-set rate 75%-90%; for algorithm 3, the training-set rate is mostly 95%-99% and the test-set rate 93%-97%. The recognition effect of algorithm 3 is therefore significantly better than that of algorithms 1 and 2.
To examine the recognition accuracy and stability of the 3 algorithms, the average recognition rate and standard deviation over the 100 training runs are calculated for each algorithm, as shown in Table 4.7.
Table 4.7
Average recognition rate and standard deviation of 3 algorithms
| Algorithm | Training set | | Test set | |
| | Average recognition rate | Standard deviation | Average recognition rate | Standard deviation |
| Algorithm 1 | 85.46% | 0.0163 | 67.72% | 0.0316 |
| Algorithm 2 | 93.14% | 0.0119 | 83.21% | 0.0234 |
| Algorithm 3 | 97.51% | 0.0074 | 94.45% | 0.0078 |
It is obvious from Table 4.7 that, on both the training and test sets, algorithm 1 has the largest standard deviation of the recognition rate among the 3 algorithms and algorithm 3 the smallest; in terms of the stability of the recognition effect, algorithm 1 is therefore the least stable and algorithm 3 the most stable. The average test-set recognition rate under algorithm 1 is only 67.72%, the lowest, while under algorithm 3 it is 94.45%, the highest. This shows that the rose classification algorithm based on FE and ANN proposed in this paper not only speeds up the algorithm but also significantly improves the accuracy of rose classification.