Visualization and Observations
In Fig. 7, mosaics of surface color video frames are presented, where OCT B-scan lines are overlaid for a typical normal fillet and two WB fillets. Each video frame contains a red horizontal arrow denoting the scan line position. Furthermore, below the video mosaic, Fig. 7 includes mosaics of corresponding B-scan images obtained from the same scan line. In Fig. 7, the visual analysis reveals that the color distinction between the normal and WB fillet images is relatively small. However, specific lesion areas can be observed, particularly reddish spots that exclusively correspond to petechial hemorrhagic lesions found in the severe WB sample in Fig. 7 (c). These findings are consistent with previous research, which has documented lesion areas particularly hemorrhagic lesions in WB samples (Kuttappan et al., 2017; Soglia et al., 2017). On the other hand, as shown in Fig. 7 (b), not all WB samples have those lesion areas. Furthermore, both the normal and WB samples can exhibit fatty areas and white stripes visible on their surfaces. This complex visual presentation underscores the challenges associated with relying solely on surface color images to detect WB. The overlapping features and similarities between normal and WB samples pose difficulties in visually identifying WB based on surface appearance alone.
However, examining the OCT B-scans and segmented depth images provides valuable insights. It is evident that the thickness of the epimysium layer, which is the top layer on the surface of the sample, is significantly larger in the WB sample images compared to the normal sample. These findings are consistent with the previous study conducted by Yoon et al. (2016), which reported that the epimysium thickness of wooden breast fillets in OCT images was approximately twice as large as that of normal fillets. This is also in agreement with the finding of previous studies suggesting that the WB condition results in the accumulation of collagen and other connective tissues in the breast muscle, leading to the thickening of the connective tissue (Soglia et al., 2021). Therefore, the observation in this study emphasizes the potential of OCT imaging and segmented depth analysis for distinguishing WB fillets based on the increased thickness of the epimysium layer. These structural differences offer a promising method for WB detection beyond surface color images.
In Fig. 8, four examples of video frames, OCT scan lines, B scans, and A scans are shown for facilitating a comparison between a normal fillet and three severe WB fillets. Notably, Fig. 8 highlights variations observed in the severe WB fillets, such as the presence or absence of white stripes and lesion areas. However, it is crucial to emphasize that regardless of these variations, the thickness of the epimysium layer in WB samples consistently exceeds that of the normal sample. This distinction becomes particularly noticeable in the WB samples exhibiting lesion areas and prominent white stripes.
Woody Breast Condition Classification
Table 1 provides an overview of the classification performance on both the training and testing datasets achieved by various classifiers on different classification tasks related to the WB condition. For the classification of normal and severe WB fillets, all classifiers demonstrated relatively high accuracy. XGB classifier achieved perfect accuracy of 100% on the training and test dataset, while DT and RF classifiers achieved high test accuracies of 96.0% and 92.0%, respectively. These results indicate that the classifiers were able to effectively differentiate between normal and severe WB fillets. For the classification of normal and WB (moderate and severe combined), the classifiers also achieved high accuracies. XGB, DL, and NB performed well with an accuracy of 93.3% on the testing dataset, followed closely by KNN with an accuracy of 90.0%. These results suggest that the classifiers were reasonably successful in identifying any kinds of WB samples from the normal ones. It is important to emphasize that the task of sorting WB samples from normal ones holds significant practical relevance in real-world industry applications. By achieving high accuracies in this task, the classifiers showcased their potential for reliable identification and differentiation of WB samples, providing valuable insights for quality assessment and control in the industry. Moreover, the highest classification accuracy achieved for distinguishing between normal and moderate WB fillets was 95.0% using the DT model. In contrast, the maximum accuracy rate for differentiating moderate WB from severe WB fillets was 85.0%. These results highlight the difficulty of accurately separating moderate WB samples from severe WB samples, making it the most challenging task among the classification tasks analyzed. However, the results of this study showed a significant improvement in classifying moderate and severe WB fillets compared to the results of a similar work by Wold et al. (2019) who obtained 76.9% training and 68.7% test accuracy for the same task using NIR spectroscopy. Lastly, for the three-class classification task involving normal, moderate, and severe WB fillets, the classifiers achieved varying levels of accuracy. Three-class modeling resulted in noticeably lower accuracies than that in all the two-class models. Its highest rate was achieved using RF with an accuracy of 80.0%, followed by NB with an accuracy of 73.3%. This can be as a result of adding the moderate class which is harder to be separated from the severe class. Overall, the high accuracy rates achieved on various tasks indicate the feasibility of using OCT imaging combined with machine learning for automated detection and classification of the WB condition in poultry meat production. This improvement in the discrimination of normal and WB fillets in this study compared to previous studies can be explained by two main considerations. First, the large-scale scanning across the fillet surface provided more representative information from the spatially unevenly distributed WB areas. And secondly the micron level measurement of the properties of the epimysium layer, which comprises collagen molecules and contributes to the hardness of the WB fillets, performed as a more relevant feature for the WB detection. However, further refinement and optimization may be necessary, particularly in distinguishing between moderate and severe WB samples, by obtaining information from a larger area of individual fillets and fusing other highly relevant features.
Table 1
Performance of different classifiers for the classification of normal vs. woody breast chicken fillets using OCT imaging
Classifier | Normal breast vs. Severe WB | Normal breast vs. WB | Normal breast vs. Moderate WB | Moderate WB vs. Severe WB | 3 Classes |
Train | Test | Train | Test | Train | Test | Train | Test | Train | Test |
XGB | 100 | 100 | 91.3 | 93.3 | 89.2 | 85.0 | 85.0 | 85.0 | 95.6 | 60.0 |
SVM | 92.9 | 84.0 | 92.1 | 86.6 | 98.4 | 90.0 | 80.0 | 60.0 | 76.1 | 70.0 |
DT | 100 | 96.0 | 100 | 93.3 | 100 | 95.0 | 100 | 65.0 | 100 | 66.6 |
RF | 100 | 92.0 | 100 | 86.6 | 100 | 80.0 | 100 | 55.0 | 100 | 80.0 |
KNN | 95.0 | 76.0 | 95.6 | 90.0 | 93.8 | 80.0 | 85.0 | 55.0 | 83.4 | 66.6 |
NB | 84.7 | 68.0 | 86.0 | 93.3 | 92.3 | 85.0 | 85.0 | 85.0 | 81.7 | 73.3 |
LDA | 83.0 | 80.0 | 86.0 | 80.0 | 63.0 | 50.0 | 80.0 | 75.0 | 61.7 | 46.6 |
XGB extreme gradient boosting, SVM support vector machine, DT decision trees, RF random forest, kNN k-nearest neighbors, NB Naïve Bayes, LDA linear discriminant analysis
Tables 2 ‒ 4 display the recall, precision, F1-score, and Kappa coefficient values obtained by various classifiers for different classification tasks related to the WB condition. In Table 2, the XGB classifier achieved the perfect scores of 1.0 for recall, precision, F1-score, and Kappa coefficient, indicating excellent performance in correctly identifying both normal and severe WB samples. The DT and RF models also performed well with score values near 1.0, indicating strong performance in correctly classifying normal and severe WB fillets. Furthermore, the XGB classifier demonstrated superior performance in classifying normal and WB samples, achieving 0.92, 0.94, 0.92, and 0.84 for recall, precision, F1-score, and Kappa coefficient scores, respectively. These metrics highlight the effectiveness of the method in accurately identifying and distinguishing between normal and WB samples. The high recall value indicates its ability to correctly classify a significant proportion of actual positive instances. The precision score reflects its capability to minimize false positives which can lead to less waste of normal samples labeled as defective. F1-score represents the harmonic mean of precision and recall, providing a balanced measure of its overall performance. The Kappa coefficient, which assesses agreement beyond chance, indicates a substantial level of agreement in the classification results and is considered to be good if it is higher than 0.8.
Table 2
Classification performance for normal vs. woody breast chicken fillets using OCT imaging
Classifier | Normal breast vs. Severe WB | | Normal breast vs. WB | |
Recall | Precision | F1-score | Kappa | | Recall | Precision | F1-score | Kappa | |
XGB | 1.0 | 1.0 | 1.0 | 1.0 | | 0.92 | 0.94 | 0.92 | 0.84 |
SVM | 0.87 | 0.86 | 0.82 | 0.84 | | 0.77 | 0.81 | 0.81 | 0.73 |
DT | 0.96 | 0.96 | 0.96 | 0.87 | | 0.92 | 0.94 | 0.92 | 0.75 |
RF | 0.95 | 0.9 | 0.9 | 0.85 | | 0.85 | 0.78 | 0.80 | 0.72 |
KNN | 0.78 | 0.82 | 0.74 | 0.82 | | 0.89 | 0.92 | 0.88 | 0.73 |
NB | 0.66 | 0.62 | 0.63 | 0.71 | | 0.92 | 0.94 | 0.92 | 0.75 |
LDA | 0.81 | 0.72 | 0.73 | 0.69 | | 0.78 | 0.79 | 0.74 | 0.73 |
XGB extreme gradient boosting, SVM support vector machine, DT decision trees, RF random forest, kNN k-nearest neighbors, NB Naïve Bayes, LDA linear discriminant analysis
Table 3 shows the classification performance of the models in distinguishing moderate WB samples from normal and severe WB ones. For example, in the task of classifying moderate WB vs. normal samples, the DT classifier demonstrated a recall of 0.96, precision of 0.95, F1-score of 0.94, and Kappa coefficient of 0.76. These scores were higher than the ones for classifying moderate vs. severe WB samples with a maximum of 0.90, 0.88, 0.84, and 0.74 for recall, precision, F1-score, and Kappa coefficient, respectively. This indicates that distinguishing between moderate WB fillets and severe WB fillets is more challenging compared to classifying moderate WB fillets from normal ones.
Table 3
Classification performance for normal and severe woody breast vs. moderate woody breast chicken fillets using OCT imaging
Classifier | Normal breast vs. Moderate WB | | Moderate WB vs. severe WB | |
Recall | Precision | F1-score | Kappa | | Recall | Precision | F1-score | Kappa |
XGB | 0.81 | 0.79 | 0.77 | 0.6 | | 0.9 | 0.88 | 0.84 | 0.74 |
SVM | 0.91 | 0.91 | 0.89 | 0.7 | | 0.71 | 0.56 | 0.57 | 0.57 |
DT | 0.96 | 0.95 | 0.94 | 0.76 | | 0.63 | 0.68 | 0.63 | 0.48 |
RF | 0.81 | 0.85 | 0.78 | 0.72 | | 0.57 | 0.54 | 0.45 | 0.39 |
KNN | 0.81 | 0.76 | 0.84 | 0.7 | | 0.53 | 0.54 | 0.48 | 0.33 |
NB | 0.88 | 0.86 | 0.84 | 0.7 | | 0.72 | 0.74 | 0.71 | 0.36 |
LDA | 0.46 | 0.26 | 0.32 | 0.59 | | 0.76 | 0.78 | 0.74 | 0.38 |
XGB extreme gradient boosting, SVM support vector machine, DT decision trees, RF random forest, kNN k-nearest neighbors, NB Naïve Bayes, LDA linear discriminant analysis
Table 4 presents the performance of different classifiers in the three-class classification of normal, moderate, and severe WB fillets. The RF classifier achieved relatively higher scores across the metrics, indicating better overall performance in accurately identifying the three classes. However, there is still room for improvement in achieving higher accuracy in the 3-class task.
Table 4
Classification performance for normal and severe woody breast vs. moderate woody breast chicken fillets using OCT imaging
Classifier | 3-Class Classification of Normal breast, Moderate WB, and Severe WB |
Recall | Precision | F1-score | Kappa |
XGB | 0.62 | 0.62 | 0.56 | 0.38 |
SVM | 0.70 | 0.69 | 0.64 | 0.43 |
DT | 0.47 | 0.53 | 0.47 | 0.43 |
RF | 0.72 | 0.65 | 0.67 | 0.47 |
KNN | 0.66 | 0.55 | 0.56 | 0.47 |
NB | 0.68 | 0.66 | 0.64 | 0.49 |
LDA | 0.42 | 0.31 | 0.33 | 0.45 |
XGB extreme gradient boosting, SVM support vector machine, DT decision trees, RF random forest, kNN k-nearest neighbors, NB Naïve Bayes, LDA linear discriminant analysis
The findings of this study, which employed OCT imaging coupled with traditional image processing and machine learning techniques, yielded promising results in the accurate detection and classification of WB fillets. This indicates that the applied algorithms were effective in extracting relevant features and making accurate classifications. In addition, the methods used had low computational complexity without the need for large amounts of labeled data for training. However, it is important to note that these techniques relied on manual thresholding and feature extraction which may limit their adaptability to complex and diverse datasets. On the other hand, deep learning methods can achieve superior performance with automatically learning discriminative features and patterns directly from the data. Thus, in the future direction, a fully automated high-speed OCT scanning system will be developed to enable the acquisition of a large volume of data from a larger number of samples which could be utilized to enhance the performance of deep learning models. Additionally, such an automated high-speed scanning system would allow for OCT image data to be acquired from a larger portion of the fillet, rather than scanning only one central line along each fillet. This would enable a more accurate assessment of the non-uniform distribution of WB areas within each sample. This improved dataset would facilitate more accurate and reliable classification, even in the challenging three-class task involving normal, moderate, and severe WB samples. This can contribute to better understanding and management of this condition.