4.1 Data set
The original data came from two sources: static images captured by the camera, and frames taken from video recorded by the camera while the shelter-transporting AGV was working. The images were named uniformly, and 1000 of them were selected as the total dataset for target-detection training, split into 800 training images and 200 test images.
The LabelImg tool was used to annotate each image in the dataset, as shown in Fig. 3. The correspondence between the shelter, the red cross, and their labels is shown in Table 1. During annotation the anchor box must completely cover the target; the annotated objects are the peripheral features of the shelter and the red cross in its middle. The YOLO format was selected for annotation. LabelImg draws a bounding box around the object in the image and, once the manual annotation is saved, automatically generates a txt file with the same name as the annotated image.
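To make the annotation format concrete, the following is a minimal sketch of reading back one YOLO-format label file of the kind LabelImg produces; the class-name mapping follows Table 1, while the function name and sample values are illustrative.

```python
# Sketch: parse one YOLO-format label file as produced by LabelImg.
# Each line is: <class_id> <x_center> <y_center> <width> <height>,
# with coordinates normalized to [0, 1] relative to the image size.

NAMES = {0: "shelter", 1: "red cross"}  # tag numbers from Table 1

def parse_yolo_label(text):
    """Return a list of (class_name, x, y, w, h) tuples from label-file text."""
    boxes = []
    for line in text.strip().splitlines():
        cls, x, y, w, h = line.split()
        boxes.append((NAMES[int(cls)], float(x), float(y), float(w), float(h)))
    return boxes

# Illustrative file content: one shelter box plus the red cross at its centre.
sample = "0 0.50 0.50 0.80 0.60\n1 0.50 0.50 0.10 0.10"
print(parse_yolo_label(sample))
```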
Table 1
Correspondence between categories and labels
Category | The shelter | The red cross |
Tag name | shelter | red cross |
Tag number | 0 | 1 |
4.2 Model selection
Considering factors such as training efficiency and detection accuracy, the shelter-transporting AGV needs a detection model that combines good detection performance with fast speed. Therefore, the ten pre-trained YOLOv5 models were trained, and their index parameters are shown in Table 2.
Table 2
Index parameters of ten different models
Model | Depth | Width | Layers | Parameters |
yolov5n | 0.33 | 0.25 | 270 | 1766623 |
yolov5n6 | 0.33 | 0.25 | 355 | 3096244 |
yolov5s | 0.33 | 0.50 | 270 | 7025023 |
yolov5s6 | 0.33 | 0.50 | 355 | 12326164 |
yolov5m | 0.67 | 0.75 | 391 | 21060447 |
yolov5m6 | 0.67 | 0.75 | 481 | 35281716 |
yolov5l | 1.00 | 1.00 | 499 | 46636735 |
yolov5l6 | 1.00 | 1.00 | 607 | 76170196 |
yolov5x | 1.33 | 1.25 | 567 | 86224543 |
yolov5x6 | 1.33 | 1.25 | 733 | 140045044 |
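The Depth and Width columns in Table 2 are scaling multipliers: each model variant repeats modules and widens channels from a common base architecture. A sketch of how such multipliers are applied (the formulas follow the convention used in the public YOLOv5 code; the base values below are illustrative):

```python
import math

def scale_depth(n, depth_multiple):
    """Number of repeats of a module after depth scaling (at least 1)."""
    return max(round(n * depth_multiple), 1) if n > 1 else n

def scale_width(channels, width_multiple, divisor=8):
    """Channel count after width scaling, rounded up to a multiple of 8."""
    return math.ceil(channels * width_multiple / divisor) * divisor

# e.g. a 3-repeat block and 64 base channels under yolov5n (0.33 / 0.25):
print(scale_depth(3, 0.33), scale_width(64, 0.25))
```

This is why the layer and parameter counts in Table 2 grow steadily from yolov5n to yolov5x6 while the underlying architecture stays the same.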
The models in Table 2 are listed in ascending order of network complexity. The experimental environment was the Ubuntu 18.04 operating system with the PyTorch framework, an Intel Core i9-10900K CPU, and an NVIDIA RTX 3090 GPU with 24 GB of memory. The training parameter settings for the ten pre-trained YOLOv5 models are shown in Table 3.
Table 3
Experimental training parameters
Parameter | Value |
Epochs (preset number of training epochs) | 500 |
Batch size | 16 |
Learning rate | 0.01 |
Momentum term | 0.937 |
Decay (weight-decay regularisation term) | 0.0005 |
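The Table 3 settings map naturally onto YOLOv5's hyperparameter conventions; a sketch expressing them as a plain configuration dict (the key names mirror the public YOLOv5 hyperparameter file, and are an assumption here):

```python
# Training configuration from Table 3, expressed as a plain dict whose key
# names follow YOLOv5's hyperparameter file (lr0, momentum, weight_decay).
train_cfg = {
    "epochs": 500,           # preset number of training epochs
    "batch_size": 16,
    "lr0": 0.01,             # initial learning rate
    "momentum": 0.937,       # SGD momentum term
    "weight_decay": 0.0005,  # decay regularisation term
}
print(train_cfg)
```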
The ten models in Table 2 were trained with the parameters in Table 3, taking 16.059 hours in total, and the resulting training curves are compared in Fig. 4.
During training, each metric changes as the number of training steps increases. The metrics in Fig. 4 are defined as follows:
In Fig. 4(a), precision is the number of correctly detected targets divided by the total number of detections made; the closer it is to 1, the higher the accuracy. In Fig. 4(b), recall is the number of correctly detected targets divided by the total number of targets that should be detected; again, the closer to 1, the better. In Fig. 4(c), mAP_0.5 (mean Average Precision) is obtained by setting the IOU threshold to 0.5, computing the AP over all images for each category, and then averaging over the categories. In Fig. 4(d), mAP_0.5:0.95 is the mAP averaged over IOU thresholds from 0.5 to 0.95 in steps of 0.05.
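The precision and recall definitions above follow directly from true-positive, false-positive, and false-negative counts, and the mAP_0.5:0.95 threshold grid is easy to enumerate; a sketch with illustrative counts:

```python
def precision(tp, fp):
    """Correct detections divided by all detections made."""
    return tp / (tp + fp)

def recall(tp, fn):
    """Correct detections divided by all targets that should be detected."""
    return tp / (tp + fn)

# IOU thresholds over which mAP_0.5:0.95 is averaged: 0.5, 0.55, ..., 0.95
iou_thresholds = [0.5 + 0.05 * i for i in range(10)]

# Illustrative counts: 90 true positives, 5 false positives, 10 missed targets.
print(round(precision(90, 5), 3), round(recall(90, 10), 3), len(iou_thresholds))
```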
As can be seen from Fig. 4, each metric had stabilized by around 200 training steps, and by 500 steps all curves showed a good fit. Comparing the training results, both precision and recall approach 1 as training proceeds, indicating that all ten models trained well, and mAP_0.5 likewise stabilized around 1. mAP_0.5:0.95 rose slowly over the first 100 steps and then stabilized, slowly approaching 1; there is a clear gap between the final values of the different models, but all exceed 0.9 and the trends are stable.
To further analyze the training effect, the ten trained models were tested on the same test set and compared. The detection results are shown in Table 4.
Table 4
Comparison of detection results
Model | Layers | Parameters | Model size /M | Training time /h | Accuracy /% | Detection time /ms |
yolov5n | 213 | 1761871 | 3.8 | 0.719 | 94.24 | 5.5 |
yolov5n6 | 280 | 3089188 | 6.6 | 0.857 | 96.28 | 6.9 |
yolov5s | 213 | 7015519 | 14.4 | 0.813 | 95.04 | 5.8 |
yolov5s6 | 280 | 12312052 | 25.1 | 0.955 | 96.91 | 7.6 |
yolov5m | 308 | 21041679 | 42.6 | 1.260 | 95.22 | 8.3 |
yolov5m6 | 378 | 35254692 | 71.1 | 1.371 | 95.88 | 10.3 |
yolov5l | 392 | 46605951 | 93.8 | 1.847 | 96.46 | 10.6 |
yolov5l6 | 476 | 76126356 | 153.0 | 2.002 | 97.24 | 12.5 |
yolov5x | 444 | 86180143 | 173.1 | 2.929 | 96.96 | 14.7 |
yolov5x6 | 574 | 139980484 | 280.9 | 3.306 | 96.65 | 17.6 |
As Table 4 shows, the more complex the structure and the more parameters a model has, the longer its training time and the larger its weight file. Comparing the performance indices and detection results of the trained models, the detection accuracy of the relatively complex models (i.e., yolov5m and yolov5m6) was not as good as that of the relatively simple ones (i.e., yolov5s6 and yolov5n6), and the most complex model, yolov5x6, was outperformed by yolov5s6. It can therefore be concluded that a more complex trained model does not necessarily detect better in practical applications. yolov5n has the smallest depth and width among the pre-trained models, the fewest layers and parameters after training, the shortest detection time, and the smallest weight file, which makes it well suited to deployment on the shelter-transporting AGV; however, its detection accuracy of 94.24% is the lowest of all the models. yolov5n6 is only slightly more complex than yolov5n but nearly 2% more accurate. Compared with the more complex yolov5m, yolov5l, and yolov5x, the accuracy of yolov5n6 is not low, and its detection time of only 6.9 ms is far below the more than 10 ms required by those complex models. The training results of yolov5n6 are shown in Fig. 5.
According to the yolov5n6 training results, the model loss decreased and stabilized as the number of training steps increased. The curves fit well, and precision, recall, mAP_0.5, and mAP_0.5:0.95 all stabilized near 1. Weighing detection accuracy, detection time, and model weight, yolov5n6 was selected as the best detection model and applied to shelter detection, achieving both high detection accuracy and high speed.
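The accuracy-versus-speed trade-off behind this choice can be checked mechanically against Table 4. A sketch that filters the models by a detection-time budget and keeps the most accurate one (the figures are copied from Table 4; the 7 ms budget is an illustrative assumption, not a requirement stated above):

```python
# (model, accuracy %, detection time ms), copied from Table 4.
results = [
    ("yolov5n", 94.24, 5.5), ("yolov5n6", 96.28, 6.9),
    ("yolov5s", 95.04, 5.8), ("yolov5s6", 96.91, 7.6),
    ("yolov5m", 95.22, 8.3), ("yolov5m6", 95.88, 10.3),
    ("yolov5l", 96.46, 10.6), ("yolov5l6", 97.24, 12.5),
    ("yolov5x", 96.96, 14.7), ("yolov5x6", 96.65, 17.6),
]

def best_under_budget(results, max_ms):
    """Most accurate model whose detection time stays within the budget."""
    fast = [r for r in results if r[2] <= max_ms]
    return max(fast, key=lambda r: r[1])[0]

# With an assumed 7 ms budget, the selection is yolov5n6.
print(best_under_budget(results, 7.0))
```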
4.3 Discussion
To solve the problems of low accuracy and slow speed in shelter identification by the shelter-transporting AGV and to further improve detection performance, the improved detection model yolov5n6* was developed by introducing CBAM into the model's main structure, changing the loss function from GIOU_Loss to CIOU_Loss, and adopting the more reasonable DIOU_nms. The box_loss and mAP_0.5:0.95 curves of the yolov5n6* and yolov5n6 models are shown in Fig. 6.
Comparing the training results of the two models in Fig. 6 shows that the box_loss of both the improved yolov5n6* and the original yolov5n6 decreased as training progressed and gradually stabilized. The box_loss of the improved yolov5n6* is 1.2% lower than that of yolov5n6, which meets the requirements of the proposed strategy and shows that the improvements give yolov5n6* higher localization accuracy. As shown in Fig. 6(b), the mAP_0.5:0.95 of both models gradually approached 1 during training and stabilized after 400 epochs, at which point the models had reached a good fit. Compared with the original yolov5n6, the mAP_0.5:0.95 of yolov5n6* increased by 2%, indicating that the improved model obtained good training results.
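The switch from GIOU_Loss to CIOU_Loss can be illustrated for two axis-aligned boxes. A minimal sketch following the published CIoU formula (IoU minus a centre-distance penalty and an aspect-ratio consistency term); the box coordinates are illustrative:

```python
import math

def ciou(b1, b2, eps=1e-9):
    """CIoU for two boxes given as (x1, y1, x2, y2); CIoU loss is 1 - CIoU."""
    # Intersection and union areas
    iw = max(0.0, min(b1[2], b2[2]) - max(b1[0], b2[0]))
    ih = max(0.0, min(b1[3], b2[3]) - max(b1[1], b2[1]))
    inter = iw * ih
    a1 = (b1[2] - b1[0]) * (b1[3] - b1[1])
    a2 = (b2[2] - b2[0]) * (b2[3] - b2[1])
    iou = inter / (a1 + a2 - inter + eps)
    # Squared distance between box centres, over the squared diagonal of the
    # smallest box enclosing both
    cx1, cy1 = (b1[0] + b1[2]) / 2, (b1[1] + b1[3]) / 2
    cx2, cy2 = (b2[0] + b2[2]) / 2, (b2[1] + b2[3]) / 2
    rho2 = (cx1 - cx2) ** 2 + (cy1 - cy2) ** 2
    cw = max(b1[2], b2[2]) - min(b1[0], b2[0])
    ch = max(b1[3], b2[3]) - min(b1[1], b2[1])
    c2 = cw ** 2 + ch ** 2 + eps
    # Aspect-ratio consistency term v and its trade-off weight alpha
    v = (4 / math.pi ** 2) * (
        math.atan((b1[2] - b1[0]) / (b1[3] - b1[1]))
        - math.atan((b2[2] - b2[0]) / (b2[3] - b2[1]))
    ) ** 2
    alpha = v / (1 - iou + v + eps)
    return iou - rho2 / c2 - alpha * v

# Identical boxes score ~1, so the corresponding CIoU loss would be ~0.
print(round(ciou((0, 0, 2, 2), (0, 0, 2, 2)), 6))
```

Unlike plain IoU, this score still carries a gradient signal when the predicted and ground-truth boxes do not overlap, which is what makes the bounding-box regression converge to tighter localization.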
To further evaluate the performance of the improved model, both yolov5n6* and the original yolov5n6 were run on the test set. The detection results of the two models are compared in Table 5 and Fig. 7.
Table 5
Comparison of detection results
Model | Model size /M | Accuracy /% | Detection time /ms |
yolov5n6 | 6.6 | 96.28 | 6.9 |
yolov5n6* | 7.2 | 97.15 | 7.1 |
As can be seen from Table 5, the detection accuracy of yolov5n6* is 0.87% higher than that of yolov5n6, while its detection time increases by only 0.2 ms; its small size therefore keeps yolov5n6* suitable for deployment on a shelter-transporting AGV. Figure 7 shows the original image together with the detection results of yolov5n6 and yolov5n6*. The first row of images shows that the occluded object is not filtered out of the detection results, while the second row shows that introducing CBAM and switching to CIOU_Loss increases the confidence of the detections. This demonstrates that the attention mechanism and the improved loss function effectively improve the shelter-detection ability and make the yolov5n6* model more robust.
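The DIOU_nms step adopted in yolov5n6* replaces the plain IoU criterion of standard NMS with IoU minus a normalised centre-distance term, so heavily overlapping boxes whose centres are well separated (e.g., partially occluded shelters) are less likely to be suppressed. A minimal sketch for axis-aligned boxes; the score threshold and sample detections are illustrative:

```python
def diou(b1, b2, eps=1e-9):
    """DIoU = IoU - (centre distance)^2 / (enclosing-box diagonal)^2."""
    iw = max(0.0, min(b1[2], b2[2]) - max(b1[0], b2[0]))
    ih = max(0.0, min(b1[3], b2[3]) - max(b1[1], b2[1]))
    inter = iw * ih
    a1 = (b1[2] - b1[0]) * (b1[3] - b1[1])
    a2 = (b2[2] - b2[0]) * (b2[3] - b2[1])
    iou = inter / (a1 + a2 - inter + eps)
    rho2 = ((b1[0] + b1[2] - b2[0] - b2[2]) ** 2
            + (b1[1] + b1[3] - b2[1] - b2[3]) ** 2) / 4
    cw = max(b1[2], b2[2]) - min(b1[0], b2[0])
    ch = max(b1[3], b2[3]) - min(b1[1], b2[1])
    return iou - rho2 / (cw ** 2 + ch ** 2 + eps)

def diou_nms(dets, thr=0.5):
    """dets: list of (score, box). Keep a box unless its DIoU with an
    already-kept, higher-scoring box exceeds the threshold."""
    keep = []
    for score, box in sorted(dets, reverse=True):
        if all(diou(box, kept) <= thr for _, kept in keep):
            keep.append((score, box))
    return keep

# Two heavily overlapping boxes collapse to one; a well-separated box survives.
dets = [(0.9, (0, 0, 2, 2)), (0.8, (0.1, 0.1, 2.1, 2.1)), (0.7, (3, 0, 5, 2))]
print(len(diou_nms(dets)))
```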