Vehicle Detection on The Traffic Using Detection Transformer (DETR) Algorithm

Rofiatul Khoiriyah; Aria Hendrawan

doi:10.63158/IJAIS.v1.i1.4

Authors

Rofiatul Khoiriyah Semarang University Author
Aria Hendrawan Semarang University Author

DOI:

https://doi.org/10.63158/IJAIS.v1.i1.4

Keywords:

DETR, object detection, vehicle detection

Abstract

Object detection is a computer vision technique aimed at detecting and identifying objects in images or videos. In recent years, with advancements in Machine Learning and Deep Learning, object detection has made significant progress in various fields such as healthcare, security, and transportation. The DETR algorithm is a novel approach in object detection that combines transformer architecture with attention techniques to address object detection challenges. This research applies the DETR algorithm with ResNet backbone for vehicle detection on the roads, involving 6 object classes: Car, Truck, Bus, Motorcycle, Pickup Car, and Truck Box. Four training experiments were conducted: DETR-ResNet50, DETR-ResNet101, DETR-DC5-ResNet50, and DETR-DC5-ResNet101. The implementation results show that DETR-DC5 improves the accuracy of vehicle detection. DETR-DC5 with ResNet-101 achieved the highest score for AP50, which is 0.957. However, it should be noted that DETR-DC5 with ResNet-50 managed to maintain overall AP stability, with a lower parameter of 35.5. The model's outcomes in this study can be effectively applied for vehicle detection on the roads.

References

M. Aria and C. Cuccurullo, “bibliometrix: An R-tool for comprehensive science mapping analysis,” J. Informetr., vol. 11, no. 4, pp. 959–975, Nov. 2017, doi: 10.1016/j.joi.2017.08.007.

Z. Chen et al., “Fast vehicle detection algorithm in traffic scene based on improved SSD,” Measurement, vol. 201, p. 111655, Sep. 2022, doi: 10.1016/j.measurement.2022.111655.

D. Lorencik and I. Zolotova, “Object recognition in traffic monitoring systems,” in DISA 2018 - IEEE World Symposium on Digital Intelligence for Systems and Machines, Proceedings, Oct. 2018, pp. 277–282, doi: 10.1109/DISA.2018.8490634.

M. A. Bin Zuraimi and F. H. Kamaru Zaman, “Vehicle detection and tracking using YOLO and DeepSORT,” in ISCAIE 2021 - IEEE 11th Symposium on Computer Applications and Industrial Electronics, Apr. 2021, pp. 23–29, doi: 10.1109/ISCAIE51753.2021.9431784.

S. Ren, K. He, R. Girshick, and J. Sun, “Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 39, no. 6, pp. 1137–1149, Jun. 2017, doi: 10.1109/TPAMI.2016.2577031.

W. Liu et al., “SSD: Single Shot MultiBox Detector,” in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 9905 LNCS, Dec. 2015, pp. 21–37, doi: 10.1007/978-3-319-46448-0_2.

N. Carion, F. Massa, G. Synnaeve, N. Usunier, A. Kirillov, and S. Zagoruyko, “End-to-End Object Detection with Transformers,” in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 12346 LNCS, May 2020, pp. 213–229, doi: 10.1007/978-3-030-58452-8_13.

X. Zhu et al., “Deformable DETR: Deformable Transformers for End-to-End Object Detection,” in Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., Jun. 2021, pp. 13177–13186, doi: 10.1109/CVPR46437.2021.01298.

X. Dai, Y. Chen, J. Yang, P. Zhang, L. Yuan, and L. Zhang, “Dynamic DETR: End-to-End Object Detection with Dynamic Attention,” in Proc. IEEE Int. Conf. Comput. Vis., 2021, pp. 2968–2977, doi: 10.1109/ICCV48922.2021.00298.

E. Arkin, N. Yadikar, X. Xu, A. Aysa, and K. Ubul, “A survey: object detection methods from CNN to transformer,” Multimed. Tools Appl., Oct. 2022, doi: 10.1007/s11042-022-13801-3.

K. He, X. Zhang, S. Ren, and J. Sun, “Deep Residual Learning for Image Recognition,” in Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., Dec. 2015, pp. 770–778, doi: 10.1109/CVPR.2016.90.

M. Aria and C. Cuccurullo, “bibliometrix: An R-tool for comprehensive science mapping analysis,” J. Informetr., vol. 11, no. 4, pp. 959–975, Nov. 2017, doi: 10.1016/j.joi.2017.08.007.

A. Hendrawan, R. Gernowo, O. D. Nurhayati, B. Warsito, and A. Wibowo, “Improvement Object Detection Algorithm Based on YoloV5 with BottleneckCSP,” in Proc. IEEE Int. Conf. Commun. Netw. Satell. (COMNETSAT 2022), 2022, pp. 79–83, doi: 10.1109/COMNETSAT56033.2022.9994461.

F. Yu, V. Koltun, and T. Funkhouser, “Dilated Residual Networks,” in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR 2017), May 2017, vol. 2017-Jan., pp. 636–644, doi: 10.1109/CVPR.2017.75.