On-Road Text Detection Platform for Driver Assistance Systems
Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 33093
On-Road Text Detection Platform for Driver Assistance Systems

Authors: Guezouli Larbi, Belkacem Soundes

Abstract:

The automation of the text detection process can help the human in his driving task. Its application can be very useful to help drivers to have more information about their environment by facilitating the reading of road signs such as directional signs, events, stores, etc. In this paper, a system consisting of two stages has been proposed. In the first one, we used pseudo-Zernike moments to pinpoint areas of the image that may contain text. The architecture of this part is based on three main steps, region of interest (ROI) detection, text localization, and non-text region filtering. Then, in the second step, we present a convolutional neural network architecture (On-Road Text Detection Network - ORTDN) which is considered as a classification phase. The results show that the proposed framework achieved ≈ 35 fps and an mAP of ≈ 90%, thus a low computational time with competitive accuracy.

Keywords: Text detection, CNN, PZM, deep learning.

Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 163

References:


[1] N. Anwar, T. Khan, and A. F. Mollah, “Text detection from scene and born images,” in Recent Trends in Communication and Intelligent Systems, A. K. S. undir, N. Yadav, H. Sharma, and S. Das, Eds. Singapore: Springer Nature Singapore, 2022, pp. 115–122.
[2] M. Liao, Z. Zou, Z. Wan, C. Yao, and X. Bai, “Real-time scene text detection with differentiable binarization and adaptive scale fusion,” IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 1–1, 2022.
[3] D. Haifeng and H. Siqi, “Natural scene text detection based on yolo v2 network model,” Journal of Physics: Conference Series, vol. 1634, p. 012013, sep 2020.
[4] J. Wang, H. Hu, and X. Lu, “Adn for object detection,” IET Computer Vision, vol. 14, no. 2, pp. 65–72, 2020.
[5] S. R. Narang, M. K. Jindal, S. Ahuja, and M. Kumar, “On the recognition of devanagari ancient handwritten characters using sift and gabor features,” Soft Computing, vol. 27, no. 22, pp. 17 279–17 289, 2020.
[6] M. Sravani, A. Maheswararao, and M. K. Murthy, “Robust detection of video text using an efficient hybrid method via key frame extraction and text localization,” Multimedia Tools and Applications, 2020. (Online). Available: https://doi.org/10.1007/s11042-020-10113-2
[7] Z. Liu, W. Zhou, and H. Li, “Scene text detection with fully convolutional neural networks,” Multimedia Tools and Applications, vol. 78, no. 13, pp. 18 205–18 227, 2019.
[Online]. Available: https://doi.org/10.1007/s11042-019-7177-4
[8] X. Wang and L. min Hou, “A new robust digital image watermarking based on pseudo-zernike moments,” Multidimens. Syst. Signal Process, vol. 21, no. 2, pp. 179–196, 2010.
[9] G. A. Papakostas, Y. S. Boutalis, D. A. Karras, and B. G. Mertzios, “Efficient computation of zernike and pseudo-zernike moments for pattern classification applications,” Pattern Recognition and Image Analysis, vol. 20, pp. 56–64, 3 2010.
[10] S. Reddy, M. Mathew, L. Gomez, M. Rusinol, D. Karatzas, and C. V. Jawahar, “Roadtext-1k: Text detection & recognition dataset for driving videos,” in 2020 IEEE International Conference on Robotics and Automation, (ICRA) 2020, Paris, France, May 31 - August 31, 2020. IEEE, 2020, pp. 11 074–11 080.
[11] V. Toro and M. Alejandro, “Fast text detection for road scenes,” Master’s thesis, Department of Computer Science, University of Applied Sciences Bonn-Rhein-Sieg, Bonn-Rhein-Sieg, 5 2015.
[12] P. He, W. Huang, T. He, Q. Zhu, Y. Qiao, and X. Li, “Single shot text detector with regional attention,” in 2017 IEEE International Conference on Computer Vision (ICCV), 2017, pp. 3066–3074.
[13] M. Fujitake and H. Ge, “Temporally-aware convolutional block attention module for video text detection,” in 2021 IEEE International Conference on Systems, Man, and Cybernetics (SMC), 2021, pp. 220–225.
[14] Z. Cheng, J. Lu, B. Zou, L. Qiao, Y. Xu, S. Pu, Y. Niu, F. Wu, and S. Zhou, “Free: A fast and robust end-to-end video text spotter,” IEEE Transactions on Image Processing, vol. 30, pp. 822–837, 2021.
[15] C. Fernandez, Learning from Imbalanced Data Sets, 1st ed. Springer; 1st ed. 2018 edition, 11 2018.
[16] G. V. Jose, “Useful plots to diagnose your neural network,” https://towardsdatascience.com/useful-plots-to-diagnose-your-neuralnetwork- 521907fa2f45, 10 2019, accessed by: 26-12-2020.
[17] Z. Wu and S. He, “Improvement of the alexnet networks for large-scale recognition applications,” Iranian Journal of Science and Technology, Transactions of Electrical Engineering, 2020.