Deep Learning-based Road Segmentation & Pedestrian Detection System for Intelligent Vehicles
Year 2023,
, 22 - 31, 30.04.2023
Gozde Yolcu Öztel
,
İsmail Öztel
Abstract
Correctly determining the driving area and pedestrians is crucial for intelligent vehicles to reduce fatal road accidents risk. But these are challenging tasks in the computer vision field. Various weather, road conditions, etc., make them difficult. This paper presents a vision-based road segmentation and pedestrian detection system. First, the roads are segmented using a deep learning based consecutive triple filter size (CTFS) approach. Then, pedestrians on the segmented roads are detected using the transfer learning approach. The CTFS approach can create feature maps for small and big features. The proposed system is a reliable, low-cost road segmentation and pedestrian detection system for intelligent vehicles.
References
- K.-W. Chiang and Y.-W. Huang, “An intelligent navigator for seamless INS/GPS integrated land vehicle navigation applications,” Appl Soft Comput, vol. 8, no. 1, pp. 722–733, Jan. 2008, doi: 10.1016/j.asoc.2007.05.010.
- X. Zhang and M. M. Khan, “Intelligent Vehicle Navigation and Traffic System,” in Principles of Intelligent Automobiles, Singapore: Springer Singapore, 2019, pp. 175–209. doi: 10.1007/978-981-13-2484-0_5.
- J. Jin and X. Ma, “A group-based traffic signal control with adaptive learning ability,” Eng Appl Artif Intell, vol. 65, pp. 282–293, Oct. 2017, doi: 10.1016/j.engappai.2017.07.022.
- J.-Z. Yuan, H. Chen, B. Zhao, and Y. Xu, “Estimation of Vehicle Pose and Position with Monocular Camera at Urban Road Intersections,” J Comput Sci Technol, vol. 32, no. 6, pp. 1150–1161, Nov. 2017, doi: 10.1007/s11390-017-1790-3.
- C. Ma, W. Hao, A. Wang, and H. Zhao, “Developing a Coordinated Signal Control System for Urban Ring Road Under the Vehicle-Infrastructure Connected Environment,” IEEE Access, vol. 6, pp. 52471–52478, 2018, doi: 10.1109/ACCESS.2018.2869890.
- S. Zhang, R. Benenson, M. Omran, J. Hosang, and B. Schiele, “Towards Reaching Human Performance in Pedestrian Detection,” IEEE Trans Pattern Anal Mach Intell, vol. 40, no. 4, pp. 973–986, Apr. 2018, doi: 10.1109/TPAMI.2017.2700460.
- J. Li, X. Liang, S. Shen, T. Xu, J. Feng, and S. Yan, “Scale-aware Fast R-CNN for Pedestrian Detection,” IEEE Trans Multimedia, pp. 1–1, 2017, doi: 10.1109/TMM.2017.2759508.
- B. Ma, S. Lakshmanan, and A. O. Hero, “Simultaneous detection of lane and pavement boundaries using model-based multisensor fusion,” IEEE Transactions on Intelligent Transportation Systems, vol. 1, no. 3, pp. 135–147, 2000, doi: 10.1109/6979.892150.
- J. Sparbert, K. Dietmayer, and D. Streller, “Lane detection and street type classification using laser range images,” in ITSC 2001. 2001 IEEE Intelligent Transportation Systems. Proceedings (Cat. No.01TH8585), pp. 454–459. doi: 10.1109/ITSC.2001.948700.
- M. Bertozzi and A. Broggi, “GOLD: a parallel real-time stereo vision system for generic obstacle and lane detection,” IEEE Transactions on Image Processing, vol. 7, no. 1, pp. 62–81, 1998, doi: 10.1109/83.650851.
- Y. Wang, E. K. Teoh, and D. Shen, “Lane detection and tracking using B-Snake,” Image Vis Comput, vol. 22, no. 4, pp. 269–280, Apr. 2004, doi: 10.1016/j.imavis.2003.10.003.
- Luo-Wei Tsai, Jun-Wei Hsieh, Chi-Hung Chuang, and Kuo-Chin Fan, “Lane detection using directional random walks,” in 2008 IEEE Intelligent Vehicles Symposium, Jun. 2008, pp. 303–306. doi: 10.1109/IVS.2008.4621271.
- Q. Li, N. Zheng, and H. Cheng, “Springrobot: A Prototype Autonomous Vehicle and Its Algorithms for Lane Detection,” IEEE Transactions on Intelligent Transportation Systems, vol. 5, no. 4, pp. 300–308, Dec. 2004, doi: 10.1109/TITS.2004.838220.
- Y. Wang, E. K. Teoh, and D. Shen, “Lane detection and tracking using B-Snake,” Image Vis Comput, vol. 22, no. 4, pp. 269–280, Apr. 2004, doi: 10.1016/j.imavis.2003.10.003.
- J. M. Alvarez, T. Gevers, Y. LeCun, and A. M. Lopez, “Road Scene Segmentation from a Single Image,” 2012, pp. 376–389. doi: 10.1007/978-3-642-33786-4_28.
- G. L. Oliveira, W. Burgard, and T. Brox, “Efficient deep models for monocular road segmentation,” in 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Oct. 2016, pp. 4885–4891. doi: 10.1109/IROS.2016.7759717.
- H. Liu, X. Han, X. Li, Y. Yao, P. Huang, and Z. Tang, “Deep representation learning for road detection using Siamese network,” Multimed Tools Appl, vol. 78, no. 17, pp. 24269–24283, Sep. 2019, doi: 10.1007/s11042-018-6986-1.
- C.-A. Brust, S. Sickert, M. Simon, E. Rodner, and J. Denzler, “Convolutional Patch Networks with Spatial Prior for Road Detection and Urban Scene Understanding,” Feb. 2015.
- D. T. Nguyen, W. Li, and P. O. Ogunbona, “Human detection from images and videos: A survey,” Pattern Recognit, vol. 51, pp. 148–175, Mar. 2016, doi: 10.1016/j.patcog.2015.08.027.
- Y. Kim and T. Moon, “Human Detection and Activity Classification Based on Micro-Doppler Signatures Using Deep Convolutional Neural Networks,” IEEE Geoscience and Remote Sensing Letters, vol. 13, no. 1, pp. 8–12, Jan. 2016, doi: 10.1109/LGRS.2015.2491329.
- W. Ouyang and X. Wang, “Joint deep learning for pedestrian detection,” Proceedings of the IEEE International Conference on Computer Vision, pp. 2056–2063, 2013, doi: 10.1109/ICCV.2013.257.
- I. Goodfellow, Y. Bengio, and A. Courville, Deep learning. MIT Press, 2016.
- S. Srinivas, R. K. Sarvadevabhatla, K. R. Mopuri, N. Prabhu, S. S. S. Kruthiventi, and R. V. Babu, “A Taxonomy of Deep Convolutional Neural Nets for Computer Vision,” Front Robot AI, Jan. 2016, doi: 10.3389/frobt.2015.00036.
- V. Nair and G. E. Hinton, “Rectified Linear Units Improve Restricted Boltzmann Machines,” in Proceedings of the 27th International Conference on International Conference on Machine Learning, 2010, pp. 807–814.
- N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, “Dropout: A Simple Way to Prevent Neural Networks from Overfitting,” Journal of Machine Learning Research, vol. 15, pp. 1929–1958, 2014, doi: 10.1214/12-AOS1000.
- R. Padilla, S. L. Netto, and E. A. B. da Silva, “A Survey on Performance Metrics for Object-Detection Algorithms,” in 2020 International Conference on Systems, Signals and Image Processing (IWSSIP), Jul. 2020, pp. 237–242. doi: 10.1109/IWSSIP48289.2020.9145130.
- J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, “You Only Look Once: Unified, Real-Time Object Detection,” in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 2016, pp. 779–788. doi: 10.1109/CVPR.2016.91.
- Y. Tian, G. Yang, Z. Wang, H. Wang, E. Li, and Z. Liang, “Apple detection during different growth stages in orchards using the improved YOLO-V3 model,” Comput Electron Agric, vol. 157, pp. 417–426, Feb. 2019, doi: 10.1016/j.compag.2019.01.012.
- V. Badrinarayanan, A. Kendall, and R. Cipolla, “SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation,” IEEE Trans Pattern Anal Mach Intell, vol. 39, no. 12, pp. 2481–2495, Dec. 2017, doi: 10.1109/TPAMI.2016.2644615.
- M. Everingham, L. Van~Gool, C. K. I. Williams, J. Winn, and A. Zisserman, “The Pascal Visual Object Classes Challenge 2012 (VOC2012) Results.”
- K. Simonyan and A. Zisserman, “Very Deep Convolutional Networks for Large-Scale Image Recognition,” Sep. 2015.
- X. Song, T. Rui, S. Zhang, J. Fei, and X. Wang, “The Road Segmentation Method Based on the Deep Auto-Encoder with Supervised Learning,” Computers & Electrical Engineering, vol. 68, pp. 381–388, 2018, doi: 10.1007/978-3-319-69877-9_28.
- J. Liu, B. Liu, and H. Lu, “Detection guided deconvolutional network for hierarchical feature learning,” Pattern Recognit, vol. 48, no. 8, pp. 2645–2655, Aug. 2015, doi: 10.1016/j.patcog.2015.02.002.
- J. Long, E. Shelhamer, and T. Darrell, “Fully convolutional networks for semantic segmentation,” in 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 2015, pp. 3431–3440. doi: 10.1109/CVPR.2015.7298965.
Year 2023,
, 22 - 31, 30.04.2023
Gozde Yolcu Öztel
,
İsmail Öztel
References
- K.-W. Chiang and Y.-W. Huang, “An intelligent navigator for seamless INS/GPS integrated land vehicle navigation applications,” Appl Soft Comput, vol. 8, no. 1, pp. 722–733, Jan. 2008, doi: 10.1016/j.asoc.2007.05.010.
- X. Zhang and M. M. Khan, “Intelligent Vehicle Navigation and Traffic System,” in Principles of Intelligent Automobiles, Singapore: Springer Singapore, 2019, pp. 175–209. doi: 10.1007/978-981-13-2484-0_5.
- J. Jin and X. Ma, “A group-based traffic signal control with adaptive learning ability,” Eng Appl Artif Intell, vol. 65, pp. 282–293, Oct. 2017, doi: 10.1016/j.engappai.2017.07.022.
- J.-Z. Yuan, H. Chen, B. Zhao, and Y. Xu, “Estimation of Vehicle Pose and Position with Monocular Camera at Urban Road Intersections,” J Comput Sci Technol, vol. 32, no. 6, pp. 1150–1161, Nov. 2017, doi: 10.1007/s11390-017-1790-3.
- C. Ma, W. Hao, A. Wang, and H. Zhao, “Developing a Coordinated Signal Control System for Urban Ring Road Under the Vehicle-Infrastructure Connected Environment,” IEEE Access, vol. 6, pp. 52471–52478, 2018, doi: 10.1109/ACCESS.2018.2869890.
- S. Zhang, R. Benenson, M. Omran, J. Hosang, and B. Schiele, “Towards Reaching Human Performance in Pedestrian Detection,” IEEE Trans Pattern Anal Mach Intell, vol. 40, no. 4, pp. 973–986, Apr. 2018, doi: 10.1109/TPAMI.2017.2700460.
- J. Li, X. Liang, S. Shen, T. Xu, J. Feng, and S. Yan, “Scale-aware Fast R-CNN for Pedestrian Detection,” IEEE Trans Multimedia, pp. 1–1, 2017, doi: 10.1109/TMM.2017.2759508.
- B. Ma, S. Lakshmanan, and A. O. Hero, “Simultaneous detection of lane and pavement boundaries using model-based multisensor fusion,” IEEE Transactions on Intelligent Transportation Systems, vol. 1, no. 3, pp. 135–147, 2000, doi: 10.1109/6979.892150.
- J. Sparbert, K. Dietmayer, and D. Streller, “Lane detection and street type classification using laser range images,” in ITSC 2001. 2001 IEEE Intelligent Transportation Systems. Proceedings (Cat. No.01TH8585), pp. 454–459. doi: 10.1109/ITSC.2001.948700.
- M. Bertozzi and A. Broggi, “GOLD: a parallel real-time stereo vision system for generic obstacle and lane detection,” IEEE Transactions on Image Processing, vol. 7, no. 1, pp. 62–81, 1998, doi: 10.1109/83.650851.
- Y. Wang, E. K. Teoh, and D. Shen, “Lane detection and tracking using B-Snake,” Image Vis Comput, vol. 22, no. 4, pp. 269–280, Apr. 2004, doi: 10.1016/j.imavis.2003.10.003.
- Luo-Wei Tsai, Jun-Wei Hsieh, Chi-Hung Chuang, and Kuo-Chin Fan, “Lane detection using directional random walks,” in 2008 IEEE Intelligent Vehicles Symposium, Jun. 2008, pp. 303–306. doi: 10.1109/IVS.2008.4621271.
- Q. Li, N. Zheng, and H. Cheng, “Springrobot: A Prototype Autonomous Vehicle and Its Algorithms for Lane Detection,” IEEE Transactions on Intelligent Transportation Systems, vol. 5, no. 4, pp. 300–308, Dec. 2004, doi: 10.1109/TITS.2004.838220.
- Y. Wang, E. K. Teoh, and D. Shen, “Lane detection and tracking using B-Snake,” Image Vis Comput, vol. 22, no. 4, pp. 269–280, Apr. 2004, doi: 10.1016/j.imavis.2003.10.003.
- J. M. Alvarez, T. Gevers, Y. LeCun, and A. M. Lopez, “Road Scene Segmentation from a Single Image,” 2012, pp. 376–389. doi: 10.1007/978-3-642-33786-4_28.
- G. L. Oliveira, W. Burgard, and T. Brox, “Efficient deep models for monocular road segmentation,” in 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Oct. 2016, pp. 4885–4891. doi: 10.1109/IROS.2016.7759717.
- H. Liu, X. Han, X. Li, Y. Yao, P. Huang, and Z. Tang, “Deep representation learning for road detection using Siamese network,” Multimed Tools Appl, vol. 78, no. 17, pp. 24269–24283, Sep. 2019, doi: 10.1007/s11042-018-6986-1.
- C.-A. Brust, S. Sickert, M. Simon, E. Rodner, and J. Denzler, “Convolutional Patch Networks with Spatial Prior for Road Detection and Urban Scene Understanding,” Feb. 2015.
- D. T. Nguyen, W. Li, and P. O. Ogunbona, “Human detection from images and videos: A survey,” Pattern Recognit, vol. 51, pp. 148–175, Mar. 2016, doi: 10.1016/j.patcog.2015.08.027.
- Y. Kim and T. Moon, “Human Detection and Activity Classification Based on Micro-Doppler Signatures Using Deep Convolutional Neural Networks,” IEEE Geoscience and Remote Sensing Letters, vol. 13, no. 1, pp. 8–12, Jan. 2016, doi: 10.1109/LGRS.2015.2491329.
- W. Ouyang and X. Wang, “Joint deep learning for pedestrian detection,” Proceedings of the IEEE International Conference on Computer Vision, pp. 2056–2063, 2013, doi: 10.1109/ICCV.2013.257.
- I. Goodfellow, Y. Bengio, and A. Courville, Deep learning. MIT Press, 2016.
- S. Srinivas, R. K. Sarvadevabhatla, K. R. Mopuri, N. Prabhu, S. S. S. Kruthiventi, and R. V. Babu, “A Taxonomy of Deep Convolutional Neural Nets for Computer Vision,” Front Robot AI, Jan. 2016, doi: 10.3389/frobt.2015.00036.
- V. Nair and G. E. Hinton, “Rectified Linear Units Improve Restricted Boltzmann Machines,” in Proceedings of the 27th International Conference on International Conference on Machine Learning, 2010, pp. 807–814.
- N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, “Dropout: A Simple Way to Prevent Neural Networks from Overfitting,” Journal of Machine Learning Research, vol. 15, pp. 1929–1958, 2014, doi: 10.1214/12-AOS1000.
- R. Padilla, S. L. Netto, and E. A. B. da Silva, “A Survey on Performance Metrics for Object-Detection Algorithms,” in 2020 International Conference on Systems, Signals and Image Processing (IWSSIP), Jul. 2020, pp. 237–242. doi: 10.1109/IWSSIP48289.2020.9145130.
- J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, “You Only Look Once: Unified, Real-Time Object Detection,” in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 2016, pp. 779–788. doi: 10.1109/CVPR.2016.91.
- Y. Tian, G. Yang, Z. Wang, H. Wang, E. Li, and Z. Liang, “Apple detection during different growth stages in orchards using the improved YOLO-V3 model,” Comput Electron Agric, vol. 157, pp. 417–426, Feb. 2019, doi: 10.1016/j.compag.2019.01.012.
- V. Badrinarayanan, A. Kendall, and R. Cipolla, “SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation,” IEEE Trans Pattern Anal Mach Intell, vol. 39, no. 12, pp. 2481–2495, Dec. 2017, doi: 10.1109/TPAMI.2016.2644615.
- M. Everingham, L. Van~Gool, C. K. I. Williams, J. Winn, and A. Zisserman, “The Pascal Visual Object Classes Challenge 2012 (VOC2012) Results.”
- K. Simonyan and A. Zisserman, “Very Deep Convolutional Networks for Large-Scale Image Recognition,” Sep. 2015.
- X. Song, T. Rui, S. Zhang, J. Fei, and X. Wang, “The Road Segmentation Method Based on the Deep Auto-Encoder with Supervised Learning,” Computers & Electrical Engineering, vol. 68, pp. 381–388, 2018, doi: 10.1007/978-3-319-69877-9_28.
- J. Liu, B. Liu, and H. Lu, “Detection guided deconvolutional network for hierarchical feature learning,” Pattern Recognit, vol. 48, no. 8, pp. 2645–2655, Aug. 2015, doi: 10.1016/j.patcog.2015.02.002.
- J. Long, E. Shelhamer, and T. Darrell, “Fully convolutional networks for semantic segmentation,” in 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 2015, pp. 3431–3440. doi: 10.1109/CVPR.2015.7298965.