Machine Learning Based Computer Vision Application for Visually Disabled People

Shubhada Mone; Nihar Salunke; Omkar Jadhav; Arjun Barge; Nikhil Magar

doi:10.32628/CSEIT2173130

Authors

Shubhada Mone Faculty at Department of Computer Engineering, SPPU, India
Nihar Salunke Department of Computer Engineering, SPPU, India
Omkar Jadhav
Arjun Barge
Nikhil Magar

DOI:

https://doi.org/10.32628/CSEIT2173130

Keywords:

Machine Learning, Computer Vision, Mobile Application Development, Cloud Computing

Abstract

With the easy availability of technology, smartphones are playing an important role in every person’s life. Also, with the advancements in computer vision based research, Automatic Driving cars, Object Recognition, Depth Map Prediction, Object Distance Estimation, have reached commendable levels of intelligence and accuracy. Combining the research and technological advancements, we can be hopeful in creating a computer vision based mobile-application which will help guide visually disabled people in performing their day to day tasks with easily available mobile applications. With our study, the visually disabled can perform simple tasks like outdoor/indoor navigation without encountering obstacles, also they can avoid accidental collisions with objects in their surroundings. Currently, there are very few applications which provide the same assistance to the visually impaired. Using physical tools like sticks is a very common practice when it comes to avoiding obstacles in a visually disabled person’s path. Our study will be focused on object detection and depth estimation techniques- two of the most popular and advanced fields in Intelligent Computer vision studies. We have explored more on the traditional challenges and future hopes of incorporating these techniques on embedded devices.

References

C. Godard, O. M. Aodha, and G. J. Brostow. “Digging into self-supervised monocular depth estimation 2018, arXiv:1806.01260. Online]. Available: https://arxiv.org/abs/1806.01260
G. Lian, "Pedestrian detection using quaternion histograms of oriented gradients," 2020 IEEE International Conference on Power, Intelligent Computing and Systems (ICPICS), 2020, pp. 415-419, doi: 10.1109/ICPICS50287.2020.9202071.
A. Womg, M. J. Shafiee, F. Li and B. Chwyl, "Tiny SSD: A Tiny Single-Shot Detection Deep Convolutional Neural Network for Real-Time Embedded Object Detection," 2018 15th Conference on Computer and Robot Vision (CRV), 2018, pp. 95-101, doi: 10.1109/CRV.2018.00023.
W. Lan, J. Dang, Y. Wang and S. Wang, "Pedestrian Detection Based on YOLO Network Model," 2018 IEEE International Conference on Mechatronics and Automation (ICMA), 2018, pp. 1547-1551, doi: 10.1109/ICMA.2018.8484698.
Q. Zhao, T. Sheng, Y. Wang, F. Ni, and L. Cai, “CFENet: An accurate and efficient single-shot object detector for autonomous driving,” CoRR, arXiv:1806.09790, 2018.
Zhiqiang Long Dongbing Gu Ruiho Li, Sen Wang. Deepvo: Monocular visual odometry through unsupervised deep learning. In IEEE International Conference on Robotics and Au-tomation (ICRA), 2018.
R. Zhang, F. Zhu, J. Liu and G. Liu, "Depth-Wise Separable Convolutions and Multi-Level Pooling for an Efficient Spatial CNN-Based Steganalysis," in IEEE Transactions on Information Forensics and Security, vol. 15, pp. 1138-1150, 2020, doi: 10.1109/TIFS.2019.2936913.
J. Zhu and Y. Fang, "Learning Object-Specific Distance From a Monocular Image," 2019 IEEE/CVF International Conference on Computer Vision (ICCV), 2019, pp. 3838-3847, doi: 10.1109/ICCV.2019.00394.
Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun, “Deep Residual Learning for Image Recognition” Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp. 770-778.
D. Wofk, F. Ma, T. -J. Yang, S. Karaman and V. Sze, "FastDepth: Fast Monocular Depth Estimation on Embedded Systems," 2019 International Conference on Robotics and Automation (ICRA), 2019, pp. 6101-6108, doi: 10.1109/ICRA.2019.8794182.
Yu-Chen Chiu, Chi-Yi Tsai, Mind-Da Ruan;Guan-Yu Shen;Tsu-Tian Lee, “Mobilenet-SSDv2: An Improved Object Detection Model for Embedded Systems.(Object Detection)” 2020 International Conference on System Science and Engineering (ICSSE) .
Wang, B., Fremont, V., & Rodriguez, S. A. (2014). “Color-based road detection and its evaluation on the KITTI road benchmark.” 2014 IEEE Intelligent Vehicles Symposium Proceedings. doi:10.1109/ivs.2014.6856619 (kitti dataset)
Ming, A., Wu, T., Ma, J., Sun, F., & Zhou, Y. (2016). “Monocular Depth-Ordering Reasoning with Occlusion Edge Detection and Couple Layers Inference”. IEEE Intelligent Systems, 31(2), 54–65. doi:10.1109/mis.2015.94
Y. -C. Chiu, C. -Y. Tsai, M. -D. Ruan, G. -Y. Shen and T. -T. Lee, "Mobilenet-SSDv2: An Improved Object Detection Model for Embedded Systems," 2020 International Conference on System Science and Engineering (ICSSE), 2020, pp. 1-5, doi: 10.1109/ICSSE50014.2020.9219319.
S. Zhang, C. Wang and S. C. Chan, "A new high resolution depth map estimation system using stereo vision and depth sensing device," 2013 IEEE 9th International Colloquium on Signal Processing and its Applications, 2013, pp. 49-53, doi: 10.1109/CSPA.2013.6530012.
Silberman N., Hoiem D., Kohli P., Fergus R. (2012) Indoor Segmentation and Support Inference from RGBD Images. In: Fitzgibbon A., Lazebnik S., Perona P., Sato Y., Schmid C. (eds) Computer Vision – ECCV 2012. ECCV 2012. Lecture Notes in Computer Science, vol 7576. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33715-4_54
Z. Xiao, B. Dai, T. Wu, L. Xiao and T. Chen, "Dense Scene Flow Based Coarse-to-Fine Rigid Moving Object Detection for Autonomous Vehicle," in IEEE Access, vol. 5, pp. 23492-23501, 2017, doi: 10.1109/ACCESS.2017.2764546.

Machine Learning Based Computer Vision Application for Visually Disabled People

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

How to Cite