Blind Leap Real-Time Object Recognition with results converted to Audio for Blind People

Authors

  • Dr. Jaya R  Department of CSE, New Horizon College of Engineering, Bangalore-560103 Karnataka, India

Keywords:

Object Recognition, Object detectionYOLO, Raspberry Pi, Unity.

Abstract

This project tries to change the visual world into the audio world. It has the likelihood to inform blind people about the objects as well as their spatial locations. The objects that are detected at the scene are represented by their names and are then transformed to speech. Their spatial locations are encoded into the 2-channel audio with the help of 3D binaural sound simulation. The system is collected of various modules. The video is captured by a portable camera device (Raspberry Pi with Noir Camera) on the client side. It is then streamed to the server for real-time Object recognition with existing object detection models (YOLO). The 3D location of the objects is determined by the location and the size of the bounding boxes using the detection algorithm. A 3D sound generation application, built on Unity game engine then renders the binaural sound keeping the locations encoded. The transmission of the sound to the user happens with Bluetooth/3.5 jack earphones. The sound is played at an interval of a few seconds or when the recognized object differs from the last one - depends which one is the earliest.

References

  1. David Brown, Tom Macpherson, and Jamie Ward,seeing with sound? exploring different characteristics of a visual-to-auditory sensory substitution device. Perception, 40(9):1120–1135, 2011.
  2. Ross Girshick. Fast r-cnn. In Proceedings of the IEEE International Conference on Computer Vision, pages 1440–1448, 2015.
  3. Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. Faster r-cnn: Towards real-time object detection with region proposal networks. In Advances in Neural Information Processing Systems, pages 91–99, 2015.
  4. Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi. You only look once: Unified, real-time object detection. arXiv preprint arXiv:1506.02640, 2015.
  5. Unity3D:https://docs.unity3d.com/560/Documentation/Manual/AudioSpatializerSDK.html

Downloads

Published

2019-12-30

Issue

Section

Research Articles

How to Cite

[1]
Dr. Jaya R, " Blind Leap Real-Time Object Recognition with results converted to Audio for Blind People" International Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 4, Issue 9, pp.380-383, November-December-2019.