Transfer Learning-Based Recognition of Bhutanese Sign Language Digits Using Deep CNNs

Yonten Jamtsho; Sonam Wangmo

doi:10.32628/CSEIT2612133

Authors

Yonten Jamtsho, Gyalpozhing College of Information Technology, Royal University of Bhutan, Thimphu, Bhutan
Sonam Wangmo, Gyalpozhing College of Information Technology, Royal University of Bhutan, Thimphu, Bhutan

Keywords:

Bhutanese sign digit, convolutional neural network, MobileNet, transfer learning, ResNet50, VGG16

Abstract

The hearing- and speech-impaired community relies on hand gestures to communicate with the general public. However, most members of the general public have difficulty understanding sign digits, which creates a communication gap between the two groups. To help close this gap, this paper applies transfer learning with three pre-trained models (VGG16, ResNet50, and MobileNet) to the recognition of Bhutanese sign digits. Two datasets, the Bhutanese sign digit dataset and the Turkish sign digit dataset, were merged and used to train the models. The rationale for merging them is that both use the same gesture representation for each digit while contributing different image variations. The merged dataset was split into training and test sets at a ratio of 80:20. The VGG16 architecture outperformed the other two models, with training and testing accuracies of 96.72% and 95.85%, respectively. The trained model was integrated with the Django framework to create a web application for digit recognition.
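The pipeline summarized in the abstract (merge two datasets, split 80:20, fine-tune a pre-trained network) could be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the file lists, the per-class stratified split, the ten digit classes, the 224x224 input size, and the layers of the classification head are all hypothetical choices that the abstract does not specify. The function names `stratified_split` and `build_classifier` are likewise invented for this sketch.

```python
import random
from collections import defaultdict


def stratified_split(samples, test_fraction=0.2, seed=0):
    """Split (path, label) pairs into train/test lists, label by label.

    Assumption: the split is stratified per digit; the abstract only
    states an overall 80:20 ratio.
    """
    by_label = defaultdict(list)
    for path, label in samples:
        by_label[label].append((path, label))
    rng = random.Random(seed)
    train, test = [], []
    for label in sorted(by_label):
        items = by_label[label]
        rng.shuffle(items)
        n_test = round(len(items) * test_fraction)
        test.extend(items[:n_test])
        train.extend(items[n_test:])
    return train, test


def build_classifier(num_classes=10, input_shape=(224, 224, 3)):
    """Frozen ImageNet VGG16 base plus a small new head (assumed design)."""
    from tensorflow import keras  # lazy import; needs TensorFlow installed

    base = keras.applications.VGG16(
        weights="imagenet", include_top=False, input_shape=input_shape
    )
    base.trainable = False  # keep the pre-trained convolutional features fixed
    model = keras.Sequential([
        base,
        keras.layers.GlobalAveragePooling2D(),
        keras.layers.Dense(128, activation="relu"),
        keras.layers.Dense(num_classes, activation="softmax"),
    ])
    model.compile(
        optimizer="adam",
        loss="sparse_categorical_crossentropy",
        metrics=["accuracy"],
    )
    return model


# Hypothetical file lists standing in for the two merged datasets.
bhutanese = [(f"bhutanese/{d}/{i}.jpg", d) for d in range(10) for i in range(50)]
turkish = [(f"turkish/{d}/{i}.jpg", d) for d in range(10) for i in range(50)]
train_set, test_set = stratified_split(bhutanese + turkish)
```

Swapping `keras.applications.VGG16` for `ResNet50` or `MobileNet` in `build_classifier` would give the other two candidates named in the abstract; each has its own input preprocessing function in Keras, which this sketch leaves out.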
