Performance Evaluation of Speech Emotion Recognition with Conventional Neural Network

Sana Fatema N. Ali; Prof. S. T. Khandare; Prof. S. Y. Amdani

doi:10.32628/CSEIT2390538

Authors

Sana Fatema N. Ali ME Scholar, Babasaheb Naik College of Engineering, Pusad, Maharashtra, India
Prof. S. T. Khandare Associate Professor, Babasaheb Naik College of Engineering, Pusad, Maharashtra, India
Prof. S. Y. Amdani Associate Professor, Babasaheb Naik College of Engineering, Pusad, Maharashtra, India

Keywords:

Speech emotion, Deep learning, CNN, MATLAB

Abstract

The realm of speech emotion recognition presents a formidable challenge, offering valuable insights into the emotional states of speakers and facilitating enhanced human-machine interactions. However, in various scenarios, particularly those involving resource-constrained environments like embedded systems, the need arises to discern emotions in speech while grappling with limited computing and memory resources. While some prior research has shown promising recognition rates through transfer learning techniques utilising popular models such as Alex Net, a significant hindrance remains their substantial model size, rendering them impractical for execution on embedded systems. In response to this challenge, we present an innovative solution: a compact deep convolutional neural network architecture tailored to address the demands of resource-constrained environments.

References

X. Xu, J. Deng, E. Coutinho, C. Wu, and L. Zhao, “Connecting Subspace Learning and Extreme Learning Machine in Speech Emotion Recognition,” IEEE, vol. XX, no. XX, pp. 1–13, 2018.
Z. Huang, J. Epps, D. Joachim, and V. Sethu, “Natural Language Processing Methods for Acoustic and Landmark Event-based Features in Speech-based Depression Detection,” IEEE J. Sel. Top. Signal Process. Vol. PP, no. c, p. 1, 2019.
P. S. Member, “Transfer Linear Subspace Learning for Cross-corpus Speech Emotion Recognition,” vol. X, no. X, pp. 1–12, 2017.
J. Deng, X. Xu, Z. Zhang, and S. Member, “Semi-Supervised Auto encoders for Speech Emotion Recognition,” vol. XX, no. XX, pp. 1–13, 2017.
Y. Qin, S. Member, T. Lee, A. Pak, and H. Kong, “Automatic Assessment of Speech Impairment in Cantonese-speaking People with Aphasia,” IEEE J. Sel. Top. Signal Process. Vol. PP, no. c, p. 1, 2019.
M. D. Zeiler et al., “ON RECTIFIED LINEAR UNITS FOR SPEECH PROCESSING New York University, USA Google Inc., USA University of Toronto , Canada,” pp. 3–7.
Yelin Kim and Emily Mower Provost, Data driven framework to explore patterns (timings and durations) of emotion evidence, specific to individual emotion classes; University of Michigan Electrical Engineering and Computer Science, Ann Arbor, Michigan, USA;2020.
A. Yao, D. Cai, P. Hu, S. Wang, L. Shan, and Y. Chen; HoloNet: towards robust emotion recognition in the wild, 2021
Y. Fan, X. Lu, D. Li, and Y. Liu.Video-based Emotion Recognition Using CNN-RNN and C3D Hybrid Networks. Proceedings of ICMI 2016 Proceedings of the 18th ACM International Conference on Multimodal Interaction, Pages 445-450,Tokyo, Japan — November 12 - 16, 2019.
Zixing Zhang, Fabien Ringeval, Fabien Ringeval, Eduardo Coutinho, Erik Marchi and Björn Schüller, Semi-Supervised Learning (SSL) technique.
Wei-Long Zheng1 and Bao-Liang Lu, Personalizing EEG-Based Affective Models with Transfer Learning, Centre for Brain-like Computing and Machine Intelligence, Department of Computer Science and Engineering, Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Brain Science and Technology Research Centre, Shanghai Jiao Tong University, Shanghai, China. 2018.
Thiang and Suryo Wijoyo, “Speech Recognition Using Linear Predictive Coding and Artificial Neural Network for Controlling Movement of Mobile Robots”, in Proceedings of International Conference on Information and Electronics Engineering (IPCSIT).
Ms.Vimala.C and Dr.V.Radha, “Speaker Independent Isolated Speech Recognition System for Tamil Language using HMM”, in Proceedings International Conference on Communication Technology and System Design 2020, Procedia Engineering 30 ISSN: 1877-7058, 13March 2020, pp.1097 – 1102.
Cini Kuriana, Kannan Balakrishnan, “Development & evaluation of different acoustic models for Malayalam continuous speech recognition”, in Proceedings of International Conference on Communication Technology and System Design 2020 Published by Elsevier Ltd, December 2020, pp.1081-1088
Suma Swamy, K.V Ramakrishnan, “An Efficient Speech Recognition System”, Computer Science & Engineering: An International Journal (CSEIJ), Vol.3,No.4,DOI:10.512 1/cseij.2019.3403 August 2021, pp.21-27
Annu Chaudhary, Mr. R.S. Chauhan, Mr. Gautam Gupta, “Automatic Speech Recognition System for Isolated & Connected Words of Hindi Language By Using Hidden Markov Model Toolkit (HTK)”, in Proceedings of International Conference on Emerging Trends in Engineering and Technology, DOI: 03.AETS.2013.3.234, 22-24th February 2020, pp.244– 252.
Preeti Saini, Parneet Kaur, Mohit Dua, “Hindi Automatic Speech Recognition Using HTK”, International Journal of Engineering Trends and Technology (IJETT)”, Vol.4, Issue 6, ISSN: 2231- 5381, June 2020, pp.2223-2229.
Akkas Ali, Manwar Hossain, Mohammad Nuruzzaman Bhuiyan, “Automatic Speech Recognition Technique for Bangla Words”, International Journal of Advanced Science and Technology, Vol. 50, January, 2020, pp.51-60
Maya Money Kumar, Elizabeth Sherly, Win Sam Varghese, “Malayalam Word Identification for Speech Recognition System” An International Journal of Engineering Sciences, Special IssueiDravadian , Vol. 15 ISSN: 2229-6913 (Print), December 2021, pp. 22-26.
Jitendra Singh Pokhariya and Dr. Sanjay Mathur, “Sanskrit Speech Recognition using Hidden Markov Model Toolkit”, International Journal of Engineering Research & Technology (IJERT),Vol.3, Issue 10, ISSN: 2278-0181, October-2020, pp.93-98
Geeta Nijhawan and Dr. M.K Soni, “Real Time Speaker Recognition System for Hindi Words”, International Journal of Information Engineering and Electronic Business, Vol. 6, DOI: 10.5815/ijieeb.2019 .02.04, April 2020, pp. 35-40

Performance Evaluation of Speech Emotion Recognition with Conventional Neural Network

Authors

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

How to Cite