Gender Identification Via Voice Analysis

Authors

  • Shivangee Kushwah  Student, MPSTME, NMIMS Shirpur, Maharashtra, India
  • Shantanu Singh  Student, MPSTME, NMIMS Shirpur, Maharashtra, India
  • Kshitij Vats  Student, MPSTME, NMIMS Shirpur, Maharashtra, India
  • Mrs Varsha Nemade  Assistant Professor, MPSTME, NMIMS Shirpur, Maharashtram India

DOI:

https://doi.org//10.32628/CSEIT1952188

Keywords:

Voice Analysis, Random Forest, WarbleR, CART Model, Gender Identification, Machine Learning, Voice Analytics, Logistic Regression, Regression Tree, SVM, XGBoost, Random Forest

Abstract

Human voice is basically sound which is made by humans from their vocal tracts. Voice is made of different constituents and has various characteristics such as frequency, amplitude etc. These characteristics are produced by combination of vocal folds and articulations. This paper reflects development of a system using these characteristics which altogether are called acoustic parameters to detect the gender of the speaker. We have used four models to classify the genders namely CART, XGBoost, SVM and Random Forest. An ensemble of all the models is also used to make the entire system more accurate. This system can be used as a building block for many other softwares where it will take the first step to extract the acoustic parameters and detect the gender of the speaker.

References

  1. Hassam Ulla Sheikh, University of Manchester, "WHO IS SPEAKING? MALE OR FEMALE"
  2. J. Bishop, & P. Keating, "Perception of pitch location within a speaker’s range: Fundamental Frequency, voice quality and speaker sex", in The Journal of the Acoustical Society of America, vol. 132-2, pp.1100-1112, 2012.
  3. R. Vergin, A. Farhat, & D. O’Shaughnessy, "Robust gender dependent acoustic-phonetic modeling in continuous speech recognition based on a new automatic male/female classification", in Spoken Language, vol. 2, pp.1081-1084, 1996.
  4. WarbleR Documentation, https://cran.r-project.org/web/packages/warbleR
  5. Erwan Pépiot, HAL Archives ,"Voice, speech and gender: male-female acoustic differences and cross-language variation in English and French speakers",2013
  6. Vijayalakshmi A, Midhun Jimmy, Moksha Nair , "A study on Automated Speech Recognition Technique", IJARCET, 2015
  7. Anjali Pahwa, Gaurav Aggarwal,"Speech Feature Extraction for Gender Recognition", MECS, 2016
  8. J Benesty, J Chen, Y Huang, I Cohen "Pearson Correlation Coefficient", Springer 2009
  9. Gelfer, M. P., & Mikos, V. A., "The Relative Contributions of Speaking Fundamental Frequency and Formant Frequencies to Gender Identification Based on Isolated Vowels", Elsevier , 2005
  10. Asuero, A. G., Sayago, A., & González, A. G., "The Correlation Coefficient: An Overview", University of Seville, 2006
  11. Peng, C.-Y. J., Lee, K. L., & Ingersoll, G. M., " An Introduction to Logistic Regression Analysis and Reporting", Indiana University , 2002
  12. Efron B., "Logistic Regression, Survival Analysis, and the Kaplan-Meier Curve" , JASA , 1988
  13. De’ath, G., & Fabricius, K. E., "CLASSIFICATION AND REGRESSION TREES: A POWERFUL YET SIMPLE TECHNIQUE" , 2000
  14. LEO BREIMAN, "Random Forests", Machine Learning (Springer), 2001
  15. Mark R. Segal, "Machine Learning Benchmarks and Random Forest Regression", University Of California , 2003
  16. CHRISTOPHER J.C. BURGES, "A Tutorial on Support Vector Machines for Pattern Recognition", Springer, 1998
  17. Gunn S.R., "Support Vector Machines for Classification and Regression", University Of Southampton, 1998
  18. Chen, T., & Guestrin, C., "XGBoost: A Scalable Tree Boosting System" , 2016
  19. Mitchell R., Frank E., "Accelerating the XGBoost algorithm using GPU computing" , PeerJ , 2017
  20. Diettrich T.G., "Ensemble Methods in Machine Learning", 2000
  21. Erokyar H., "Age and Gender Recognition for Speech Applications based on Support Vector Machines", 2014

Downloads

Published

2019-04-30

Issue

Section

Research Articles

How to Cite

[1]
Shivangee Kushwah, Shantanu Singh, Kshitij Vats, Mrs Varsha Nemade, " Gender Identification Via Voice Analysis, IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 5, Issue 2, pp.746-753, March-April-2019. Available at doi : https://doi.org/10.32628/CSEIT1952188