Gender Identification Via Voice Analysis

Shivangee Kushwah; Shantanu Singh; Kshitij Vats; Mrs Varsha Nemade

doi:10.32628/CSEIT1952188

Authors

Shivangee Kushwah Student, MPSTME, NMIMS Shirpur, Maharashtra, India
Shantanu Singh Student, MPSTME, NMIMS Shirpur, Maharashtra, India
Kshitij Vats Student, MPSTME, NMIMS Shirpur, Maharashtra, India
Mrs Varsha Nemade Assistant Professor, MPSTME, NMIMS Shirpur, Maharashtram India

DOI:

https://doi.org/10.32628/CSEIT1952188

Keywords:

Voice Analysis, Random Forest, WarbleR, CART Model, Gender Identification, Machine Learning, Voice Analytics, Logistic Regression, Regression Tree, SVM, XGBoost, Random Forest

Abstract

Human voice is basically sound which is made by humans from their vocal tracts. Voice is made of different constituents and has various characteristics such as frequency, amplitude etc. These characteristics are produced by combination of vocal folds and articulations. This paper reflects development of a system using these characteristics which altogether are called acoustic parameters to detect the gender of the speaker. We have used four models to classify the genders namely CART, XGBoost, SVM and Random Forest. An ensemble of all the models is also used to make the entire system more accurate. This system can be used as a building block for many other softwares where it will take the first step to extract the acoustic parameters and detect the gender of the speaker.

References

Hassam Ulla Sheikh, University of Manchester, "WHO IS SPEAKING? MALE OR FEMALE"
J. Bishop, & P. Keating, "Perception of pitch location within a speaker’s range: Fundamental Frequency, voice quality and speaker sex", in The Journal of the Acoustical Society of America, vol. 132-2, pp.1100-1112, 2012.
R. Vergin, A. Farhat, & D. O’Shaughnessy, "Robust gender dependent acoustic-phonetic modeling in continuous speech recognition based on a new automatic male/female classification", in Spoken Language, vol. 2, pp.1081-1084, 1996.
WarbleR Documentation, https://cran.r-project.org/web/packages/warbleR
Erwan Pépiot, HAL Archives ,"Voice, speech and gender: male-female acoustic differences and cross-language variation in English and French speakers",2013
Vijayalakshmi A, Midhun Jimmy, Moksha Nair , "A study on Automated Speech Recognition Technique", IJARCET, 2015
Anjali Pahwa, Gaurav Aggarwal,"Speech Feature Extraction for Gender Recognition", MECS, 2016
J Benesty, J Chen, Y Huang, I Cohen "Pearson Correlation Coeﬃcient", Springer 2009
Gelfer, M. P., & Mikos, V. A., "The Relative Contributions of Speaking Fundamental Frequency and Formant Frequencies to Gender Identiﬁcation Based on Isolated Vowels", Elsevier , 2005
Asuero, A. G., Sayago, A., & González, A. G., "The Correlation Coefficient: An Overview", University of Seville, 2006
Peng, C.-Y. J., Lee, K. L., & Ingersoll, G. M., " An Introduction to Logistic Regression Analysis and Reporting", Indiana University , 2002
Efron B., "Logistic Regression, Survival Analysis, and the Kaplan-Meier Curve" , JASA , 1988
De’ath, G., & Fabricius, K. E., "CLASSIFICATION AND REGRESSION TREES: A POWERFUL YET SIMPLE TECHNIQUE" , 2000
LEO BREIMAN, "Random Forests", Machine Learning (Springer), 2001
Mark R. Segal, "Machine Learning Benchmarks and Random Forest Regression", University Of California , 2003
CHRISTOPHER J.C. BURGES, "A Tutorial on Support Vector Machines for Pattern Recognition", Springer, 1998
Gunn S.R., "Support Vector Machines for Classiﬁcation and Regression", University Of Southampton, 1998
Chen, T., & Guestrin, C., "XGBoost: A Scalable Tree Boosting System" , 2016
Mitchell R., Frank E., "Accelerating the XGBoost algorithm using GPU computing" , PeerJ , 2017
Diettrich T.G., "Ensemble Methods in Machine Learning", 2000
Erokyar H., "Age and Gender Recognition for Speech Applications based on Support Vector Machines", 2014

Gender Identification Via Voice Analysis

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

How to Cite