Gender Identification Via Voice Analysis
DOI:
https://doi.org/10.32628/CSEIT1952188Keywords:
Voice Analysis, Random Forest, WarbleR, CART Model, Gender Identification, Machine Learning, Voice Analytics, Logistic Regression, Regression Tree, SVM, XGBoost, Random ForestAbstract
Human voice is basically sound which is made by humans from their vocal tracts. Voice is made of different constituents and has various characteristics such as frequency, amplitude etc. These characteristics are produced by combination of vocal folds and articulations. This paper reflects development of a system using these characteristics which altogether are called acoustic parameters to detect the gender of the speaker. We have used four models to classify the genders namely CART, XGBoost, SVM and Random Forest. An ensemble of all the models is also used to make the entire system more accurate. This system can be used as a building block for many other softwares where it will take the first step to extract the acoustic parameters and detect the gender of the speaker.
References
- Hassam Ulla Sheikh, University of Manchester, "WHO IS SPEAKING? MALE OR FEMALE"
- J. Bishop, & P. Keating, "Perception of pitch location within a speaker’s range: Fundamental Frequency, voice quality and speaker sex", in The Journal of the Acoustical Society of America, vol. 132-2, pp.1100-1112, 2012.
- R. Vergin, A. Farhat, & D. O’Shaughnessy, "Robust gender dependent acoustic-phonetic modeling in continuous speech recognition based on a new automatic male/female classification", in Spoken Language, vol. 2, pp.1081-1084, 1996.
- WarbleR Documentation, https://cran.r-project.org/web/packages/warbleR
- Erwan Pépiot, HAL Archives ,"Voice, speech and gender: male-female acoustic differences and cross-language variation in English and French speakers",2013
- Vijayalakshmi A, Midhun Jimmy, Moksha Nair , "A study on Automated Speech Recognition Technique", IJARCET, 2015
- Anjali Pahwa, Gaurav Aggarwal,"Speech Feature Extraction for Gender Recognition", MECS, 2016
- J Benesty, J Chen, Y Huang, I Cohen "Pearson Correlation Coefficient", Springer 2009
- Gelfer, M. P., & Mikos, V. A., "The Relative Contributions of Speaking Fundamental Frequency and Formant Frequencies to Gender Identification Based on Isolated Vowels", Elsevier , 2005
- Asuero, A. G., Sayago, A., & González, A. G., "The Correlation Coefficient: An Overview", University of Seville, 2006
- Peng, C.-Y. J., Lee, K. L., & Ingersoll, G. M., " An Introduction to Logistic Regression Analysis and Reporting", Indiana University , 2002
- Efron B., "Logistic Regression, Survival Analysis, and the Kaplan-Meier Curve" , JASA , 1988
- De’ath, G., & Fabricius, K. E., "CLASSIFICATION AND REGRESSION TREES: A POWERFUL YET SIMPLE TECHNIQUE" , 2000
- LEO BREIMAN, "Random Forests", Machine Learning (Springer), 2001
- Mark R. Segal, "Machine Learning Benchmarks and Random Forest Regression", University Of California , 2003
- CHRISTOPHER J.C. BURGES, "A Tutorial on Support Vector Machines for Pattern Recognition", Springer, 1998
- Gunn S.R., "Support Vector Machines for Classification and Regression", University Of Southampton, 1998
- Chen, T., & Guestrin, C., "XGBoost: A Scalable Tree Boosting System" , 2016
- Mitchell R., Frank E., "Accelerating the XGBoost algorithm using GPU computing" , PeerJ , 2017
- Diettrich T.G., "Ensemble Methods in Machine Learning", 2000
- Erokyar H., "Age and Gender Recognition for Speech Applications based on Support Vector Machines", 2014
Downloads
Published
Issue
Section
License
Copyright (c) IJSRCSEIT

This work is licensed under a Creative Commons Attribution 4.0 International License.