Audio signal processing: A review of audio signal classification features

Authors

  • Mittal C. Darji  Information Technology Department, G H Patel College of Engineering & Technology, V.V.Nagar, Anand, Gujarat, India

Keywords:

Audio Features, Physical features, Perceptual features, Zero Crossing Rate, Short Time Energy, Spectral Centroid, Flux, Fundamental Frequency, Loudness, Pitch

Abstract

Digital audio processing applications are growing in popularity. They include audio data compression, summarization, speech recognition, speaker identification, speech and music separation, music genre classification, singer identification, gender detection, and many more. Feature selection is the crucial part of these applications. This paper reviews various audio signal features, with classification as the main purpose.

Published

2017-06-30

Section

Research Articles

How to Cite

[1]
Mittal C. Darji, "Audio signal processing: A review of audio signal classification features", International Journal of Scientific Research in Computer Science, Engineering and Information Technology (IJSRCSEIT), ISSN: 2456-3307, Volume 2, Issue 3, pp. 227-230, May-June 2017.