PCA-ICA Based Acoustic Ambient Extraction

Authors

  • G. Rajitha  M.Tech (Embedded Systems), Department of ECE, SVCE, Tirupati, Andhra Pradesh, India
  • K. Upendra Raju  Assistant Professor, Department of ECE, SVCE, Tirupati, Andhra Pradesh, India

Keywords:

Primary Ambient Extraction (PAE), Ambient Phase, Spatial Audio, Sparsity, Principal Component analysis (PCA), Independent Component Analysis (ICA), Frequency Domain.

Abstract

Primary-ambient extraction (PAE) has been playing an important role in spatial audio analysis-synthesis. Based on the spatial features, PAE decomposes a signal into primary and ambient components, which are then rendered separately. PAE is performed in sub band domain for complex input signals having multiple point-like sound sources. However, the performance of PAE approaches and their key influences for such signals have not been well-studied so far. In this paper, we conducted a study on frequency-domain PAE using principal component analysis (PCA) and independent component analysis (ICA) in the case of multiple sources. We found that the partitioning of the frequency bins is very critical in PAE. Simulation results reveal that the proposed top-down adaptive partitioning method achieves superior performance as compared to the conventional partitioning methods.

References

  1. C. Avendano and J. M. Jot, "A frequency-domain approach to multichannel upmix," J. Audio Eng. Soc., vol. 52, no. 7/8, pp. 740-749, Jul./Aug. 2004.
  2. M. M. Goodwin and J. M. Jot, "Primary-ambient signal decomposition and vector-based localization for spatial audio coding and enhancement," in Proc. ICASSP, Hawaii, 2007, pp. 9-12.
  3. J. Merimaa, M. M. Goodwin, J. M. Jot, "Correlation-based ambience extraction from stereo recordings," in 123rd Audio Eng. Soc. Conv., New York, Oct. 2007.
  4. J. He, E. L. Tan, and W. S. Gan, "Linear estimation based primary-ambient extraction for stereo audio signals," IEEE/ACM Trans. Audio, Speech,Lang. Process., vol. 22, no. 2, pp. 505-517, Feb. 2014.
  5. M. Plumbley, T. Blumensath, L. Daudet, R. Gribonval, and M. E. Davies,"Sparse representation in audio and music: from coding to source separation," Proc. IEEE, vol. 98, no. 6, pp. 995-1016, Jun. 2010.
  6. P. J. V. Laarhoven, and E. H. Aarts, Simulated annealing, Netherlands:Springer, 1987.
  7. G. Kendall, "The decorrelation of audio signals and its impact on spatial imagery," Computer Music Journal, vol. 19, no. 4, pp. 71-87, 1995.
  8. J. He. (2014 Feb 24). Ambient phase estimation APE[online].Available:http://jhe007.wix.com/main#!ambient-phase-estimation/cied
  9. C. Faller, "Multiple-loudspeaker playback of stereo signals," J. Audio Eng. Soc., vol. 54, no. 11, pp. 1051–1064, Nov. 2006.
  10. Dolby Atmos-Next Generation Audio for Cinema (White Paper). 2013. Available online: http://www.dolby.com/uploadedFiles/Assets/US/Doc/Professional/Dolby-Atmos-Next-Generation-Audio-for-Cinema.pdf
  11. I. Jollife. Principal Component Analysis. Springer series in statistics, 2 ed. 2002.
  12. A. Hyvärinen, J. Karhunen, E. Oja. Independent Component Analysis, New York: Wiley, 2001. ISBN 978-0-471-40540-5.
  13. C. Avendano and J. M. Jot, "A frequency-domain approach to multichannel upmix," J. Audio Eng. Soc., vol. 52, no. 7/8, pp. 740-749, Jul./Aug. 2004.
  14. C. Faller, "Multiple-loudspeaker playback of stereo signals," J. Audio Eng. Soc., vol. 54, no. 11, pp. 1051-1064, Nov. 2006.

Downloads

Published

2017-12-31

Issue

Section

Research Articles

How to Cite

[1]
G. Rajitha, K. Upendra Raju, " PCA-ICA Based Acoustic Ambient Extraction , IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 3, Issue 1, pp.51-59, January-February-2018.