Automotive 3-Microphone Noise Canceller in a Frequently Moving Noise Source Environment
Authors: Z. Qi, T. J. Moir
Abstract:
A combined three-microphone voice activity detector (VAD) and noise-cancelling system is studied as a means of improving speech recognition in an automobile environment. A previous experiment showed that the composite system can cancel a single noise source located outside a defined zone. This paper investigates the performance of the composite system when the noise sources move frequently, that is, when noise arrives from different locations but the sources are not always present at the same time, e.g. speech from another passenger or from a radio while the desired speech is present. To work in such an environment, the three-microphone VAD detects voice from a “VAD valid zone”, while the three-microphone noise canceller uses a “noise canceller valid zone” defined in free space around the user's head. The desired voice should therefore lie in the intersection of the noise canceller valid zone and the VAD valid zone, and all noise outside this intersection is suppressed. Experimental results are reported for a real environment: all recordings were made in a car using omnidirectional electret condenser microphones.
Keywords: Signal processing, voice activity detection, noise canceller, microphone array beamforming.
Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1079708
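The zone-intersection idea in the abstract can be illustrated with a minimal sketch. The abstract does not say how either valid zone is represented, so everything here is an assumption made for illustration only: each zone is modelled as a pair of allowed inter-microphone delay ranges (in samples) for microphone pairs 1-2 and 2-3, and the names `Zone`, `contains`, and `desired_speech_present` are hypothetical, not taken from the paper.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Zone:
    """Hypothetical valid zone: allowed delay ranges (in samples) per mic pair."""
    d12: Tuple[float, float]  # allowed delay range between microphones 1 and 2
    d23: Tuple[float, float]  # allowed delay range between microphones 2 and 3

    def contains(self, delay12: float, delay23: float) -> bool:
        # A source is inside the zone only if both delays fall within range.
        return (self.d12[0] <= delay12 <= self.d12[1]
                and self.d23[0] <= delay23 <= self.d23[1])


def desired_speech_present(vad_zone: Zone, nc_zone: Zone,
                           delay12: float, delay23: float) -> bool:
    """True only when the source lies in the intersection of both zones."""
    return (vad_zone.contains(delay12, delay23)
            and nc_zone.contains(delay12, delay23))


# Illustrative zones (made-up numbers, not measured values).
vad_zone = Zone(d12=(-1.0, 1.0), d23=(-1.0, 1.0))
nc_zone = Zone(d12=(0.0, 2.0), d23=(-2.0, 0.5))

in_both = desired_speech_present(vad_zone, nc_zone, 0.5, 0.0)
vad_only = desired_speech_present(vad_zone, nc_zone, -0.5, 0.0)
nc_only = desired_speech_present(vad_zone, nc_zone, 1.5, 0.0)
```

Under these assumed zones, only the first source is treated as desired speech; a source that falls inside just one of the two zones is classed as noise to be suppressed.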