Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 30576
Time Delay Estimation Using Signal Envelopes for Synchronisation of Recordings

Authors: Sergei Aleinik, Mikhail Stolbov


In this work, a method of time delay estimation for  dual-channel acoustic signals (speech, music, etc.) recorded under  reverberant conditions is investigated. Standard methods based on  cross-correlation of the signals show poor results in cases involving  strong reverberation, large distances between microphones and  asynchronous recordings. Under similar conditions, a method based  on cross-correlation of temporal envelopes of the signals delivers a  delay estimation of acceptable quality. This method and its properties  are described and investigated in detail, including its limits of  applicability. The method’s optimal parameter estimation and a  comparison with other known methods of time delay estimation are  also provided.


Keywords: Signal Processing, cross-correlation, delay estimation, signal envelope

Digital Object Identifier (DOI):

Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 2647


[1] J. Chen, J. Benesty and Y.A. Huang, "Time Delay Estimation in Room Acoustic Environments,” EURASIP Journal on Advances in Signal Processing, vol. 2006, 2006, pp. 1-20.
[2] A. Sandmair, M. Lietz, J. Stefan, and F.P. Leon, "Time delay estimation in the time-frequency domain based on a line detection approach”, in Proc. ICASSP- International Conference on Acoustics, Speech, and Signal Processing, 2011, pp. 2716-2719.
[3] K. Gedalyahu and Y.C. Eldar. "Time-delay estimation from low-rate samples: A union of subspaces approach,” IEEE Transactions on Signal Processing, vol. 58, no. 6, 2010, pp. 3017 –3031.
[4] B. Kirkwood, "Acoustic Source Localization Using Time-Delay Estimation”, M.S. Thesis, 2003, project-ms.html
[5] A. Kozlov, O. Kudashev, Yu. Matveev, T. Pekhovsky, K. Simonchik, A. Shulipa."SVID Speaker Recognition System for NIST SRE 2012,”in. Proc. of 15th International Conference "Speech and Computer” (SPECOM 2013). Springer Lecture Notes in Computer Science, Lecture Notes in Artificial Intelligence, 2013, Vol. 8113, pp. 278-285.
[6] S. Bédard, B. Champagne and A. Stéphenne, "Effects of Room Reverberation on Time-Delay Estimation Performance,” IEEE Transactions Acoustics, Speech, and Signal Processing, vol.2, 1994, pp. 261-264.
[7] R. Raya, A. Frizera, R. Ceres, L. Calderón, E. Rocon, "Design and evaluation of a fast model-based algorithm for ultrasonic range measurements,” Sensors and Actuators A: Physical, vol. 148, No. 1, 2008, pp. 335–341.
[8] L. Yang, A.V. Lavrinenko, J.M. Hvam, and O. Sigmund, "Design of one-dimensional optical pulse-shaping filters by time-domain topology optimization,” Appl. Phys. Lett. 95, 2009, 261101.
[9] B.S. Lazarov, R. Matzen, and Y. Elesin, "Topology optimization of pulse shaping filters using the Hilbert transform envelope extraction,” Structural and Multidisciplinary Optimization, vol. 44, no. 3, pp. 409– 419, 2011.
[10] P. Ignatov, M. Stolbov, S. Aleinik, "Semi-automated technique for noisy recording enhancement using an independent reference recording,” in Proc. 46th International Conference of the Audio Engineering Society, 2012, pp. 57-65
[11] N. Thrane, J. Wismer, H. Konstantin-Hansen, and S. Gade, "Practical use of the Hilbert transform,” Application Note, Brüel&Kjær, Denmark. Available:
[12] C. Faller, C Tournery, "Estimating the delay and coloration effect of the acoustic echo path for low-complexity echo suppression,” inProc. Intl. Works. OnAcoust. Echo and Noise Control (IWAENC), The Netherlands, 2005, pp. 53-56.
[13] O.M.Bouzid, G. Y. Tian, J.Neasham, and B. Sharif, "Envelope and Wavelet Transform for Sound Localisation at Low Sampling Rates in Wireless Sensor Networks,” Journal of Sensors, vol. 2012, Article ID 680383, 9 pages.
[14] S. J. Orfanidis, Introduction to Signal Processing. Available:
[15] T. Hougast, H. J. M. Steeneken, "A review of the MTF concept in room acoustics and it’s use for estimating speech intelligibility in auditoria”, J. Acoust. Soc. Am. 67, 1985, pp. 1060-1077.