Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 30184
Enhanced Parallel-Connected Comb Filter Method for Multiple Pitch Estimation

Authors: Taro Matsuno, Yuta Otani, Ryo Tanaka, Kaori Ikezaki, Hitoshi Yamamoto, Masaru Fujieda, Yoshihisa Ishida

Abstract:

This paper presents an improvement method of the multiple pitch estimation algorithm using comb filters. Conventionally the pitch was estimated by using parallel -connected comb filters method (PCF). However, PCF has problems which often fail in the pitch estimation when there is the fundamental frequency of higher tone near harmonics of lower tone. Therefore the estimation is assigned to a wrong note when shared frequencies happen. This issue often occurs in estimating octave 3 or more. Proposed method, for solving the problem, estimates the pitch with every harmonic instead of every octave. As a result, our method reaches the accuracy of more than 80%.

Keywords: music transcription, pitch estimation, comb filter, fractional delay

Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1072096

Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 1075

References:


[1] T. Morita, M. Yamaguchi and Y. Tadokoro, "Pitch Estimation Method for Transcription Using Parallel-Connected Comb Filters and Noticing Their Output Values," Journal of IEICE, D-II, Vol.J87-D-II, No.12, pp.2271-2279, Dec. 2004.
[2] T. Saito, T. Matsui, H. Honda and Y. Tadokoro, "Real-time Realization of Scale Detection Based on Comb Filters Using DSPs," Journal of SICE, Vol. 34, pp.504-509, 1998.
[3] H. Kameoka, T. Nishimoto and S. Sagayama, "Separation of Harmonic Structures Based on Tied Gaussian Mixture Model and Information Criterion for Concurrent Sounds," ICASSP -04, pp.297-300, 2004.
[4] H. Kameoka, T. Nishimoto and S. Sagayama, "A Multipitch Analyzer Based on Harmonic Temporal Structured Clustering," IEEE Tras. On Audio, Speech, and Language Processing, Vol. 15, No.3, pp. 982-994, 2007.
[5] K. Miyamoto, H. Kameoka, T. Nishimoto, N. Ono and S. Sagayama, "Harmonic-Temporal-Timbral Clustering (HTTC) for the analysis of multi-instrument polyphonic music signal," ICASSP -08, pp. 113-116, 2008
[6] K. Kashino, T. Kinoshita, K. Nakadai and H. Tanaka, "Chord Recognition Mechanisms in the OPTIMA Processing Architecture for Music Scene Analysis," The transactions of the Institute of Electronics, Information and Communication Engineers, J79-D-2, 1996, 1762-1770 (in Japanese).
[7] M. Goto, "A predominant-F0 estimation method for CD recordings: MAP estimation using EM algorithm for adaptive tone models," Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing Salt Lake City, UT, 2001, 3365-3368
[8] Anssi P. Klapuri, "Multiple Fundamental Frequency Estimation Based on Harmonicity and Spectral Smoothness," IEEE Trans. on Speech and Audio Processing, Vol.11, No.6, 804-816, 2003.
[9] M. Goto and K. Hirata, "Recent Studies on Music Information Processing," Acoustical Science and Technology, 25(4), pp. 419-425, 2004.
[10] T. Matsuno, M. Fujieda and Y. Ishida, "Pitch Estimation Using Comb Filter with Fractional Delay," Proc. IEICI, Fundamentals of Electronics, Communications and Computer Sciences, 2009, 196 p., (in Japanese)
[11] Timo I. Laakso, V. Valimaki, M. Karjalainen and Unto K. Laine "Splitting the Unit Delay," IEEE, Signal Processing Magazine, pp. 30-60, 1996.