Advanced SearchSearch Tips
Conjoined Audio Fingerprint based on Interhash and Intra hash Algorithms
facebook(new window)  Pirnt(new window) E-mail(new window) Excel Download
 Title & Authors
Conjoined Audio Fingerprint based on Interhash and Intra hash Algorithms
Kim, Dae-Jin; Choi, Hong-Sub;
  PDF(new window)
In practice, the most important performance parameters for music information retrieval (MIR) service are robustness of fingerprint in real noise environments and recognition accuracy when the obtained query clips are matched with the an entry in the database. To satisfy these conditions, we proposed a conjoined fingerprint algorithm for use in massive MIR service. The conjoined fingerprint scheme uses interhash and intrahash algorithms to produce a robust fingerprint scheme in real noise environments. Because the interhash and intrahash algorithms are masked in the predominant pitch estimation, a compact fingerprint can be produced through their relationship. Experimental performance comparison results showed that our algorithms were superior to existing algorithms, i.e., the sub-mask and Philips algorithms, in real noise environments.
Music Information Retrieval;Conjoined Fingerprint;Interhash;Intrahash;
 Cited by
P. Cano, E. Batlle, T. Kalker, and J. Haitsma, “A Review of Audio Fingerprinting,” J. VLSI Signal Processing Systems for Signal Image Video Technology, vol. 41, no. 3, 2005, pp. 271-284. crossref(new window)

J. Haitsma and T. Kalker, “A Highly Robust Audio Fingerprinting System,” Proc. Of the 3rd Int. Symposium on Music Information Retrieval, 2002, pp. 144-148.

Mansoo Park, Hoi-Rin Kim, and Seung Hyun Yang, “Frequency Temporal Filtering for a Robust Audio Fingerprinting Scheme in Real-Noise Environments,” ETRI Journal, vol. 28, no. 4, 2006, pp. 509-512. crossref(new window)

Wooram Son, Hyun-Tae Cho, Kyoungro Yoon, and Seok-Pil Lee, “Sub-fingerprint Masking for a Robust Audio Fingerprinting System in Real-noise Environment for Portable Consumer Devices,” IEEE Transactions on Consumer Electronics, vol. 56, no. 1, 2010, pp. 156-160. crossref(new window)

J. Song, S. Bae, and K. Yoon, “Mid-level music melody representation of polyphonic audio for query-by-humming system,” International Symposium on Music Information Retrieval, 2002.

J. Song, S. Bae, and K. Yoon, “Query by humming: matching humming query to polyphonic audio,” IEEE International Conference on Multimedia and Expo, 2002.

J. Chen, K. Paliwal, and S. Nakamura, “Cepstrum derived from Differentiated Power Spectrum for Robust Speech Recognition,” Speech Communication, vol. 41, 2003, pp. 469-484. crossref(new window)

H.-Y. Jung, “Filtering of Filter-Bank Energies for Robust Speech Recognition,” ETRI Journal, vol. 26, no. 3, 2004, pp. 273-276. crossref(new window)