Indexing and matching of polyphonic songs for query-by-singing system

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45)32_Refereed conference paper (with host publication)peer-review

View graph of relations

Author(s)

Related Research Unit(s)

Detail(s)

Original languageEnglish
Title of host publicationACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia
Pages308-311
Publication statusPublished - 2004

Conference

TitleACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia
PlaceUnited States
CityNew York, NY
Period10 - 16 October 2004

Abstract

This paper investigates the issues in polyphonic popular song retrieval. The problems that we consider include singing voice extraction, melodic curve representation, and database indexing. Initially, polyphonic songs are decomposed into singing voices and instruments sounds in both time and frequency domains based on SVM and ICA. The extracted singing voices are represented as two melodic curves that model the statistical mean and neighborhood similarity of notes. To speed up the matching between songs and query, we further adopt proportional transportation distance to index the songs as vantage point trees. Encouraging results have been obtained through experiments.

Research Area(s)

  • ICA, Melodic Curve, Proportional Transportation Distance

Citation Format(s)

Indexing and matching of polyphonic songs for query-by-singing system. / Leung, Tat-Wan; Ngo, Chong-Wah.
ACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia. 2004. p. 308-311.

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45)32_Refereed conference paper (with host publication)peer-review