A bag of systems representation for music auto-tagging

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

17 Scopus Citations
View graph of relations

Author(s)

Related Research Unit(s)

Detail(s)

Original languageEnglish
Article number6583960
Pages (from-to)2554-2569
Journal / PublicationIEEE Transactions on Audio, Speech and Language Processing
Volume21
Issue number12
Publication statusPublished - 2013

Link(s)

Abstract

We present a content-based automatic tagging system for music that relies on a high-level, concise "Bag of Systems" (BoS) representation of the characteristics of a musical piece. The BoS representation leverages a rich dictionary of musical codewords, where each codeword is a generative model that captures timbral and temporal characteristics of music. Songs are represented as a BoS histogram over codewords, which allows for the use of traditional algorithms for text document retrieval to perform auto-tagging. Compared to estimating a single generative model to directly capture the musical characteristics of songs associated with a tag, the BoS approach offers the flexibility to combine different generative models at various time resolutions through the selection of the BoS codewords. Additionally, decoupling the modeling of audio characteristics from the modeling of tag-specific patterns makes BoS a more robust and rich representation of music. Experiments show that this leads to superior auto-tagging performance. © 2006-2012 IEEE.

Research Area(s)

  • Audio annotation and retrieval, bag of systems, content-based music processing, dynamic texture model, music information retrieval

Citation Format(s)

A bag of systems representation for music auto-tagging. / Ellis, Katherine; Coviello, Emanuele; Chan, Antoni B. et al.
In: IEEE Transactions on Audio, Speech and Language Processing, Vol. 21, No. 12, 6583960, 2013, p. 2554-2569.

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Download Statistics

No data available