TY - JOUR
T1 - Modeling music as a dynamic texture
AU - Barrington, Luke
AU - Chan, Antoni B.
AU - Lanckriet, Gert
PY - 2010
Y1 - 2010/3
AB - We consider representing a short temporal fragment of musical audio as a dynamic texture, a model of both the timbral and rhythmical qualities of sound, two of the important aspects required for automatic music analysis. The dynamic texture model treats a sequence of audio feature vectors as a sample from a linear dynamical system. We apply this new representation to the task of automatic song segmentation. In particular, we cluster audio fragments, extracted from a song, as samples from a dynamic texture mixture (DTM) model. We show that the DTM model can both accurately cluster coherent segments in music and detect transition boundaries. Moreover, the generative character of the proposed model of music makes it amenable to a wide range of applications besides segmentation. As examples, we use DTM models of songs to suggest possible improvements in other music information retrieval applications such as music annotation and similarity.
KW - Automatic segmentation
KW - Dynamic texture model (DTM)
KW - Music modeling
KW - Music similarity
UR - http://www.scopus.com/inward/record.url?scp=76949083546&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-76949083546&origin=recordpage
U2 - 10.1109/TASL.2009.2036306
DO - 10.1109/TASL.2009.2036306
M3 - RGC 21 - Publication in refereed journal
SN - 1558-7916
VL - 18
SP - 602
EP - 612
JO - IEEE Transactions on Audio, Speech and Language Processing
JF - IEEE Transactions on Audio, Speech and Language Processing
IS - 3
ER -