Combining context, consistency, and diversity cues for interactive image categorization
Related Research Unit(s)
|Journal / Publication||IEEE Transactions on Multimedia|
|Publication status||Published - Apr 2010|
|Link to Scopus||https://www.scopus.com/record/display.uri?eid=2-s2.0-77949723232&origin=recordpage|
This paper presents a novel graph-based framework which can combine context, consistency, and diversity cues for interactive image categorization. The image representation is first formed with visual keywords by dividing images into blocks and then performing clustering on these blocks. The context across visual keywords within an image is further captured by proposing a 2-D spatial Markov chain model. To develop a graph-based approach to image categorization, we incorporate intra-image context into a new class of kernel called spatial Markov kernel which can be used to define the affinity matrix for a graph. After graph construction with this kernel, the large unlabeled data can be exploited by graph-based semi-supervised learning through label propagation with inter-image consistency. For interactive image categorization, we further combine this semi-supervised learning with active learning by defining a new diversity-based data selection criterion using spectral embedding. Experiments then demonstrate that the proposed framework can achieve promising results. © 2006 IEEE.
- Active learning, Image categorization, Kernel methods, Markov models, Semi-supervised learning
IEEE Transactions on Multimedia, Vol. 12, No. 3, 5398910, 04.2010, p. 194-203.
Research output: Journal Publications and Reviews (RGC: 21, 22, 62) › 21_Publication in refereed journal
Lu, Z & Ip, HHS 2010, 'Combining context, consistency, and diversity cues for interactive image categorization', IEEE Transactions on Multimedia, vol. 12, no. 3, 5398910, pp. 194-203. https://doi.org/10.1109/TMM.2010.2041100
Lu, Z., & Ip, H. H. S. (2010). Combining context, consistency, and diversity cues for interactive image categorization. IEEE Transactions on Multimedia, 12(3), 194-203. . https://doi.org/10.1109/TMM.2010.2041100
Lu Z, Ip HHS. Combining context, consistency, and diversity cues for interactive image categorization. IEEE Transactions on Multimedia. 2010 Apr;12(3):194-203. 5398910. https://doi.org/10.1109/TMM.2010.2041100