TY - GEN
T1 - Multi-modal information retrieval with a semantic view mechanism
AU - Li, Qing
AU - Yang, Jun
AU - Zhuang, Yueting
PY - 2005
Y1 - 2005
N2 - The explosive growth of multimedia information on the Web in recent years calls for an elegant means to model and manage multimedia content to facilitate semantic-level access and sharing across diversified applications. From the perspective of retrieval, the semantics of multimedia data features context-dependency and media-independency; both are inadequately supported by the state-of-the-art data modeling technology. In this paper, we address this problem by advocating MediaView as an extended object-oriented view mechanism to bridge the "semantic gap" between conventional databases and semantics-intensive multimedia applications. This mechanism captures the dynamic semantics of multimedia using a modeling construct named media view (MV), which formulates a customized context where heterogeneous media objects with similar/related semantics are characterized by additional properties and user-defined semantic relationships. View operators are proposed for the manipulation and derivation of individual MVs which can be fit into the desired real-life scenarios automatically. The usefulness and elegancy of MediaView are demonstrated by its applications in various (subjective) activities supporting multi-modal retrieval. © 2005 IEEE.
KW - Context-dependency
KW - Media view
KW - Multi-modal retrieval
UR - http://www.scopus.com/inward/record.url?scp=33744491591&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-33744491591&origin=recordpage
U2 - 10.1109/AINA.2005.255
DO - 10.1109/AINA.2005.255
M3 - RGC 32 - Refereed conference paper (with host publication)
SN - 0769522491
SN - 9780769522494
VL - 1
SP - 133
EP - 138
BT - Proceedings - International Conference on Advanced Information Networking and Applications, AINA
T2 - 19th International Conference on Advanced Information Networking and Applications, AINA 2005
Y2 - 28 March 2005 through 30 March 2005
ER -