Ontology-enriched semantic space for video search

Xiao-Yong Wei, Chong-Wah Ngo

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

32 Citations (Scopus)

Abstract

Multimedia-based ontology construction and reasoning have recently been recognized as two important issues in video search, particularly for bridging semantic gap. The lack of coincidence between low-level features and user expectation makes concept-based ontology reasoning an attractive mid-level framework for interpreting high-level semantics. In this paper, we propose a novel model, namely ontology-enriched semantic space (OSS), to provide a computable platform for modeling and reasoning concepts in a linear space. OSS enlightens the possibility of answering conceptual questions such as a high coverage of semantic space with minimal set of concepts, and the set of concepts to be developed for video search. More importantly, the query-to-concept mapping can be more reasonably conducted by guaranteeing the uniform and consistent comparison of concept scores for video search. We explore OSS for several tasks including concept-based video search, word sense disambiguation and multi-modality fusion. Our empirical findings show that OSS is a feasible solution to timely issues such as the measurement of concept combination and query-concept dependent fusion. Copyright 2007 ACM.
Original languageEnglish
Title of host publicationProceedings of the ACM International Multimedia Conference and Exhibition
Pages981-990
DOIs
Publication statusPublished - 2007
Event15th ACM International Conference on Multimedia, MM'07 - Augsburg, Bavaria, Germany
Duration: 24 Sept 200729 Sept 2007

Conference

Conference15th ACM International Conference on Multimedia, MM'07
Country/TerritoryGermany
CityAugsburg, Bavaria
Period24/09/0729/09/07

Research Keywords

  • Concept-based video search
  • Ontology
  • Semantic space

Fingerprint

Dive into the research topics of 'Ontology-enriched semantic space for video search'. Together they form a unique fingerprint.

Cite this