Time-dependent semantic similarity measure of queries using historical click-through data

Qiankun Zhao, Steven C. H. Hoi, Tie-Yan Liu, Sourav S. Bhowmick, Michael R. Lyu, Wei-Ying Ma

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

40 Citations (Scopus)

Abstract

It has become a promising direction to measure similarity of Web search queries by mining the increasing amount of click-through data logged by Web search engines, which record the interactions between users and the search engines. Most existing approaches employ the click-through data for similarity measure of queries with little consideration of the temporal factor, while the click-through data is often dynamic and contains rich temporal information. In this paper we present a new framework of time-dependent query semantic similarity model on exploiting the temporal characteristics of historical click-through data. The intuition is that more accurate semantic similarity values between queries can be obtained by taking into account the timestamps of the log data. With a set of user-defined calendar schema and calendar patterns, our time-dependent query similarity model is constructed using the marginalized kernel technique, which can exploit both explicit similarity and implicit semantics from the click-through data effectively. Experimental results on a large set of click-through data acquired from a commercial search engine show that our time-dependent query similarity model is more accurate than the existing approaches. Moreover, we observe that our time-dependent query similarity model can, to some extent, reflect real-world semantics such as real-world events that are happening over time.
Original languageEnglish
Title of host publicationProceedings of the 15th International Conference on World Wide Web
Pages543-552
DOIs
Publication statusPublished - 2006
Externally publishedYes
Event15th International Conference on World Wide Web - Edinburgh, Scotland, United Kingdom
Duration: 23 May 200626 May 2006

Publication series

NameProceedings of the 15th International Conference on World Wide Web

Conference

Conference15th International Conference on World Wide Web
PlaceUnited Kingdom
CityEdinburgh, Scotland
Period23/05/0626/05/06

Bibliographical note

Publication details (e.g. title, author(s), publication statuses and dates) are captured on an “AS IS” and “AS AVAILABLE” basis at the time of record harvesting from the data source. Suggestions for further amendments or supplementary information can be sent to [email protected].

Funding

The work described in this paper was partially supported by two grants, one from the Shun Hing Institute of Advanced Engineering, and the other from the Research Grants Council of Hong Kong S.A.R., China (Project No. CUHK4205/04E).

Research Keywords

  • Click-through data
  • Event detection
  • Evolution pattern
  • Marginalized kernel
  • Semantic similarity measure

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'Time-dependent semantic similarity measure of queries using historical click-through data'. Together they form a unique fingerprint.

Cite this