Abstract
It has become a promising direction to measure similarity of Web search queries by mining the increasing amount of click-through data logged by Web search engines, which record the interactions between users and the search engines. Most existing approaches employ the click-through data for similarity measure of queries with little consideration of the temporal factor, while the click-through data is often dynamic and contains rich temporal information. In this paper we present a new framework of time-dependent query semantic similarity model on exploiting the temporal characteristics of historical click-through data. The intuition is that more accurate semantic similarity values between queries can be obtained by taking into account the timestamps of the log data. With a set of user-defined calendar schema and calendar patterns, our time-dependent query similarity model is constructed using the marginalized kernel technique, which can exploit both explicit similarity and implicit semantics from the click-through data effectively. Experimental results on a large set of click-through data acquired from a commercial search engine show that our time-dependent query similarity model is more accurate than the existing approaches. Moreover, we observe that our time-dependent query similarity model can, to some extent, reflect real-world semantics such as real-world events that are happening over time.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 15th International Conference on World Wide Web |
| Pages | 543-552 |
| DOIs | |
| Publication status | Published - 2006 |
| Externally published | Yes |
| Event | 15th International Conference on World Wide Web - Edinburgh, Scotland, United Kingdom Duration: 23 May 2006 → 26 May 2006 |
Publication series
| Name | Proceedings of the 15th International Conference on World Wide Web |
|---|
Conference
| Conference | 15th International Conference on World Wide Web |
|---|---|
| Place | United Kingdom |
| City | Edinburgh, Scotland |
| Period | 23/05/06 → 26/05/06 |
Bibliographical note
Publication details (e.g. title, author(s), publication statuses and dates) are captured on an “AS IS” and “AS AVAILABLE” basis at the time of record harvesting from the data source. Suggestions for further amendments or supplementary information can be sent to [email protected].Funding
The work described in this paper was partially supported by two grants, one from the Shun Hing Institute of Advanced Engineering, and the other from the Research Grants Council of Hong Kong S.A.R., China (Project No. CUHK4205/04E).
Research Keywords
- Click-through data
- Event detection
- Evolution pattern
- Marginalized kernel
- Semantic similarity measure
RGC Funding Information
- RGC-funded