Learning to extract and summarize hot item features from multiple auction web sites
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Pages (from-to) | 143-160 |
Number of pages | 18 |
Journal / Publication | Knowledge and Information Systems |
Volume | 14 |
Issue number | 2 |
Online published | 31 Mar 2007 |
Publication status | Published - Feb 2008 |
Conference
Title | 10th Pacific-Asia Conference on Knowledge Discovery and Data Mining |
---|---|
Place | Singapore |
City | Singapore |
Period | 9 April 2005 - 12 April 2006 |
Link(s)
Abstract
It is difficult to digest the poorly organized and vast amount of information contained in auction Web sites which are fast changing and highly dynamic. We develop a unified framework which can automatically extract product features and summarize hot item features from multiple auction sites. To deal with the irregularity in the layout format of Web pages and harness the uncertainty involved, we formulate the tasks of product feature extraction and hot item feature summarization as a single graph labeling problem using conditional random fields. One characteristic of this graphical model is that it can model the inter-dependence between neighbouring tokens in a Web page, tokens in different Web pages, as well as various information such as hot item features across different auction sites. We have conducted extensive experiments on several real-world auction Web sites to demonstrate the effectiveness of our framework.
Research Area(s)
- information extraction, web mining, conditional random fields, INFORMATION EXTRACTION
Citation Format(s)
Learning to extract and summarize hot item features from multiple auction web sites. / Wong, Tak-Lam; Lam, Wai.
In: Knowledge and Information Systems, Vol. 14, No. 2, 02.2008, p. 143-160.
In: Knowledge and Information Systems, Vol. 14, No. 2, 02.2008, p. 143-160.
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review