Learning to extract and summarize hot item features from multiple auction web sites

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

32 Scopus Citations
View graph of relations

Author(s)

  • Tak-Lam Wong
  • Wai Lam

Related Research Unit(s)

Detail(s)

Original languageEnglish
Pages (from-to)143-160
Number of pages18
Journal / PublicationKnowledge and Information Systems
Volume14
Issue number2
Online published31 Mar 2007
Publication statusPublished - Feb 2008

Conference

Title10th Pacific-Asia Conference on Knowledge Discovery and Data Mining
PlaceSingapore
CitySingapore
Period9 April 2005 - 12 April 2006

Abstract

It is difficult to digest the poorly organized and vast amount of information contained in auction Web sites which are fast changing and highly dynamic. We develop a unified framework which can automatically extract product features and summarize hot item features from multiple auction sites. To deal with the irregularity in the layout format of Web pages and harness the uncertainty involved, we formulate the tasks of product feature extraction and hot item feature summarization as a single graph labeling problem using conditional random fields. One characteristic of this graphical model is that it can model the inter-dependence between neighbouring tokens in a Web page, tokens in different Web pages, as well as various information such as hot item features across different auction sites. We have conducted extensive experiments on several real-world auction Web sites to demonstrate the effectiveness of our framework.

Research Area(s)

  • information extraction, web mining, conditional random fields, INFORMATION EXTRACTION

Citation Format(s)

Learning to extract and summarize hot item features from multiple auction web sites. / Wong, Tak-Lam; Lam, Wai.
In: Knowledge and Information Systems, Vol. 14, No. 2, 02.2008, p. 143-160.

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review