Representing a web page as sets of named entities of multiple types - a model and-some preliminary applications

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45)32_Refereed conference paper (with ISBN/ISSN)

3 Scopus Citations
View graph of relations

Author(s)

Related Research Unit(s)

Detail(s)

Original languageEnglish
Title of host publicationProceeding of the 17th International Conference on World Wide Web 2008, WWW'08
Pages1099-1100
Publication statusPublished - 2008

Conference

Title17th International Conference on World Wide Web 2008, WWW'08
PlaceChina
CityBeijing
Period21 - 25 April 2008

Abstract

As opposed to representing a document as a "bag of words" in most information retrieval applications, we propose a model of representing a web page as sets of named entities of multiple types. Specifically, four types of named entities are extracted, namely person, geographic location, organization, and time. Moreover, the relations among these entities are also extracted, weighted, classified and marked by labels. On top of this model, some interesting applications are demonstrated. In particular, we introduce a notion of person-activity, which contains four different elements: person, location, time and activity. With this notion and based on a reasonably large set of web pages, we are able to show how one person's activities can be attributed by time and location, which gives a good idea of the mobility of the person under question.

Research Area(s)

  • Named entity, Web content mining, Web page model

Citation Format(s)

Representing a web page as sets of named entities of multiple types - a model and-some preliminary applications. / Di, Nan; Yao, Conglei; Duan, Mengcheng; Zhu, Jonathan J.H.; Li, Xiaoming.

Proceeding of the 17th International Conference on World Wide Web 2008, WWW'08. 2008. p. 1099-1100.

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45)32_Refereed conference paper (with ISBN/ISSN)