An extractive text summarizer based on significant words

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45)32_Refereed conference paper (with ISBN/ISSN)peer-review

16 Scopus Citations
View graph of relations

Author(s)

Detail(s)

Original languageEnglish
Title of host publicationComputer Processing of Oriental Languages: Language Technology for the Knowledge-based Economy
Subtitle of host publication22nd International Conference, ICCPOL 2009, Proceedings
PublisherSpringer Verlag
Pages168-178
Volume5459 LNAI
ISBN (Print)3642008305, 9783642008306
Publication statusPublished - 2009

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume5459 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Title22nd International Conference on Computer Processing of Oriental Languages, ICCPOL 2009
PlaceHong Kong
Period26 - 27 March 2009

Abstract

Document summarization can be viewed as a reductive distilling of source text through content condensation, while words with high quantities of information are believed to carry more content and thereby importance. In this paper, we propose a new quantification measure for word significance used in natural language processing (NLP) tasks, and successfully apply it to an extractive text summarization approach. In a query-based summarization setting, the correlation between user queries and sentences to be scored is established from both the micro (i.e. at the word level) and the macro (i.e. at the sentence level) perspectives, resulting in an effective ranking formula. The experiments, both on a generic single document summarization evaluation, and on a query-based multi-document evaluation, verify the effectiveness of the proposed measures and show that the proposed approach achieves a state-of-the-art performance. © 2009 Springer Berlin Heidelberg.

Citation Format(s)

An extractive text summarizer based on significant words. / Liu, Xiaoyue; Webster, Jonathan J.; Kit, Chunyu.

Computer Processing of Oriental Languages: Language Technology for the Knowledge-based Economy: 22nd International Conference, ICCPOL 2009, Proceedings. Vol. 5459 LNAI Springer Verlag, 2009. p. 168-178 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5459 LNAI).

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45)32_Refereed conference paper (with ISBN/ISSN)peer-review