A two-stage information filtering based on rough decision rule and pattern mining
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Pages (from-to) | 326-332 |
Journal / Publication | Journal of Emerging Technologies in Web Intelligence |
Volume | 2 |
Issue number | 4 |
Publication status | Published - Nov 2010 |
Link(s)
Abstract
Information Overload and Mismatch are two fundamental problems affecting the effectiveness of information filtering systems. Even though both term-based and patternbased approaches have been proposed to address the problems of overload and mismatch, neither of these approaches alone can provide a satisfactory solution to address these problems. This paper presents a novel two-stage information filtering model which combines the merits of term-based and pattern-based approaches to effectively filter sheer volume of information. In particular, the first filtering stage is supported by a novel rough analysis model which efficiently removes a large number of irrelevant documents, thereby addressing the overload problem. The second filtering stage is empowered by a semantically rich pattern taxonomy mining model which effectively fetches incoming documents according to the specific information needs of a user, thereby addressing the mismatch problem. The experimental results based on the RCV1 corpus show that the proposed twostage filtering model significantly outperforms the both termbased and pattern-based information filtering models. © 2010 ACADEMY PUBLISHER.
Research Area(s)
- Information filtering, Pattern mining, Rough set theory, User profiles
Citation Format(s)
A two-stage information filtering based on rough decision rule and pattern mining. / Zhou, Xujuan; Li, Yuefeng; Bruza, Peter et al.
In: Journal of Emerging Technologies in Web Intelligence, Vol. 2, No. 4, 11.2010, p. 326-332.
In: Journal of Emerging Technologies in Web Intelligence, Vol. 2, No. 4, 11.2010, p. 326-332.
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review