Skip to main navigation Skip to search Skip to main content

Enhancing density-based data reduction using entropy

D. Huang, Tommy W. S. Chow

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

Data reduction algorithms determine a small data subset from a given large data set. In this article, new types of data reduction criteria, based on the concept of entropy, are first presented. These criteria can evaluate the data reduction performance in a sophisticated and comprehensive way. As a result, new data reduction procedures are developed. Using the newly introduced criteria, the proposed data reduction scheme is shown to be efficient and effective. In addition, an outlier-filtering strategy, which is computationally insignificant, is developed. In some instances, this strategy can substantially improve the performance of supervised data analysis. The proposed procedures are compared with related techniques in two types of application: density estimation and classification. Extensive comparative results are included to corroborate the contributions of the proposed algorithms. © 2005 Massachusetts Institute of Technology.
Original languageEnglish
Pages (from-to)470-495
JournalNeural Computation
Volume18
Issue number2
DOIs
Publication statusPublished - Feb 2006

Fingerprint

Dive into the research topics of 'Enhancing density-based data reduction using entropy'. Together they form a unique fingerprint.

Cite this