Effective gene selection method with small sample sets using gradient-based and point injection techniques

Research output: Journal Publications and Reviews; RGC 21 - Publication in refereed journal; peer-review

8 Scopus Citations

Author(s)

  • D. Huang
  • Tommy W. S. Chow

Detail(s)

Original language: English
Pages (from-to): 467-475
Journal / Publication: IEEE/ACM Transactions on Computational Biology and Bioinformatics
Volume: 4
Issue number: 3
Publication status: Published - Jul 2007

Abstract

Microarray gene expression data usually consist of a large number of genes. Among these genes, only a small fraction is informative for performing a cancer diagnostic test. This paper focuses on effective identification of informative genes. We analyze gene selection models from the perspective of optimization theory. As a result, a new strategy is designed to modify conventional search engines. Also, as overfitting is likely to occur in microarray data because of their small sample sets, a point injection technique is developed to address the problem of overfitting. The proposed strategies have been evaluated on three kinds of cancer diagnosis. Our results show that the proposed strategies can improve the performance of gene selection substantially. The experimental results also indicate that the proposed methods are very robust under all of the investigated cases. © 2007 IEEE.

Research Area(s)

  • Gene selection, Gradient-based learning, Optimization theory, Point injection